Microsoft Phi-3 Vision-the first Multimodal model By Microsoft- Demo With Huggingface
ฝัง
- เผยแพร่เมื่อ 22 พ.ค. 2024
- Phi-3-vision is the first multimodal model in the Phi-3 family, bringing together text and images, and the ability to reason over real-world images and extract and reason over text from images. It has also been optimized for chart and diagram understanding and can be used to generate insights and answer questions. Phi-3-vision builds on the language capabilities of the Phi-3-mini, continuing to pack strong language and image reasoning quality in a small model.
Phi-3-vision can generate insights from charts and diagrams:
Code Link: colab.research.google.com/dri...
----------------------------------------------------------------------------------------
Support me by joining membership so that I can upload these kind of videos
/ @krishnaik06
-----------------------------------------------------------------------------------
►GenAI on AWS Cloud Playlist: • Generative AI In AWS-A...
►Llamindex Playlist: • Announcing LlamaIndex ...
►Google Gemini Playlist: • Google Is On Another L...
►Langchain Playlist: • Amazing Langchain Seri...
►Data Science Projects:
• Now you Can Crack Any ...
►Learn In One Tutorials
Statistics in 6 hours: • Complete Statistics Fo...
End To End RAG LLM APP Using LlamaIndex And OpenAI- Indexing And Querying Multiple Pdf's
Machine Learning In 6 Hours: • Complete Machine Learn...
Deep Learning 5 hours : • Deep Learning Indepth ...
►Learn In a Week Playlist
Statistics: • Live Day 1- Introducti...
Machine Learning : • Announcing 7 Days Live...
Deep Learning: • 5 Days Live Deep Learn...
NLP : • Announcing NLP Live co...
---------------------------------------------------------------------------------------------------
My Recording Gear
Laptop: amzn.to/4886inY
Office Desk : amzn.to/48nAWcO
Camera: amzn.to/3vcEIHS
Writing Pad:amzn.to/3OuXq41
Monitor: amzn.to/3vcEIHS
Audio Accessories: amzn.to/48nbgxD
Audio Mic: amzn.to/48nbgxD
Join my data science community discord group where we discuss many things. Happy Learning!!
discord.gg/u7q6ZNSH
It's a great model. Very useful. Thanks Krish.
Thanks, sir I have watched your videos and I learned a lot
Awesome content Krish, you are really inspiring a generation who are interested in genai
Thank you for your contribution to the open source community. Pls make video on crewai agents creation
Amazing stuff !❤
thanks again sir!
Hello Krish, thank you so much for the amazing video. Can you please make a video explaining the architecture of multimodal LLMs?
Hey Krish!! Please start a playlist on evaluation methods and techniques of LLM applications please.
Hi Krish, can you please do an end-to-end ML model or project using Kubernetes? Every company is asking about deploy, deploy, deploy and they want us to have practical experience using Kubernetes. Something more than just a basic tutorial.
🙏💯👍
Is there any future plan to create a hugging face course
Yes comin up soon
Can you do the equivalent but simpler with the new Hugging face Langchain SDK?
I was stucked here if anyone guide me i will get good idea…. Thanks krish ❤
Thanks a lot sir , I have learned much about a.i from you For 1 year almost. And I'm upgrading my pc for a.i workflow, which setup should I consider single GPU or multi GPU.
Please guide me sir , I want to become A.I Engineer.
I think it needs A100. Is it possible to run in free GPUs
Please be sharing the link to the codes in the video
Sir talk about alpha fold 3
5:15 its 'CAUSAL' and NOT 'Casual'