Microsoft Phi-3 Vision-the first Multimodal model By Microsoft- Demo With Huggingface

แชร์
ฝัง
  • เผยแพร่เมื่อ 22 พ.ค. 2024
  • Phi-3-vision is the first multimodal model in the Phi-3 family, bringing together text and images, and the ability to reason over real-world images and extract and reason over text from images. It has also been optimized for chart and diagram understanding and can be used to generate insights and answer questions. Phi-3-vision builds on the language capabilities of the Phi-3-mini, continuing to pack strong language and image reasoning quality in a small model.
    Phi-3-vision can generate insights from charts and diagrams:
    Code Link: colab.research.google.com/dri...
    ----------------------------------------------------------------------------------------
    Support me by joining membership so that I can upload these kind of videos
    / @krishnaik06
    -----------------------------------------------------------------------------------
    ►GenAI on AWS Cloud Playlist: • Generative AI In AWS-A...
    ►Llamindex Playlist: • Announcing LlamaIndex ...
    ►Google Gemini Playlist: • Google Is On Another L...
    ►Langchain Playlist: • Amazing Langchain Seri...
    ►Data Science Projects:
    • Now you Can Crack Any ...
    ►Learn In One Tutorials
    Statistics in 6 hours: • Complete Statistics Fo...
    End To End RAG LLM APP Using LlamaIndex And OpenAI- Indexing And Querying Multiple Pdf's
    Machine Learning In 6 Hours: • Complete Machine Learn...
    Deep Learning 5 hours : • Deep Learning Indepth ...
    ►Learn In a Week Playlist
    Statistics: • Live Day 1- Introducti...
    Machine Learning : • Announcing 7 Days Live...
    Deep Learning: • 5 Days Live Deep Learn...
    NLP : • Announcing NLP Live co...
    ---------------------------------------------------------------------------------------------------
    My Recording Gear
    Laptop: amzn.to/4886inY
    Office Desk : amzn.to/48nAWcO
    Camera: amzn.to/3vcEIHS
    Writing Pad:amzn.to/3OuXq41
    Monitor: amzn.to/3vcEIHS
    Audio Accessories: amzn.to/48nbgxD
    Audio Mic: amzn.to/48nbgxD

ความคิดเห็น • 22

  • @krishnaik06
    @krishnaik06  หลายเดือนก่อน +3

    Join my data science community discord group where we discuss many things. Happy Learning!!
    discord.gg/u7q6ZNSH

  • @KevinKreger
    @KevinKreger หลายเดือนก่อน

    It's a great model. Very useful. Thanks Krish.

  • @mishrajii5298
    @mishrajii5298 หลายเดือนก่อน +1

    Thanks, sir I have watched your videos and I learned a lot

  • @AshrafAli-yb4tl
    @AshrafAli-yb4tl หลายเดือนก่อน

    Awesome content Krish, you are really inspiring a generation who are interested in genai

  • @raph8240
    @raph8240 หลายเดือนก่อน +1

    Thank you for your contribution to the open source community. Pls make video on crewai agents creation

  • @twinklepardeshi3113
    @twinklepardeshi3113 หลายเดือนก่อน

    Amazing stuff !❤

  • @ashraf_isb
    @ashraf_isb หลายเดือนก่อน

    thanks again sir!

  • @smitparikh3969
    @smitparikh3969 หลายเดือนก่อน

    Hello Krish, thank you so much for the amazing video. Can you please make a video explaining the architecture of multimodal LLMs?

  • @maheshkuttymarar2694
    @maheshkuttymarar2694 หลายเดือนก่อน

    Hey Krish!! Please start a playlist on evaluation methods and techniques of LLM applications please.

  • @lenovo57787
    @lenovo57787 หลายเดือนก่อน +1

    Hi Krish, can you please do an end-to-end ML model or project using Kubernetes? Every company is asking about deploy, deploy, deploy and they want us to have practical experience using Kubernetes. Something more than just a basic tutorial.

  • @rishiraj2548
    @rishiraj2548 หลายเดือนก่อน

    🙏💯👍

  • @Aditya-on9ro
    @Aditya-on9ro หลายเดือนก่อน +1

    Is there any future plan to create a hugging face course

    • @krishnaik06
      @krishnaik06  หลายเดือนก่อน +1

      Yes comin up soon

  • @IdPreferNot1
    @IdPreferNot1 หลายเดือนก่อน

    Can you do the equivalent but simpler with the new Hugging face Langchain SDK?

  • @commoncats5437
    @commoncats5437 หลายเดือนก่อน

    I was stucked here if anyone guide me i will get good idea…. Thanks krish ❤

  • @moderx
    @moderx หลายเดือนก่อน

    Thanks a lot sir , I have learned much about a.i from you For 1 year almost. And I'm upgrading my pc for a.i workflow, which setup should I consider single GPU or multi GPU.

    • @moderx
      @moderx หลายเดือนก่อน

      Please guide me sir , I want to become A.I Engineer.

  • @amitguitarist2008
    @amitguitarist2008 หลายเดือนก่อน

    I think it needs A100. Is it possible to run in free GPUs

  • @ibrahimmuhammad5414
    @ibrahimmuhammad5414 หลายเดือนก่อน

    Please be sharing the link to the codes in the video

  • @RamaChandran-fc3hp
    @RamaChandran-fc3hp หลายเดือนก่อน

    Sir talk about alpha fold 3

  • @sameerjadhav5603
    @sameerjadhav5603 หลายเดือนก่อน

    5:15 its 'CAUSAL' and NOT 'Casual'