LLM Evaluation With MLFLOW And Dagshub For Generative AI Application

  • Published on 1 Jun 2024
  • With the emergence of ChatGPT, LLMs have shown their power in text generation across various fields, such as question answering, translation and text summarization. Evaluating an LLM's performance is different from evaluating traditional ML models, because very often there is no single ground truth to compare against. MLflow provides an API, mlflow.evaluate(), to help evaluate your LLMs.
    mlflow.org/docs/latest/llms/l...
    Code:github.com/krishnaik06/MLFLOW...
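To make the "no single ground truth" point concrete, here is a minimal sketch in plain Python. It is not MLflow's implementation and does not call mlflow.evaluate(); it only illustrates one common workaround, scoring a prediction against several acceptable reference answers with a token-overlap F1 and keeping the best match. The function names (tokenize, token_f1, best_reference_f1) and the sample strings are hypothetical, chosen for illustration.

```python
from __future__ import annotations

import re
from collections import Counter


def tokenize(text: str) -> list[str]:
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def token_f1(prediction: str, reference: str) -> float:
    """Token-overlap F1 between one prediction and one reference answer."""
    pred, ref = tokenize(prediction), tokenize(reference)
    if not pred or not ref:
        # Two empty strings count as a match; one empty string does not.
        return float(pred == ref)
    # Multiset intersection counts each shared token at most as often
    # as it appears in both strings.
    overlap = sum((Counter(pred) & Counter(ref)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred)
    recall = overlap / len(ref)
    return 2 * precision * recall / (precision + recall)


def best_reference_f1(prediction: str, references: list[str]) -> float:
    """Score against every acceptable answer and keep the best score."""
    return max(token_f1(prediction, ref) for ref in references)


if __name__ == "__main__":
    # Hypothetical example: two differently worded but acceptable answers.
    references = ["The capital is Paris", "Paris"]
    prediction = "Paris"
    print("exact match with first reference:", prediction == references[0])
    print("best token F1 over all references:",
          best_reference_f1(prediction, references))
```

An exact string comparison against only the first reference would reject this prediction, while the multi-reference score accepts it. Metrics of this kind are a rough proxy only; they say nothing about fluency, toxicity, or factual correctness.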
    ---------------------------------------------------------------------------------------------
    Support me by joining the membership so that I can upload these kinds of videos
    / @krishnaik06
    -----------------------------------------------------------------------------------
    ►GenAI on AWS Cloud Playlist: • Generative AI In AWS-A...
    ►LlamaIndex Playlist: • Announcing LlamaIndex ...
    ►Google Gemini Playlist: • Google Is On Another L...
    ►Langchain Playlist: • Amazing Langchain Seri...
    ►Data Science Projects:
    • Now you Can Crack Any ...
    ►Learn In One Tutorials
    Statistics in 6 hours: • Complete Statistics Fo...
    End To End RAG LLM APP Using LlamaIndex And OpenAI- Indexing And Querying Multiple Pdf's
    Machine Learning In 6 Hours: • Complete Machine Learn...
    Deep Learning 5 hours : • Deep Learning Indepth ...
    ►Learn In a Week Playlist
    Statistics: • Live Day 1- Introducti...
    Machine Learning : • Announcing 7 Days Live...
    Deep Learning: • 5 Days Live Deep Learn...
    NLP : • Announcing NLP Live co...
    ---------------------------------------------------------------------------------------------------
    My Recording Gear
    Laptop: amzn.to/4886inY
    Office Desk : amzn.to/48nAWcO
    Camera: amzn.to/3vcEIHS
    Writing Pad:amzn.to/3OuXq41
    Monitor: amzn.to/3vcEIHS
    Audio Accessories: amzn.to/48nbgxD
    Audio Mic: amzn.to/48nbgxD

Comments • 20

  • @krishnaik06
    @krishnaik06  several months ago +11

    Subscribe if you want to become a Data Scientist :)

    • @jaisingh1292
      @jaisingh1292 several months ago

      Hi Krish, can you please take a look at DVC and MLflow, and whether they can be combined and used for gen AI? The demand for LLMOps is increasing. For cloud it's fine, but industries want on-prem solutions as well. It would be great if you could make a video on this.

    • @DarkShadow-bq5yb
      @DarkShadow-bq5yb several months ago

      can you create video for Data engineering using LLMs

    • @HimanshuGupta-ps3ib
      @HimanshuGupta-ps3ib several months ago

      Great video Krish! Watching your content from USA.

  • @YorkYongYeo
    @YorkYongYeo several months ago

    thanks for sharing this! just what i needed for reference. Liked and looking forward to the one for RAG evaluation

  • @surbhirohilla5139
    @surbhirohilla5139 several months ago

    You deserve more appreciation for this

  • @2dapoint424
    @2dapoint424 several months ago +2

    Krish, @10:15 in VS code what is that cat like icon on top right next to the run button?

  • @pratiksitapara8962
    @pratiksitapara8962 several months ago

    Great video on Evals! Thanks!
    But it would be great if you could make a video on evaluating RAG with traditional metrics. What insights can we get by using traditional evals or automatic evals? What are the current SOTA methods for RAG evals?

  • @GiovanneAfonso
    @GiovanneAfonso several months ago

    Incredible content, thank you +1sub

  • @user-nr5mh9vz7i
    @user-nr5mh9vz7i several months ago

    Hi Krish ! This is excellent video, will you be able to make a video on Azure AI Prompt flow, please?

  • @pavanpraneeth4659
    @pavanpraneeth4659 several months ago

    Awesome! In future, please show how to integrate this with AWS Bedrock.

  • @prayagbrahmbhatt6375
    @prayagbrahmbhatt6375 several months ago

    Need a video explanation on promptfoo, an opensource LLM evaluation library.

  • @arpitqw1
    @arpitqw1 several months ago

    What is the use of DagsHub? One can see the same in a local tracking server.

  • @shriharinair1999
    @shriharinair1999 several months ago +1

    can you from now on focus more on non paid apis?

  • @RishiRajxtrim
    @RishiRajxtrim several months ago

    Good evening

  • @ashishdayal172
    @ashishdayal172 several months ago

    sounds like tensorflow hub

  • @ridj41
    @ridj41 several months ago

    Krish honestly the content has become a bit uninteresting tbh from the past few videos.
    Would love if you try out some new kind of projects, some advanced ones to use in our resumes using GenAi

    • @krishnaik06
      @krishnaik06  several months ago +3

      brother this is really important. I usually make all the videos based on the job market :)