7 measurements that help minimize model risk for RAG

แชร์
ฝัง
  • เผยแพร่เมื่อ 1 ต.ค. 2024

ความคิดเห็น • 11

  • @jojoabing1
    @jojoabing1 5 หลายเดือนก่อน +12

    Very nice informative video, enjoyed it a lot. Would have liked to see a bit more on how to calculate and implement some of these metrics though. For example how the hallucinations are quantified, since it seems to me it's a very difficult thing to measure.

  • @jrcasanov4
    @jrcasanov4 4 หลายเดือนก่อน +1

    Hi, fellow IBMer here. Congratulations on all the achievements in the Generative AI space. I've got a question: how do you calculate ROUGE, BLEU and other reference-dependent metrics when in production, where you don't have an expected example to draw from?

  • @LaurelPapworth
    @LaurelPapworth 5 หลายเดือนก่อน +2

    THANK YOU! I find your tutorials helpful and informative and … not full of fluff! xx ❤️❤️

  • @hi5wifi-s567
    @hi5wifi-s567 2 หลายเดือนก่อน

    Proper training for both human and machine for somehow is out of control, don’t you agree? How you manage that?

  • @Roy-h2q
    @Roy-h2q 5 หลายเดือนก่อน

    I havne't play around with LLM + RAG , but when i think about it , sounds like i can just use LLM and pair it with my office's wiki, then i can chat to get my information !! purrrffecto !

  • @stunspot
    @stunspot 5 หลายเดือนก่อน

    RAG is just so poorly done most places. Azure is ok. I know Microsoft's ex cto Sirosh who built their cognitive search. They are the only ones i've found who don't suck horribly. And don't even talk to me about OpenAI's Knowledge Bases or things will get vitriolic and scatalogical very quickly.

  • @amritbro
    @amritbro 5 หลายเดือนก่อน

    So much important concept to keep up to date any LLMs with the information from the internet.

  • @mmclean0
    @mmclean0 5 หลายเดือนก่อน

    Good video - I like IBM’s approach to day 2 model operations. Their automated monitoring around llms builds on their leading approach for monitoring/versioning of traditional ML models. Great stuff, Briana!

  • @amritsubramanian8384
    @amritsubramanian8384 5 หลายเดือนก่อน

    Hey then what is RAGA about

  • @samfranian7857
    @samfranian7857 5 หลายเดือนก่อน

    BLEU = (bilingual evaluation understudy)
    en.wikipedia.org/wiki/BLEU
    ROUGE = (Recall-Oriented Understudy for Gisting Evaluation)
    en.wikipedia.org/wiki/ROUGE_(metric)
    Many thanks for your wonderful video! 🙏🙏🙏