MLOps World: Machine Learning in Production

360
73 179

Running prompts at CI does not make your GenAI app enterprise ready

32:37

The BEST component for your RAG system

44:56

Why AI apps don't work in prod: AI Reliability Survey

32:25

What It Actually Takes to Deploy GenAI Applications to Enterprises Custom Evaluation Models

48:45

Lessons learned from scaling large language models in production

40:18

From Idea to Production: AI Infra for Scaling LLM Apps

38:26

Making Enterprise GenAI Safe and Effective - Tools and Approaches

Speakers:
Rahm Hafiz, CTO, AutoAlign AI
Dan Adamson, Interim Chief Executive Officer and Co-Founder, AutoAlign AI
AutoAlign CTO Rahm Hafiz will show how different approaches (finetuning, moderation guardrails, and sidecars) can be used to deploy AI safely. Rahm will show setting up a sidecar and showing how it can be used as an automated guardrail system that dynamically interacts with LLMs to make them safe, effective, and compliant without losing efficacy and without having to retune every time your LLM changes.

มุมมอง: 82

วีดีโอ

Running prompts at CI does not make your GenAI app enterprise ready

32:37

Running prompts at CI does not make your GenAI app enterprise ready

มุมมอง 1364 หลายเดือนก่อน

Speaker: Jakob Frick, CTO, Radiant AI

44:56

The BEST component for your RAG system

มุมมอง 3934 หลายเดือนก่อน

Speaker: Jeffrey Kim, AutoRAG Lead Dev, Markr Inc. In this session, I will talk about the importance of optimization of the RAG system. And tell you how to use AutoRAG to automatically optimize the RAG system for your data briefly. It will lead you to boost RAG performance quickly and easily. There are many RAG pipelines and modules out there, but you don’t know what pipeline is great for “your...

Why AI apps don't work in prod: AI Reliability Survey

32:25

Why AI apps don't work in prod: AI Reliability Survey

มุมมอง 704 หลายเดือนก่อน

Speaker: Shreya Rajpal, CEO, Guardrails AI Despite the initial frenzy around the impact of AI on software projects, the actualized impact remains limited. This is in large part because AI has inherent variability which makes engineering orgs stumped with the dreaded question "how do I know it won't break in prod even though it works in dev". In this talk, Shreya will cover why reliability for A...

What It Actually Takes to Deploy GenAI Applications to Enterprises Custom Evaluation Models

48:45

What It Actually Takes to Deploy GenAI Applications to Enterprises Custom Evaluation Models

มุมมอง 1214 หลายเดือนก่อน

Speaker: Alexander Kvamme, CEO, Echo AI Arjun Bansal, CEO & Co-founder, Log10 Alexander Kvamme and Arjun Bansal will share Echo AI's journey in deploying their conversational intelligence platform to billion-dollar retail brands. They will discuss the challenges faced due to LLM accuracy issues, which impacted their ability to deploy at scale. The speakers will speak about the iterative prompt ...

Lessons learned from scaling large language models in production

40:18

Lessons learned from scaling large language models in production

มุมมอง 1464 หลายเดือนก่อน

Speaker: Matt Squire, CTO, Fuzzy Labs Open source models have made running your own LLM accessible many people. It's pretty straightforward to set up a model like Mistral, with a vector database, and build your own RAG application. But making it scale to high traffic demands is another story. LLM inference itself is slow, and GPUs are expensive, so we can't simply throw hardware at the problem....

From Idea to Production: AI Infra for Scaling LLM Apps

38:26

From Idea to Production: AI Infra for Scaling LLM Apps

มุมมอง 2364 หลายเดือนก่อน

Speaker: Guy Eshet, Product manager, Qwak AI applications have to adapt to new models, more stakeholders and complex workflows that are difficult to debug. Add prompt management, data pipelines, RAG, cost optimization, and GPU availability into the mix, and you're in for a ride. How do you smoothly bring LLM applications from Beta to Production? What AI infrastructure is required? Join Guy in t...

LLM Fine-Tuning for Modern AI Teams: How One E-Commerce Unicorn Cut Inference Cost by 90%

46:11

LLM Fine-Tuning for Modern AI Teams: How One E-Commerce Unicorn Cut Inference Cost by 90%

มุมมอง 1094 หลายเดือนก่อน

Speaker: Emmanuel Turlay, CEO/Founder, Airtrain AI While commercial LLMs such as GPT-4 and Claude 3 Opus offer amazing generative quality, small open-source fine-tuned models such as Mistral 7B and Phi-2/3 can offer similar performance on specific tasks, for a fraction of the cost, and with much more control. However, this has been proven to be true only when the tuning dataset is of high quali...

Function Calling for LLMs: RAG without a Vector Database

39:02

Function Calling for LLMs: RAG without a Vector Database

มุมมอง 2484 หลายเดือนก่อน

Speaker: Jim Dowling, CEO, Hopsworks In this talk, we will look at extending RAG with Function Calling to access structured/tabular data. We will look at how to enrich your tables with metadata and the expressivity of the queries that you can reasonably expect to perform well. We will examine function calling in the context of queries to the Hopsworks feature store, that supports extensive meta...

Finding training inefficiencies with CentML DeepView

40:07

Finding training inefficiencies with CentML DeepView

มุมมอง 434 หลายเดือนก่อน

Speaker: Yubo Gao, Research Software Development Engineer at CentML Inc, and PhD student at University of Toronto, CentML Inc. Performance bottlenecks and resource underutilization is a common occurrence to deep learning researchers and developers. They slow down workflows of ML developers and waste computational resources. The current ecosystems of DL profilers do not provide a developer-frien...

Evaluating LLMs and RAG Pipelines at Scale

35:25

Evaluating LLMs and RAG Pipelines at Scale

มุมมอง 3814 หลายเดือนก่อน

Speakers: Eric O. Korman, Cofounder / Chief Science Officer, Striveworks Large Language Models (LLMs) and their applications, such as Retrieval-Augmented Generation (RAG) pipelines, present unique evaluation challenges due to the often unstructured nature of their outputs. These challenges are compounded by the variety of moving parts and parameters involved, such as the choice of underlying LL...

Empowering Data Science Teams: Harnessing AI with Appen

30:14

Empowering Data Science Teams: Harnessing AI with Appen

มุมมอง 344 หลายเดือนก่อน

Speakers: Sasha McGrath, Account Executive, Appen Geoff LaPorte, Adoption Program Manager, Applied AI, Appen In an era driven by data and powered by artificial intelligence, the effectiveness of data science teams hinges upon access to high-quality data and robust collaboration tools. Our presentation unveils a comprehensive platform designed to revolutionize how data science projects are execu...

Better Chatbots with Advanced RAG Techniques

49:12

Better Chatbots with Advanced RAG Techniques

มุมมอง 3344 หลายเดือนก่อน

Speaker: Zain Hasan, Developer Advocate, Weaviate Chatbots are becoming increasingly popular for interacting with users, providing information, entertainment, and assistance. However, building chatbots that can handle diverse and complex user queries is still a challenging task. One of the main difficulties is finding relevant and reliable information from large and noisy data sources. In this ...

Enhance Cost Efficiency in Domain Adaptation with PruneMe

16:41

Enhance Cost Efficiency in Domain Adaptation with PruneMe

มุมมอง 644 หลายเดือนก่อน

Speaker: Shamane Siri, Ph.D. , Head of Applied NLP Research, Arcee.ai Our PruneMe repository, inspired by "The Unreasonable Ineffectiveness of the Deeper Layers," demonstrates a layer pruning technique for Large Language Models (LLMs) that enhances cost efficiency in domain adaptation. By removing redundant layers, we facilitate continual pre-training on streamlined models. Subsequently, these ...

Data Versioning in Generative AI: A Pathway to Cost-effective ML

35:24

Data Versioning in Generative AI: A Pathway to Cost-effective ML

มุมมอง 484 หลายเดือนก่อน

Speaker: Dmitry Petrov, CEO, DVC For 5 years we have been building DVC and we know how data versioning helps teams. The evolving Generative AI workflows are different and require an evolution of versioning workflows to accomplish Generative AI goals. This new era thrives on vast amounts of unstructured data, which include everything from images, videos, and audio, to MRI scans, document scans, ...

Building ML and GenAI Systems with Metaflow

47:12

Building ML and GenAI Systems with Metaflow

มุมมอง 1364 หลายเดือนก่อน

Building ML and GenAI Systems with Metaflow

Efficiently Fine-Tune And Serve Your Own LLMs

41:59

Efficiently Fine-Tune And Serve Your Own LLMs

มุมมอง 1114 หลายเดือนก่อน

Efficiently Fine-Tune And Serve Your Own LLMs

The Who, What, and Why of Data Lake Table Formats

47:33

The Who, What, and Why of Data Lake Table Formats

มุมมอง 764 หลายเดือนก่อน

The Who, What, and Why of Data Lake Table Formats

36:56

Private, Local AI

มุมมอง 2084 หลายเดือนก่อน

Private, Local AI

The Journey of Building a Leading Open Source LLM Security Toolkit

32:00

The Journey of Building a Leading Open Source LLM Security Toolkit

มุมมอง 1054 หลายเดือนก่อน

The Journey of Building a Leading Open Source LLM Security Toolkit

The Secret Sauce for Deploying LLM Applications into Production

31:23

The Secret Sauce for Deploying LLM Applications into Production

มุมมอง 1064 หลายเดือนก่อน

The Secret Sauce for Deploying LLM Applications into Production

Running Multiple Models on the Same GPU, on Spot Instances

32:31

Running Multiple Models on the Same GPU, on Spot Instances

มุมมอง 2034 หลายเดือนก่อน

Running Multiple Models on the Same GPU, on Spot Instances

Towards Robust GenAI: Techniques for Evaluating Enterprise LLM Applications

45:12

Towards Robust GenAI: Techniques for Evaluating Enterprise LLM Applications

มุมมอง 1094 หลายเดือนก่อน

Towards Robust GenAI: Techniques for Evaluating Enterprise LLM Applications

Introducing Arize-Phoenix and OpenInference

42:41

Introducing Arize-Phoenix and OpenInference

มุมมอง 4894 หลายเดือนก่อน

Introducing Arize-Phoenix and OpenInference

Mitigating RAG Hallucinations with Aporia Guardrails

24:25

Mitigating RAG Hallucinations with Aporia Guardrails

มุมมอง 1174 หลายเดือนก่อน

Mitigating RAG Hallucinations with Aporia Guardrails

28:41

LLMs From Dream to Deployed

มุมมอง 374 หลายเดือนก่อน

LLMs From Dream to Deployed

Evaluation Engineering: Iterative Strategies to Testing Prompts

28:35

Evaluation Engineering: Iterative Strategies to Testing Prompts

มุมมอง 2394 หลายเดือนก่อน

Evaluation Engineering: Iterative Strategies to Testing Prompts

Customizable RAG Workflows with your Own Data

39:07

Customizable RAG Workflows with your Own Data

มุมมอง 1224 หลายเดือนก่อน

Customizable RAG Workflows with your Own Data

Wanted: A Silver Bullet MLOps Solution for Enterprise

30:39

Wanted: A Silver Bullet MLOps Solution for Enterprise

มุมมอง 1194 หลายเดือนก่อน

Wanted: A Silver Bullet MLOps Solution for Enterprise

Evaluation Techniques for Large Language Models

1:40:21

Evaluation Techniques for Large Language Models

มุมมอง 1644 หลายเดือนก่อน

Evaluation Techniques for Large Language Models

ความคิดเห็น

@zeelbhatt7585 หลายเดือนก่อน
This is exactly what I wanted for my project
@jeromeeusebius หลายเดือนก่อน
It is possible to share the google doc that describes used in the hands-on workshop. Thanks
@nikunjbedia9398 หลายเดือนก่อน
This video caused a clash of Nikunjs in my team
@fantasyapart787 2 หลายเดือนก่อน
Nice Video, can we get the github link code for practising it
@EphremTadesse-v8x 2 หลายเดือนก่อน
This is really awesome! Thank you very much.
@techtb2923 2 หลายเดือนก่อน
Nice explanation thanks mam 👌
@stevietee3878 3 หลายเดือนก่อน
Excellent video, full of incredibly useful information, and very well presented.
@dattran6096 3 หลายเดือนก่อน
Great, Could you share the resources used for this video? Many thanks
@elenagavrilova3109 3 หลายเดือนก่อน
15:40 I'd add here a Task which is more 'main' than any other task. QA must understand what they do and why, they must understand business domain itself. Thank you for the video.
@Gerald-iz7mv 4 หลายเดือนก่อน
Hi, how to export to onnx using cuda?
@parasetamol6261 4 หลายเดือนก่อน
can you give me an example notbook to do this. in video.
@kanakorn 5 หลายเดือนก่อน
Nice, but it should be better to split into chapters, first 1 hours was setting up on AWS. Thank you.
@VineetDave-r4y 6 หลายเดือนก่อน
Amazing structured breakdown of the problem.
@mysticlunala8020 6 หลายเดือนก่อน
Hello Kartik/TH-cam Handler, I have just joined a company as a Machine Learning Engineer Intern and still a fresher. I would like to keep my Name and where I work anonymous for this specific platform. I am working on a task where I need to analyse the dataset I have been given and convert that data into text using LLM. Example Data: Date Temperature 2 Feb 30C 3 Feb 24C Example Output: Today's weather will be warmer than yesterday and a little pleasant.... <so on> The use case is a little different but this is just an example to explain what I actually want. A little more explanation: What I want is that the LLM to read the dataset completely either through an excel I have or any format like CSV and answer my queries or create a conclusion based on the dataset I gave. I would love to get some help/insights from someone as experienced as you on how I can achieve my goal. We can connect on some other platform if you are comfortable with it. You can contact me at me personal mail: rohitkhare998@gmail.com Thanks. regards, Novice ML Engineer
@maazmusa3192 7 หลายเดือนก่อน
Awesome talk. I am preparing for a privacy preserving ML interview and this was an amazing crash course. Second, for the thermal flu issue you mentioned, can't we just use FHE or SMPC like you mentioned in the slides?
@user-wh9bs9xc5y 7 หลายเดือนก่อน
Well explained!! thank you !!
@MrNewAmerican 7 หลายเดือนก่อน
For f****s sake turn the damn phone off
@MrNewAmerican 7 หลายเดือนก่อน
For f****s sake turn the damn phone off
@miguelalba2106 8 หลายเดือนก่อน
Very good tutorial, specially the MLServer part
@ydinuda 8 หลายเดือนก่อน
Helpful!
@palanisamy-dl9qe 9 หลายเดือนก่อน
Do you have demo video for this? And not able to access the github
@andaldana 9 หลายเดือนก่อน
Great talk! As suggested, we do see now more "small" LLMs trained with considerably larger amounts of tokens than the "compute-optimal” recommended by the Chinchilla scaling laws
@grzegorzknor8051 9 หลายเดือนก่อน
Great stuff. Really looking forward to more content like this! Props @AI-Makerspace
@franksommers7607 9 หลายเดือนก่อน
This talk is amazing. Completely nailed it.
@mohandutt7442 10 หลายเดือนก่อน
Repo link in description or comments will be helpful
@claude-p9c 10 หลายเดือนก่อน
Thanks for the very good overview of training distributed systems on kubernetes, would love to see more detailed information making all the pieces fit together !
@genesiscloud 10 หลายเดือนก่อน
Well done, Stefan!
@drpchankh 10 หลายเดือนก่อน
Finetuning e.g. Mistral LLM should perform way better than BERT. In practise, we typically finetune LLM model for the task.
@drpchankh 10 หลายเดือนก่อน
Great talk!
@djethereal99 10 หลายเดือนก่อน
Great talk!
@satyagadepalli5681 10 หลายเดือนก่อน
fantastic
@yafz 10 หลายเดือนก่อน
Great talk!
@manasakesagani1390 10 หลายเดือนก่อน
It is too good thank you for this wonderful workshop
@manasakesagani1390 10 หลายเดือนก่อน
Can you please let me know where can I find the presentations and note books ?
@biffboffo 11 หลายเดือนก่อน
Is there still a link somewhere to the slides?
@SanjeevKumar-dr6qj 11 หลายเดือนก่อน
I have found the link of the docs in case anyone needs it . docs.google.com/document/d/1zbPak5aDFcMgEIYbDmL_F9N0GHptvoobxP9GpQlesmk/edit
@ardiscapes4410 11 หลายเดือนก่อน
Promo`SM
@shaikbyte ปีที่แล้ว
Grate session. Thank you guys
@machinelearning3518 ปีที่แล้ว
great explaination
@machinelearning3518 ปีที่แล้ว
lack of clarity in ppt
@machinelearning3518 ปีที่แล้ว
plz take care of the clarity its really shitty
@jagadeeshmemories8760 ปีที่แล้ว
Great session
@juliocardenas4485 ปีที่แล้ว
This is intellectually beautiful and useful
@ZhengCheng 2 ปีที่แล้ว
the 1080p is the same as 360p
@sreenivasreddy6996 2 ปีที่แล้ว
👍👍🎉🎉❤️👍
@davidcodes009 2 ปีที่แล้ว
fanstastic demo! thank you so much
@deep8891 2 ปีที่แล้ว
Can you share the sample code as well ?
@Сергей-ф2м2т 2 ปีที่แล้ว
nice
@kaviaaravind3980 3 ปีที่แล้ว
Where to find the demo notebooks?
@user-tk5ir1hg7l 3 ปีที่แล้ว
These are amazing presentations but the slides are a bit blurry on all the videos on your channel, would be great if you could fix that in the future. Thank you.
@channel_panel193 3 ปีที่แล้ว
ugh why do so many of these recordings have bad audio

MLOps World: Machine Learning in Production

ความคิดเห็น