How to Use Open Source LLMs in AutoGen Powered by vLLM
- Published Jul 25, 2024
- In this video, I would like to talk about creating agents in AutoGen with Open Source LLMs.
USEFUL LINKS:
Colab notebook for AutoGen w/ GPT-4 - colab.research.google.com/dri...
Colab notebook for AutoGen w/ Phi-2 - colab.research.google.com/dri...
Tutorial on Medium: levelup.gitconnected.com/addi...
AutoGen Docs: microsoft.github.io/autogen
vLLM Docs: docs.vllm.ai/en/latest/models...
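To make the setup in the video concrete, here is a minimal sketch of pointing AutoGen at a locally served open-source model. It assumes vLLM's OpenAI-compatible server has been started separately (e.g. `python -m vllm.entrypoints.openai.api_server --model microsoft/phi-2`); the model name, port, and `base_url` key are illustrative and may differ across AutoGen versions.

```python
# Sketch: an AutoGen config list that targets a local vLLM server instead
# of the OpenAI API. vLLM exposes an OpenAI-compatible endpoint at /v1,
# so AutoGen only needs the base URL changed; the api_key is a dummy
# value because the local server does not check it.
config_list = [
    {
        "model": "microsoft/phi-2",              # must match the model vLLM serves
        "base_url": "http://localhost:8000/v1",  # vLLM's OpenAI-compatible endpoint
        "api_key": "EMPTY",                      # placeholder; AutoGen requires a key field
    }
]

# With AutoGen installed, this list plugs into an agent, for example:
# import autogen
# assistant = autogen.AssistantAgent(
#     "assistant", llm_config={"config_list": config_list}
# )
```

The point is that nothing agent-side changes when swapping GPT-4 for an open-source model; only the endpoint and model name in the config list differ.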
MY CONNECT:
Buy me a coffee - ko-fi.com/yeyuh
Business Inquiries - wenbo.huang@yeyulab.com
X: x.com/Yeyu2HUANG
Discord - / discord
Email Subscription - yeyu.substack.com/
Exclusive service - ko-fi.com/yeyuh/tiers
This is spot on. Thank you for making the video and explaining so well.
Thanks
Great ! You are really good at what you do!
Thank you!
Terrific video! Thank you for sharing your knowledge.
Glad it was helpful!
Unreal video! Looking forward to testing various models instead of GPT!
Thanks, would like to see the performances as well.
How to use autogen with aws bedrock models ?
Is it possible to use PowerInfer instead of vLLM? If so, which one would be faster? Perhaps a good video to make, comparing those two inference tools.
Looks like PowerInfer is a pretty new inference tool. It isn't supported in AutoGen directly right now, but if you can run Uvicorn to serve its inference behind an OpenAI-compatible endpoint, there may be a chance. Thanks for the recommendation.
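The suggestion above boils down to: any backend works with AutoGen if it speaks the OpenAI chat-completions wire format. As a hedged illustration of what that endpoint shape looks like, here is a toy `/v1/chat/completions` server built on the Python standard library only; a real deployment would call an inference engine where this sketch just echoes the prompt, and would normally run under Uvicorn/FastAPI rather than `http.server`.

```python
# Toy sketch of an OpenAI-compatible chat endpoint, stdlib only.
# Any backend exposing this request/response shape can be targeted by
# AutoGen via base_url; the echoed reply stands in for real inference.
import json
import threading
from http.server import BaseHTTPRequestHandler, HTTPServer

class ChatHandler(BaseHTTPRequestHandler):
    def do_POST(self):
        if self.path != "/v1/chat/completions":
            self.send_error(404)
            return
        length = int(self.headers.get("Content-Length", 0))
        request = json.loads(self.rfile.read(length))
        # A real server would run model inference here; we echo instead.
        reply = "echo: " + request["messages"][-1]["content"]
        body = json.dumps({
            "id": "chatcmpl-0",
            "object": "chat.completion",
            "model": request.get("model", "local"),
            "choices": [{
                "index": 0,
                "message": {"role": "assistant", "content": reply},
                "finish_reason": "stop",
            }],
        }).encode()
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, *args):  # keep the demo quiet
        pass

# Bind to an ephemeral port and serve in a background thread.
server = HTTPServer(("127.0.0.1", 0), ChatHandler)
threading.Thread(target=server.serve_forever, daemon=True).start()
port = server.server_address[1]
```

A client (or AutoGen, with `base_url` set to `http://127.0.0.1:{port}/v1`) can then POST a standard chat payload and get back a standard completion object.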
I don't believe you have it in your YouTube tags, but you should fill out your tags with things like "vLLM tutorial" — I looked for one and came up very short, and this would've been much more useful. Thanks for the vid!
Good suggestion, thanks!
Hey, thanks for the in-depth explanation. While it's great that we can use AutoGen along with open-source models using vLLM, is there any chance we could use the Gemini API along with AutoGen?
There is an ongoing branch of AutoGen working on Gemini integration. You should be able to use it soon, I think. github.com/microsoft/autogen/tree/gemini
@yeyulab Yeah, I checked it, but it has had no commits in the last 2 weeks, and I doubt it's one of their top priorities right now. I couldn't find any online resources for using Gemini's free API with AutoGen either.
The free Gemini API is really useful, I agree. Let me check with their team.
Thanks for sharing. Can vLLM be installed on Mac? Please help if it can, as the Mac Studio has all the muscle needed to do the heavy lifting 🙂
vLLM does not support a macOS backend at the moment, and I guess the reason is that they want to maximize generation throughput on V100/H100 GPUs.