Use AutoGen with ANY Open-Source Model! (RunPod + TextGen WebUI)

Matthew Berman

มุมมอง 56 671

เพิ่มลงใน
- เพลย์ลิสต์ของฉัน
- ดูภายหลัง
แชร์

แชร์

ฝัง

ขนาดวิดีโอ:

แสดงแผงควบคุมโปรแกรมเล่น

เล่นอัตโนมัติ

เล่นใหม่

เผยแพร่เมื่อ 17 ต.ค. 2023
I might be obsessed with AutoGen...
In this video, I show you how to use AutoGen powered by TextGen WebUI and RunPod, which means you can use literally any open-source large language model with it, even Falcon 180b or Code LLaMA.
Enjoy :)
Join My Newsletter for Regular AI Updates 👇🏼
www.matthewberman.com
Need AI Consulting? ✅
forwardfuture.ai/
Rent a GPU (MassedCompute) 🚀
bit.ly/matthew-berman-youtube
USE CODE "MatthewBerman" for 50% discount
My Links 🔗
👉🏻 Subscribe: / @matthew_berman
👉🏻 Twitter: / matthewberman
👉🏻 Discord: / discord
👉🏻 Patreon: / matthewberman
Media/Sponsorship Inquiries 📈
bit.ly/44TC45V
Links:
Use RunPod - bit.ly/3OtbnQx
AutoGen Beginner Tutorial - • AutoGen Tutorial 🚀 Cre...
AutoGen Intermediate Tutorial - • AutoGen FULL Tutorial ...
AutoGen Fully Local - • How To Use AutoGen Wit...
AutoGen - microsoft.github.io/autogen
LMStudio - lmstudio.ai/
RunPod TextGen WebUI Template - bit.ly/3EqiQdl
Install TextGen Locally - • How To Install TextGen...
RunPod Full Tutorial - • Run ANY LLM Using Clou...
วิทยาศาสตร์และเทคโนโลยี

ความคิดเห็น • 199

@matthew_berman 8 หลายเดือนก่อน ⁺²⁰⁵
Should I make a video testing different open-source models to see which one powers AutoGen best?
@scitechtalktv9742 8 หลายเดือนก่อน ⁺⁹
I think that would be of great utility !
@1-chaz-1 8 หลายเดือนก่อน ⁺²
Yes!
@Jimmy_Sandwiches 8 หลายเดือนก่อน ⁺⁴
That would be a tremendous help Matt - I was actually thinking of the ability to have specific agents using different models due to type and complexity of their roles - (one role could just have access to a business data model to keep things tight in certain areas) would really be a massive bespoke powerhouse
@stickmanland 8 หลายเดือนก่อน
Definitilily! 😉
@Techonsapevole 8 หลายเดือนก่อน ⁺²
i'd like a real useful usecase of autogen
@simkjels 8 หลายเดือนก่อน ⁺⁵⁴
I work at a law firm, and I have set up AutoGen group chat to simulate a legal team to solve tasks. The team gather legal information and argue legal matters between agents to come up with a multiple scenarios that a virtual judge finally rate each of the suggested solutions. I tried it on previous exams from law school and compared AutoGen's output to the exam evaluation, and it is staggering how well it performs.
@jesserigon 8 หลายเดือนก่อน ⁺⁸
is this a public repo? have you posted it to the examples chat in autogen discord? would be awesome to see.
@JustMaier 8 หลายเดือนก่อน ⁺⁵
I’d also like to see this. I want more example of people using autogen
@howlingdakota 8 หลายเดือนก่อน ⁺⁴
This is a great idea, would love to see it!
@neoblackcyptron 8 หลายเดือนก่อน ⁺¹
you should productize this and sell it to other law firms. Don't give out the code for free to freeloaders.
@jesserigon 8 หลายเดือนก่อน ⁺²
@@neoblackcyptron Im sure someone will regardless of this guy does, but to regard the opensource community as freeloaders is wierd. its the backbone of the entire Internet age imo. he's would need to use use like 1500 different foss projects (if you've ever seen a dependency tree) to productize his work.
@marcfruchtman9473 8 หลายเดือนก่อน ⁺¹⁰
Your ability to parse these install instructions and organize them into a video that we can actually follow is amazing.
Thank you for making these videos!
@luisortega3090 8 หลายเดือนก่อน ⁺⁴
Is anyone else continuously getting 502 gateway errors when they finish configuring the pod in the web UI? I've tried it on two different machines while using both Mistral 7b and Dolphin Mistral 7b
@IvanGabriele 8 หลายเดือนก่อน ⁺⁸
Thank you so much for the shout-out Matthew 😊! Amazing video and well-explained tutorial as usually! As I told you in private, even as a software engineer, you were the first one I watched and you helped me learn so much during my first steps into the AI & LLM world. Hopefully we'll have more amazing discoveries to share 😉.
@matthew_berman 8 หลายเดือนก่อน ⁺²
Thank you, Ivan!
@Q9i 8 หลายเดือนก่อน ⁺¹⁶
This will be INSANE! Can't wait to see what all people make from this.
@koliux1 8 หลายเดือนก่อน ⁺¹⁴
things that I think are a must for Autogen to take off are :
1) how well if at all it can push to github
2) iterate on the github
3) Embeddings and Vector DB like Supabase
to store all prompts so it does not deviate too much from the development of the coding project :/ ( but maybe I missed that part )
@JonathanPohlner 8 หลายเดือนก่อน ⁺³
keep up the good work! loving the autogen series! a wizard an an assistant and a completer walk into a bar
@MrBowmanMakes 8 หลายเดือนก่อน
you're on fire! I've learned so much about autogen from you and really appreciate your clear and focussed tutorials. Thanks Matthew!
@zyxwvutsrqponmlkh 8 หลายเดือนก่อน ⁺²
2:23 It is not completely uncensored, however an effort was made and with proper instructions you can mostly avoid censorship that it still tries to do to it's output. This was a cencored model which fine tuning efforts were made to reverse the censorship, it was not 100% successful but it was a good effort and it is substantially more useful.
@OriginalRaveParty 8 หลายเดือนก่อน ⁺⁸
I never quite figured out how to get multiple agents set up in VS Code, running mistral 7b locally with autogen. I configured assistant as name "Coder", and then assistant2 as name "Checker" and tried to get Coder to pass all his work to Checker to verify his work, but instead it all came back to me as User Proxy. Would be great to see a 5 agent example, like a little dev team with a CEO, concept designer, user interface guy, coder and code checker or something similar 👍
@luisortega3090 8 หลายเดือนก่อน ⁺¹
I believe the name of the assistant object and the assigned name have to be the same
@berkesimus 8 หลายเดือนก่อน
These are great! Thanks for showing us the way mate
@frankismartinez 8 หลายเดือนก่อน ⁺¹
Worked sweet on my older Mac M1; was able to create a POC for a healthcare project… immediate industry value
@marc1190 8 หลายเดือนก่อน ⁺¹
Great content, concise and gets to the point
@xb3sox_ 8 หลายเดือนก่อน ⁺³
Thx Matthew for this incredible work, I tried it before many times with many models and Mistral was the best and light option, i faced one issue with context length limit and i hope they have a good technique to solve it
@RemessOfficial 8 หลายเดือนก่อน
this series is great keep it up! thank you!
@forcanadaru 8 หลายเดือนก่อน
You are absolutely outstanding, Mat!
@itlackey1920 8 หลายเดือนก่อน
Very cool, thanks for sharing! I am going to be trying this asap 🎉
@J3R3MI6 8 หลายเดือนก่อน ⁺¹
Dude yes.. thanks Matt 🙏🏽💎
@jwia007 8 หลายเดือนก่อน ⁺³
Your videos are amazing!
@matthew_berman 8 หลายเดือนก่อน
Thanks!
@ReLogic888 8 หลายเดือนก่อน ⁺¹
Damn, why i just found your channel now ?
however, you do a good job with this channel. Very detail, practical, and easy explanation to follow. 👍👍
@SapienSpace 8 หลายเดือนก่อน ⁺¹
Thank you for sharing, very interesting. I look forward to seeing practical use cases that demonstrates savings in time and/or money for the most people, that is the kind of application that will have "legs"!
@mikewhite6561 8 หลายเดือนก่อน ⁺¹
Can't wait for the Autogen advanced tutorial!
@samlak7102 8 หลายเดือนก่อน ⁺¹
We would love to see your projects, they must be interesting!
@phillip_jacobs 8 หลายเดือนก่อน
Super keen for that Advanced AG Tutorial!
@TheAIAndy 8 หลายเดือนก่อน
looking forward to the real usecase video thanks again Matthew!
@VidarBrekke 8 หลายเดือนก่อน ⁺⁴
A few questions:
1 - My scripts always fail because they generate more than the 8K token limit. is there a way to avoid this from happening? Can c-tags or other method be implemented?
2 - Will Autogen work with existing (large) codebases (I have a Django project i'm working on), if so how?
@austinpatrick1871 8 หลายเดือนก่อน
A helpful usecase I’ve found was with finding ongoing clinical trials that a particular patient could be a good candidate for.
This implementation was technically with autogpt (not done it with autogen yet)
@BunniesAI 8 หลายเดือนก่อน
This is awesome 😍
@weeeBloom 8 หลายเดือนก่อน ⁺²
Thanks for your tutorials, are amazing. Please, I would really like to see more Autogen tutorials, the best use cases!!!
@matthew_berman 8 หลายเดือนก่อน ⁺¹
Thanks, will do!
@user-fm3hr1lv8y 8 หลายเดือนก่อน
Man you seem younger every day it pass, keep your great job. LOVE
@JonathanStory 8 หลายเดือนก่อน ⁺⁴
Really confused about actual pricing for running on RunPod. The posted prices ($/hr) don't mean anything to me because I'm clueless about how much cpu time would be used in the real world. Is it likely to be multiples of Chat-GPT4's $20/mo? If you spend a day coding with mistral, what does that set you back?
@DanielSCowser 8 หลายเดือนก่อน
Following
@stereotyp9991 8 หลายเดือนก่อน ⁺²
Hi! The 20 Bucks per month for ChatGPT and the OPEN AI API are two different things. If you want to use the OPEN-AI API for your autogen you have to pay for every token regardless of whether you are paying for ChatGPT or not. peace
@achille_king 8 หลายเดือนก่อน ⁺²
Thank you for great work and video again! I was wondering about possibility of combine aider with autogen. For example if developer agent could use aider when the prompt was given from proxy agent?
@sfco1299 8 หลายเดือนก่อน ⁺²
Is there a reason to use text gen webui instead of LM Studio for a local execution scenario?
@OpenAITutor 8 หลายเดือนก่อน ⁺²
Just got it to work locally on my Windows, box. Thank you for the video. Um, suggestion to folks make sure you tell the bot in your system message the OS you are using. It likes to default to Linux. :) TextGen WebUI is a beast. The lm studio is too new.
@kigas24 8 หลายเดือนก่อน ⁺⁴
I've discovered autogen + langchain can work with Excel sheets. Autogen can read the columns and calculate financial ratios (I use it for finance). Really looking forward to the advanced autogen video.
@candogruyol 8 หลายเดือนก่อน ⁺³
Hi there! Is there any chance you can share this? I'm trying to do the same thing!
@ludoviclebleu 8 หลายเดือนก่อน ⁺¹
Please share that (bis).
Thx in advance!
@kigas24 8 หลายเดือนก่อน
@@ludoviclebleu search "Using Langchain with Autogen", video by DLExplorers is the video I followed. Change the Excel file + autogen prompts for your use case.
@kigas24 8 หลายเดือนก่อน
@@candogruyol search "Using Langchain with Autogen", video by DLExplorers is the video I followed. Change the Excel file + autogen prompts for your use case.
@DanielSCowser 8 หลายเดือนก่อน
Yes please!
@dudedkdk 8 หลายเดือนก่อน ⁺⁴
Hi Matthew great stuff can you maybe make a video on GDPR and data governance if you are using auto gen? Is it safe to use if you have sensitive data
@mdfarhananis8950 8 หลายเดือนก่อน
This is really good
@neokortexproductions3311 8 หลายเดือนก่อน
Thanks for the info,subscribed too.
that opening shot transition was sick, where is that from?
@ademrahal3389 8 หลายเดือนก่อน ⁺¹
Hey great video. I actually am starting to work with autogen, and local LLM's. But there is a big issue for using tools while doing so, do you have any solution for this? My first thought rn is to pass the tools as arguments and instead of running a simple generate on the models, I run an agent chain with langchain but it's not a clean solution.
@GoldenDragonFromHills 8 หลายเดือนก่อน ⁺⁶
Waiting for the advanced code generation tutorial by autogen
@weeeBloom 8 หลายเดือนก่อน
yeah, me too!!
@fuba44 8 หลายเดือนก่อน ⁺²
Runpod also offer like a "LLM as a service", where you pay as you go. You think you could cover that in a video sometime?
@ludoviclebleu 8 หลายเดือนก่อน ⁺²
Yes, that's a must!
@spinettp 8 หลายเดือนก่อน
Thankyou so much.
@saintsscholars8231 8 หลายเดือนก่อน ⁺¹
I picked up a second hand MBP Mid 2010 Corr i7.
Will this run Auto Gen locally?
@samsontan1141 7 หลายเดือนก่อน ⁺¹
Been following your instruction for so long. Do we know what is wrong if the port 5001 is never ready?
@isaiahthompkins6523 8 หลายเดือนก่อน ⁺¹
Anyone know the price differential on requests of this approach vs OpenAi?
I can only imagine it’s much cheaper.
@leosoulas5897 8 หลายเดือนก่อน ⁺¹
is it possible to have agents from different LLMs talking to each other through Autogen. For example Mistral with Openai?
@RequiemAcapella 8 หลายเดือนก่อน
Ayy, just in time! AutoGen Hyype!
@CognitiveComputations 8 หลายเดือนก่อน ⁺²
maybe i should train dolphin on falcon-180b too
@kobyshoshan 8 หลายเดือนก่อน
I also see the same problem of non-exposed port 5001 at RUNPOD pods. Maybe they block it. Tried a lot of time the instructions and always get port not ready.
@Martin-kr5nx 8 หลายเดือนก่อน ⁺¹
Can you show us how to run agents that are a mix of OpenAI as well as open source?
@heliosobsidian 8 หลายเดือนก่อน ⁺¹
Hi Matthew! thanks for shaing! may I know where can check the AutoGen advance Tutorial? is it in the substack? please let me know :) have a nice day!!!🤗
@stickmanland 8 หลายเดือนก่อน ⁺¹
Three cheers for AutoGen!!
@mitchross2852 8 หลายเดือนก่อน ⁺¹
Do a real world example deploying llms in k8s to simulated production enterprise when developers can connect to the LLM in cluster
@bitcode_ 8 หลายเดือนก่อน ⁺¹
i used autogen with openai's api key and it ran my usage to $8 in less than 10 minutes
@consig1iere294 8 หลายเดือนก่อน ⁺⁴
In your previous video you mentioned LLM Studio where GPU could be used for GGUFs. How can one use GPUs for GGUFs? Thanks!
@easolutionsllc 8 หลายเดือนก่อน
I'm reading now you should be using GPTQs for running on VRAM (GPU)
@anthanh1921 8 หลายเดือนก่อน
I got limit context window size error (~2k) is there any setting of the TextGen UI can overcome this?
@Almsoo7 8 หลายเดือนก่อน ⁺¹
How do I get my GPU to work instead of CPU. It is taking very long to run my code as I notice from task manager only my CPU is working while the GPU is idle.
@Quitcool 8 หลายเดือนก่อน
I'm waiting for the next video
@workchannel6518 8 หลายเดือนก่อน ⁺⁴
I followed the tutorial (several times, actually) and still cannot get port 5001 ready. This is needed to emulate OpenAI API. I added 5001 to the "Expose HTTP Ports (Max 10)" field in the RunPod configuration (also tried editing the pod later too), followed the instructions in the video carefully and always get "HTTP Service [Port 5001] Not Ready" in the Connection Options tab of the Connect dialog. HELP!
@CookWithShar 8 หลายเดือนก่อน
same here! Was seeing if anybody had this issue recently
@davidyoung5074 6 หลายเดือนก่อน
its no longer port 5001. 5000 is the new api port
@gauravtewari233 6 หลายเดือนก่อน
Great Video again. I am having a small issue with current setup, I tried all mentioned on video and text web UI is working fine but whenever I'm trying to connect it via API it's giving no response while autogen is working fine between agents but response is 'None'.
@huanvo6799 7 หลายเดือนก่อน
How come you did not have to put the `llm_config` in the `UserProxyAgent`? Which llm is it using by default?
@mengli7441 7 หลายเดือนก่อน
Is it possible to use Autogen with open source models that are hosted on AWS EC2 instances?
@paolojoya3847 5 หลายเดือนก่อน
Hello, would anyone understand why I can't start the service? I am new to this and I can't find a way to solve it, I need to use port 5001
"it your service runnig? check your togs or read the README"
@OpenAITutor 8 หลายเดือนก่อน ⁺¹
Kind of sad that local LLM with autogen are not really ready for primetime. I hope they get better. At the moment, we can barely even create toy projects as demonstrated with GPT-4.
@snuwan 8 หลายเดือนก่อน ⁺³
I do not know if there is a change in the template or something but I followed this video and another video exactly but could not get a the port 5001 to be working. Then I talked to Runpod discord and they have asked me to add the following environment variable to the pod
environment variable called UI_ARGS to your pod with a value of --extensions openai --api-port 5001.
Then it is working. Hopefully it will help those who might face with the same issue.
@jakeburton8374 7 หลายเดือนก่อน
where do you add that environment variable?
@napszemuvegesteknos 7 หลายเดือนก่อน
not all heroes wear capes, thank you!
@xugefu 5 หลายเดือนก่อน
under expose http ports, there is env var setting.
@AncientSlugThrower 8 หลายเดือนก่อน ⁺¹
textgen webui is janky as hell in it's presentation, but it is still my favorite interface for trying new models because it is so bleeding edge. Great video.
@takasurazeem 5 หลายเดือนก่อน
Getting this error. While testing mistral model.
Error occurred while processing message: The api_key client option must be set either by passing api_key to the client or by setting the OPENAI_API_KEY environment variable
@omountassir 8 หลายเดือนก่อน
We would love to see your personal project on AutoGen! Mine is about : Leveraging AutoGen to craft an AI-driven intelligent solution for optimized pharmaceutical inventory management. I think its certainly too ambitious, I would love your advice.
@cristian15154 8 หลายเดือนก่อน ⁺¹
looks great, thanks.
but, wow if i have to do it from scratch for local use, it's kind of complicated, cause you will bump with many issues...
@moneyclip8772 8 หลายเดือนก่อน
So, could we do this with petals too?
@MatichekYoutube 8 หลายเดือนก่อน
Only thing missing is adding "langchain" of llamaindex to talk to database (csv, pdfs etc)
@BlayneOliver 8 หลายเดือนก่อน ⁺¹
Would you consider doing a video about Darkweb trained LLM's? I know of DarkBERT so far, but none others... I'd love to see that level on uncensored available
@LAVolAndy 8 หลายเดือนก่อน ⁺¹
What are the differences between RunPod and LMStudio? Why did you go away from LMStudio?
@ozsibe 8 หลายเดือนก่อน
or real world use case, i personally need to categorize my product categories to google product category structure. would love to be able to setup that on my PC.
@dflaggvt 8 หลายเดือนก่อน ⁺¹
It seems like the template is failing to run openai on port 5001
@MrAcarlo 8 หลายเดือนก่อน
Bravissimo!!!!🎉
@peterm9893 8 หลายเดือนก่อน ⁺²
it has missing functions though ... Fastchat does better in terms of api, but has issues with cpu offloading
@attilavass6935 8 หลายเดือนก่อน ⁺¹
Can Autogen be used with LLMs from Huggingface? Like in Langchain...
@kel78v2 3 หลายเดือนก่อน
Could you do more videos on autogen studio?
@mohamedbadawy7473 8 หลายเดือนก่อน
Can the TextGen WebUi run with the multiple autogen agents?
@joeyda3rd 8 หลายเดือนก่อน ⁺¹
Build an app. Your personal project would be great. Need to see how configure the agents. Code llama please!
@PigOnPCIn4K 8 หลายเดือนก่อน
I just installed code llama on textgen ui but whenever I try running it with Transformers model loader I get lots of traceback errors, show stopped for me and Code Llama :(
@peterm9893 8 หลายเดือนก่อน
that's what I been doing for almost 3 weeks now :D
@truliapro7112 8 หลายเดือนก่อน
@matthew_berman - Can we use AWS SageMaker foundation Models wit AutoGen?
@rastinder 8 หลายเดือนก่อน ⁺¹
Do we need cuda drivers?
@rollingmaster7708 8 หลายเดือนก่อน
How do you get internet access?
@w.balazs6424 8 หลายเดือนก่อน
So do I understand correctly, with this method I don't have to pay for the chatGPT API like at all?
@geofftsjy 8 หลายเดือนก่อน
Can you show how to do this with RunPod Serverless endpoints?
Also, is there a way to secure the endpoint and set your own API token?
@henrychien9177 8 หลายเดือนก่อน ⁺¹
So we dont need an api anymore? I mean like we can run without an api? Or just use the api generate for this platform?
@matthew_berman 8 หลายเดือนก่อน
You don’t need to pay for ChatGPT anymore, either way you need an API.
@nufh 8 หลายเดือนก่อน ⁺¹
Hi, I just started to venture into the AI world about almost 2 weeks now. And I am very fascinated by the llm, ChatGPT etc. Still struggle to learn Python and to understand what Docker is. Do you have any suggestion where should I start?
@raed3620 8 หลายเดือนก่อน
Ask chatgpt.
@stevenbaert1974 8 หลายเดือนก่อน ⁺³
Why use RunPod whereas you demonstrated LMStudio locally can do this free of charge? Also Mistral 7B can easily run locally which makes that you can run AutoGen endlessly and free? Am I missing something here? What's different besides Run Pod with your previous video? I was hoping to see LMStudio with Mistral 7B and AutoGen locally working on code but I miss the point of this video where LMStudio is replaced by Run Pod (which is paid by the hour). What's the message, why should we pay for Run Pod/do that?
@ludoviclebleu 8 หลายเดือนก่อน ⁺²
LMstudio has requirements that not all of us have locally :)
@ddwinhzy 8 หลายเดือนก่อน ⁺¹
There are still a lot of local computers that are not well equipped
@RadeckWolanin 8 หลายเดือนก่อน ⁺³
Hey Matt, unfortunately Dolphin model quickly hits max context window of 2048 tokens. I've tried few different ones (Mistral-7b) but with various success. Let us know which model works the best and thanks for great content!
@matthew_berman 8 หลายเดือนก่อน
MemGPT might be a good solution to the content window issues. I’m making a video about it today.
@isitanos 8 หลายเดือนก่อน
Looks like Autogen is already adding support for memgpt-enabled agents. I guess the limitation now is finding open source models that support function calling correctly. There are a couple up on Huggingface that claim to do so.
@dawn_of_Artificial_Intellect 8 หลายเดือนก่อน
Great video now you talking 🎉
@user-pq3uz2zb9i 8 หลายเดือนก่อน
Autogen advanced video please!
@ddwinhzy 8 หลายเดือนก่อน
I got to know runpod because of AutoGen and used him.
@blakelee4555 8 หลายเดือนก่อน
Can you use llava with this?
@dewijones92 8 หลายเดือนก่อน
more please
@skunkpaste2 8 หลายเดือนก่อน ⁺¹
can you start mentioning associated costs with the videos you do please? i.e. runpod cost me $xx.xx to run through this demo
@georgegeorge9242 8 หลายเดือนก่อน
Does it have internet access?
@Phoenix-zm5gv 8 วันที่ผ่านมา
Hey man, how to run it on iPhone?

ต่อไป

เล่นอัตโนมัติ

MemGPT 🧠 Giving AI Unlimited Prompt Size (Big Step Towards AGI?)