AutoGen Studio with 100% Local LLMs (LM Studio)
- Published Jul 5, 2024
- Learn how to run AutoGen Studio UI with local LLMs as agents.
🦾 Discord: / discord
☕ Buy me a Coffee: ko-fi.com/promptengineering
|🔴 Patreon: / promptengineering
💼Consulting: calendly.com/engineerprompt/c...
📧 Business Contact: engineerprompt@gmail.com
Become Member: tinyurl.com/y5h28s6h
💻 Pre-configured localGPT VM: bit.ly/localGPT (use Code: PromptEngineering for 50% off).
LINKS:
AutoGen UI: microsoft.github.io/autogen/b...
LM Studio: lmstudio.ai/
TIMESTAMPS:
[00:00] Introduction
[00:26] Setting Up LM Studio for Local LLMs
[00:54] Downloading and Running an Instance of LM Studio
[01:50] Serving the Model through an API Endpoint
[02:48] Starting an Instance of AutoGen Studio
[03:26] Creating Agents and Workflows in AutoGen Studio
[06:30] Testing the Workflow in the Playground
[07:35] Analyzing the Results and Issues
If you are interested in learning more about how to build robust RAG applications, check out this course: prompt-s-site.thinkific.com/courses/rag
This is so amazing how much the tech progressed in just a couple of months. Thank you for making this video.
Great first step, now i would like to see practical workflows that can be used for actual tasks. Hope to see videos on that.
Thanks, yes. more to come :)
For those having issues, make sure you create each item from scratch again and add "not-needed" as the API key on all local calls; this should fix you up. I tried to modify it afterwards and it failed.
I was missing one point in getting this working right yesterday, thanks.
If you run into issues with API key, just add: "not-needed" as the API key.
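To make the "not-needed" tip concrete, here is a minimal sketch of an OpenAI-style model config pointed at a local LM Studio server. The port (1234) and model name are assumptions — check the Local Server tab in LM Studio for your actual address; the key point is that the API key must be a non-empty placeholder.

```python
# Sketch of a model config for a local LM Studio endpoint (port and model
# name are assumptions). api_key must be a non-empty string, e.g. "not-needed".
local_llm_config = {
    "model": "local-model",                  # LM Studio serves whatever model is loaded
    "base_url": "http://localhost:1234/v1",  # assumed LM Studio default server address
    "api_key": "not-needed",                 # any non-empty string passes the key check
}

def passes_key_check(cfg: dict) -> bool:
    """Mimics the client-side check behind the 'api_key must be set' error."""
    return bool(cfg.get("api_key"))

print(passes_key_check(local_llm_config))                      # True
print(passes_key_check({**local_llm_config, "api_key": ""}))   # False
```

An empty string or a missing key is exactly what triggers the "The api_key client option must be set" error people report below.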
got it working from the default pip install autogenstudio with the API key as either "" or NULL, I can't remember. It stopped working when I updated autogenstudio. "not-needed" doesn't seem to work either, so I'll try to rerun from scratch in the meantime.
@@ssvfx. This isn't the solution — I updated and it stopped working! How do I run local LLMs then?
I just reinstalled the default autogenstudio version and didn't update @@nobo6687
Using "" in the API key field of the created local model... Works for me
@@negadan77 empty, or literally ""?
OMG, LM Studio is just great. This makes things so simple. Thanks for the great video.
Very useful video on an important improvement in how we can use open-source LLMs. More good progress in your moves to document how to create a winning localGPT.
Good one. Thanks!
Quick one: when you publish the workflow, it is supposedly publicly available. Is there a way to share the published workflow with others? Any possibility of making cosmetic changes to the UI?
I've been creating my own UI using Streamlit with the OpenAI Assistant. A good journey to go through, but it would be faster to spin out more agents with a ready-made UI.
Finally, thanks. I see a bright future for local LLM and agents
Immediately LARGE numbers of people want their own LLM's thumping away in the closet. :)
Yessssssssssssss this what i was waiting for.
very cool! thank you!
So after playing with it for several days I can firmly say: this whole thing (both AutoGen and its UI) is miles away from being useful for anything other than very small tasks. Don't fool yourselves, guys — try it out with your own hands. It can probably do a bit more with GPT-4, but with Mixtral-8x7B it's very limited in what it can do, and it's very buggy.
Skills issue
Always the function calling.. but this is such an important fundamental building block.
great video, ty! this tool becomes super powerful if it understands the codebase and executes / writes codes on your behalf. do you know of any tools that do this yet?
Awesome video, exciting indeed. Looking forward to many more use cases.
I tried mistral and noticed some strange behavior. By default, the system message tells it to type TERMINATE if the task is completed. This caused it to terminate early before it successfully saved the image. If I removed this line from the system prompt, it worked, but it then continued to just print("Task completed successfully") over and over until it hit the message cap.
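One possible workaround for that premature-TERMINATE behavior, as a hedged sketch: rather than deleting the TERMINATE instruction from the system prompt, use a stricter termination check that only fires when TERMINATE is the entire reply, not merely mentioned mid-task. The message shape mirrors OpenAI-style chat messages; whether AutoGen Studio exposes this hook is unverified, though plain AutoGen agents do accept an `is_termination_msg` callable.

```python
# Stricter termination check: only end the run when the stripped reply
# is exactly "TERMINATE", so the model can't exit by mentioning the word.
def is_termination_msg(message: dict) -> bool:
    content = (message.get("content") or "").strip()
    return content == "TERMINATE"

print(is_termination_msg({"content": "TERMINATE"}))                       # True
print(is_termination_msg({"content": "Saving the image first. TERMINATE?"}))  # False
```

This avoids both failure modes described above: early exits from a loose match, and the endless "Task completed successfully" loop you get with no termination condition at all.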
I am getting this error: Error occurred while processing message: The api_key client option must be set either by passing api_key to the client or by setting the OPENAI_API_KEY environment variable
How to integrate Open Interpreter and LLaVA? Or how can we run Open Interpreter with local models via LM Studio instead of GPT-4 Vision?
Which open-source model can we use for function calling? I've got a 3060 12GB and 32GB RAM.
Brilliant as usual. One major thing about using LM Studio: it's painfully slow as an agent (verbose logging seems to be one of the bottlenecks), even on my i9 13900K/64GB/4090 system. Even though AutoGen Studio seems more polished, I'm thinking that CrewAI with Ollama is more performant when using local LLMs.
That's an interesting observation. Isn't there an option to turn off the verbose? Also might be interesting to see what they are using in the background for inference.
@@engineerprompt LM Studio uses llama.cpp as its backend.
I get this error "title":"'messages' array must only contain objects with a 'content' field that is not empty". I have tried several different models.
Hey, what about havin an OpenAI compatible API-Endpoint from Autogen-Studio? I'd love to use that API for my projects. Do you know if this is already possible?
[2024-01-17 13:55:47.798] [ERROR] [Server Error] {"title":"'messages' array must only contain objects with a 'content' field that is not empty"}
how to solve this?
Is it because the system message of the user is empty?
I'm getting this as well. Tried adding a system message and it didn't fix it
I got this as well (windows 11, AMD R7, RTX 3090), I solved it by toggling the "Cross-Origin-Resource-Sharing (CORS)" setting to ON in LM Studio on the Local Server page (under the server port), It warns that it is a security risk, so use it with caution.
Same
I did the same installation and used the same model. It wrote the same code but did not execute it. I'm running on Windows.
What is the best LLM for Python code generation?
Great Video! Can you make another video using different LLM, & Show different use cases of it?
Yes, more is coming :)
Thank you so much for this. Could you also guide me how to use an LLM inference from TogetherAI to run Autogen?
For me it generates only the first word of each of the 13 responses in one go. I tried to search for a solution, but it feels like I am the only one with this problem.
When I connect to OpenAI, it works perfectly; only local LM Studio has this problem. But LM Studio itself works normally when I connect it to SillyTavern, and even the AutoGen test runs OK. Only when it comes to running the workflow in the playground does it have this problem. In the LM Studio console it looks like it starts to respond and immediately closes it and starts again and closes it... I tried every combination of LM Studio server settings. Does anyone have any suggestions please?
What is the benefit of combining LM Studio & AutoGen?
Nice video.
Be aware of the LM Studio licence.
Non-commercial?
@@chrism3440 Hm, for autogen I now see only MIT. Conda environment is not open source.
Can't get past the The api_key client option must be set either by passing api_key to the client or by setting the OPENAI_API_KEY environment variable" error. Tried NULL, EMPTY, "", not-needed... nothing works.
I tried Phi-3 and Llama 3... both of them generate no output. Agents keep saying gibberish.
Hi, I did everything as per the video but I got back the result as a fragmented conversation between the userproxy and the primary agent, i.e.:
my original prompt: userproxy: how r u?
primary_assistant: I'
userproxy: am good
primary_assistant: Glad
userproxy: to hear.
and after 12 messages the response is cut off.
How can I solve this issue? Thank you for the help in advance.
AutoGen is fine-tuned for the GPT API.
How to solve:
Error occurred while processing message: The api_key client option must be set either by passing api_key to the client or by setting the OPENAI_API_KEY environment variable
I tried "not-needed" in the API key but get the same issue.
make sure you start with a completely new workflow and add quotes "" around not-needed. Hope this helps.
Is there an alternative for LM studio? it’s not compatible with my Intel Mac. Thanks for the great video!
Yep - Ditch the Mac ;)
There are alternative tutorials using ollama
@@dandretube thanks, got it figured out and it works great.
I'm interested in using AI to analyze trends and difficulty levels in UPSC exam questions. Can you share more details on how well AI can currently handle this task and any considerations I should be aware of?
You most likely want to look at something like BigQuery ML for this, depending on the total size of your dataset.
@@prestonmccauley43 can you please help me with that ? Please
@@prestonmccauley43 What are the minimum computer specifications for running a large language model?
@@PubgSpeed-wl8yo You really are going to need Colab for this, most likely. You can probably explore it on a T4 GPU, which is free.
@@prestonmccauley43 So I'll be able to hook up Llama on Colab? And then I can connect it to AutoGen?
LM Studio is the best tool; I use it for everything.
I agree, it's really good.
Jan is almost as good, and it's open source (which usually means free of the trojans and backdoors associated with proprietary software).
How much does the OpenAI key cost to run all these ChatGPT LLMs?
check it out: openai.com/pricing
Doesn't work for me. Using CodeLLM or any model, I get the error: Error occurred while processing message: The api_key client option must be set either by passing api_key to the client or by setting the OPENAI_API_KEY environment variable
Fix for it: insert some fake API key.
I have the same issue .
Add "not-needed" as the API key.
I received the same error while using GPT4 and now with the local LLM model. Anyone been able to resolve the issue?
Great video, thanks!
Having the same issue, and I used the server model in LM Studio before, but something must not be saving when adding "not-needed". I'm attempting to rebuild the workflow agent from scratch. @@mog22utube
I did everything you did but I can't figure out what the issue is; I keep getting:
[Server Error] {"title":"'messages' array must only contain objects with a 'content' field that is not empty"}
BTW, I am using Windows, not Mac.
Model: openhermes 2.5 mistral 7B Q6_K gguf
If anyone knows, please help!
Me too; I'm on Mac with the same error. I did see a discussion on the AutoGen Studio GitHub where the same has been reported, and it seems like they are looking into it.
Finally got it working; what I did differently was enter something into every field I could — all the system messages, description fields, etc. The only thing I had to leave blank was model type, as putting something there threw an exception.
@@christopherwilms Seems like it worked, thanks! The issue now is that it hallucinates as if it were "Fear and Loathing in Las Vegas" 😂
Agree, just filling in all the system prompts solved the issue. Many thanks @@christopherwilms
I get to 6:52 after following your instructions precisely and get this message: "Error occurred while processing message: The api_key client option must be set either by passing api_key to the client or by setting the OPENAI_API_KEY environment variable"
I've checked and I have not referenced GPT-4 at all in the workflow. :-(
Ah, I needed to add "not-needed" as the API key. EDIT: but it still doesn't work; it displays errors whatever I do, even if I just type "Hello". :-(
In my case I ran into this when I was testing but it worked without it as well. Seems like there might be a bug. Thanks for pointing it out.
Were you able to figure this out? I am getting the same error.
@@GiovaDuarte See my reply above. Fixed the error but I didn't get much further. I gave up in the end.
When I run a local LLM I get two-word responses from both the user and the other agent.
Like
here is
the script
for the
here is
with each one talking back and forth like this? ANY IDEA? LMAO
Hi, how does licensing work if I build a web query app which uses the Gemini API to create a summary of the search results. If I use the free version of Gemini, and a web scraper, e.g. Serp API , and it returns a a json with the web content, and I send it to Gemini, they will use it for training. So what if I scrape by accident material that has not been authorized for AI training?
I am not sure. You will probably need legal advice here.
LM Studio doesn't seem to support function/tool calling even if the model is made for this. I am still experimenting and researching this.
That's really useful info. I haven't looked at it, but please let us know if you find something.
@@engineerprompt Haven't found a solution yet, other than just using a "Please give me a json in this format" + pydantic object or Langchain ChatOpenAI + Output Parser.
I managed to get my langchain code working by just setting the api base to the link of LM Studio, but their official tool binding mechanism doesn't work with LM Studio.
I even tried Gorilla OpenFunctions v2, which is made for function calling, and it doesn't work.
BUT
I checked the Fireworks AI cloud service. They have Llama 3 70B for $0.90 per million tokens, which answers instantly and supports OpenAI-style function calling (a feature added by Fireworks AI); even for complicated tasks the response is immediate. This seems much better than GPT-3.5 while being 10 times cheaper than GPT-4.
They dumbed down GPT-4 with the newest model anyway; it hasn't been following instructions like it used to over the last several weeks.
LangChain has an integration with Fireworks AI, so it can be used as a replacement for OpenAI. I am not sure whether AutoGen is based on LangChain, but if it is, it should work.
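For reference, the "ask for JSON in the prompt" fallback mentioned in this thread can be sketched roughly as below. This is not an LM Studio or LangChain API — the prompt suffix, schema, and tool name are invented examples of the general pattern: instruct the model to reply with JSON only, then parse and validate it client-side.

```python
import json

# Hypothetical prompt suffix asking the model for a machine-readable reply.
PROMPT_SUFFIX = 'Reply ONLY with JSON in this format: {"tool": "<name>", "args": {...}}'

def parse_tool_call(reply: str) -> dict:
    """Extract the first {...} span from a raw model reply and validate its keys."""
    start, end = reply.find("{"), reply.rfind("}") + 1
    call = json.loads(reply[start:end])
    if "tool" not in call or "args" not in call:
        raise ValueError("reply is missing 'tool' or 'args'")
    return call

# Models often wrap the JSON in chatter, so we slice out the braces first.
call = parse_tool_call('Sure! {"tool": "save_image", "args": {"path": "out.png"}}')
print(call["tool"])  # save_image
```

It is cruder than native tool calling — there is no schema enforcement server-side — but it works with any chat-completions endpoint, local or hosted.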
I was almost going to pull the trigger on a 3090 for running local models, but without proper function calling it is useless to me.
Why do my agents keep repeating themselves and never solve the problem?? The user proxy and another assistant just keep going back and forth saying the same stuff to each other and they never complete the task.
When trying this I got an error inside LM Studio saying: [ERROR] Unexpected endpoint or method. (POST /v1/chat/completions/chat/completions). Returning 200 anyway
I have already tried changing the port but it did not work...
That address looks wrong. chat/completions is doubled.
Remove "chat/completions" from the base url
@@cashmoneybanks8442 I did it like in the video. Only used v1 in the end
It's working for me with "localhost:0201/v1", where 0201 is the port I selected.
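The doubled-path error above can be illustrated with a small sketch: OpenAI-style clients append /chat/completions to the base URL themselves, so the base URL must stop at /v1. The port (1234) here is just an illustration — use whatever your LM Studio server shows.

```python
# OpenAI-style clients build the request path by appending /chat/completions
# to the configured base URL; including it in the base URL duplicates the path.
def endpoint(base_url: str) -> str:
    return base_url.rstrip("/") + "/chat/completions"

print(endpoint("http://localhost:1234/v1"))
# http://localhost:1234/v1/chat/completions  (correct)
print(endpoint("http://localhost:1234/v1/chat/completions"))
# http://localhost:1234/v1/chat/completions/chat/completions  (duplicated, rejected)
```

This matches the "/v1/chat/completions/chat/completions" address in the error message above: the fix is simply to end the base URL at /v1.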
Now how can we run using gpu on google colab?
You will have to run the non-GUI version.
Can you make a tutorial on running AutoGen through the Hugging Face Inference API for free?
Let me see what I can do.
"not-needed" DOES NOT WORK
Content message error for me with Mistral; going to try OpenHermes now. I'm personally sick of Windows making such flawed stuff that runs perfectly on Mac and rarely on Windows.
LOL I'm on windows and it works flawlessly. Maybe it's user error LOL
I have it running with the Bloke's mistral-7b-instruct-v0.1.Q4_K_M.gguf
Way too techy for the masses. Interface is very difficult.