Watch Me Build Samantha: Full Demo & Code Breakdown | OpenAI Realtime API Conversational Agent AI

Jesús Copado

มุมมอง 7 347

เพิ่มลงใน
- เพลย์ลิสต์ของฉัน
- ดูภายหลัง
แชร์

แชร์

ฝัง

ขนาดวิดีโอ:

แสดงแผงควบคุมโปรแกรมเล่น

เล่นอัตโนมัติ

เล่นใหม่

เผยแพร่เมื่อ 22 ธ.ค. 2024

ความคิดเห็น • 72

@HarshSharma-vm3nm หลายเดือนก่อน ⁺³
This looks very amazing. Thanks for sharing this amazing build jesús.
Thats so nice of you to share this with complete codebase. A lot of people tend to gatekeepe it to them only.
Keep up buddy. Waiting for more vids like these. And ill also be keeping a tab on GitHub repo if there are future updates.
🙌
@jesuscopado-en หลายเดือนก่อน ⁺¹
Appreciate that very much!! :)
And I'm with you, sharing is key to progress, long live open-source haha
@catalyst9259 หลายเดือนก่อน ⁺¹
This is great, exactly what I needed, thankyou was having hard time getting tools with realtime api
@belencastillodiaz4514 หลายเดือนก่อน ⁺²
This is absolutely incredible!
@arianaponytail หลายเดือนก่อน ⁺¹
really cool stuff and i love it is showing examples and straight to the point! Thanks for sharing this!
@jesuscopado-en หลายเดือนก่อน
It's crazy yeah... Glad you liked it :)
@joeg3117 หลายเดือนก่อน
wow very impressive !! love your content, and keep it up !
@jesuscopado-en หลายเดือนก่อน
Thanks a lot! :)
@jota_moretti หลายเดือนก่อน
Imagine for a moment the potential of Samantha in fully digitalized companies with microservices architectures consumed through the tools module. It reminds me of that AI from 'I, Robot' by Isaac Asimov.
Great job, keep it up
@jesuscopado-en หลายเดือนก่อน
Yeah, the potential of that is huuuge. And great book! :D
@sergiorobles5156 หลายเดือนก่อน ⁺¹
So interactive... I love it
@EddiksonPena หลายเดือนก่อน
I love love love this and thank you for sharing, how did you build your tools ?
@rogercampbell9019 หลายเดือนก่อน ⁺³
Awesome! Would appreciate more insight into the .env setup and what API Keys are required.... for example does it have to be Azure?
@renequinones4607 หลายเดือนก่อน
it seems that several of the keys are paid sites.
@jesuscopado-en หลายเดือนก่อน
Next Python build video will show entirely to set up the .env from scratch and configure both TogetherAI's FLUX and Groq! Uploading it tomorrow!
@jesuscopado-en หลายเดือนก่อน
Some of them are free till a certain usage, but plenty for a single person. I still haven't paid a cent :D. I mention that in the comments.
@baz813 หลายเดือนก่อน
The interuptability looks really good, 'she' shuts up quickly when you start talking
@jesuscopado-en หลายเดือนก่อน
Yess, that's crazy good, and contributes a lot to the feeling of having a human-like conversation :D
@SamMyst007 หลายเดือนก่อน
Awesome!! ❤️
@jesuscopado-en หลายเดือนก่อน
Thanks! :)
@Lord_Arkham หลายเดือนก่อน
Love it! I gonna subscribe, give you a like, watch all the other videos you may have and activate all notifications!
@jesuscopado-en หลายเดือนก่อน ⁺¹
Thanks a lot!! :)
@MarkRogers-j1p หลายเดือนก่อน
Amazing welldone thanks for the upload. new subscriber here
@jesuscopado-en หลายเดือนก่อน
Appreciate that! :)
@TheYules หลายเดือนก่อน
That looks awesome Jesús ! Would it be possible to use local LLM (with Ollama for example) and local stable diffusion models (using forge webui or comfyui)?
@jesuscopado-en หลายเดือนก่อน
Yess, absolutely, if you're lucky enough to have a beefy GPU for that hehe. You'd need to change the llm instantiation that uses Groq for Ollama -> python.langchain.com/api_reference/ollama/llms/langchain_ollama.llms.OllamaLLM.html#ollamallm
@TheYules หลายเดือนก่อน
@@jesuscopado-en Thanks for the tps, will look into that :) You don't need a super beefy GPU to run all LLMs some mistral or llama models are useable on more modest configs too!
@jesuscopado-en หลายเดือนก่อน ⁺¹
@@TheYules yeah, you're right! But you gotta stick to less than 7b parameter quantized models which are not that great. I usually test them on LM Studio app on my mac, great app actually! I always find simpler to go to Groq/OpenAI APIs as I don't worry about privacy (yet) and value the better performance
@fernandodiaz8231 หลายเดือนก่อน
Thank you for sharing this interesting proposal. I would like to ask you what additional resources may be required if you want to use it in a language other than English. For example, Spanish.
@jesuscopado-en หลายเดือนก่อน
Sii, se puede en español, solo tienes que hablar en español para que te responda en español. Como con ChatGPT
@ProlVid หลายเดือนก่อน
I like your work. This is awesome. How can I build this for myself usisng the tools you provided? I watched the other video but still can't figure out the process. Where to start from?
@jesuscopado-en หลายเดือนก่อน ⁺¹
Glad you like it :)
Hhmm I'd honestly recommend to ChatGPT your way into understanding. If you're new to Python or programming, ask for the basics, then take screenshots or copy paste errors and ask for clarification, that would both help you learn and progress in the projects.
@LePsyclone หลายเดือนก่อน
Great project man. I can't get it to run though. The voice/mic button doesnt appear and i get errors. Do you have a community I can join to trouble shoot? I would love to build on this project!
@jesuscopado-en หลายเดือนก่อน
Ah dang, if it's related to the mic try searching in Chainlit open issues -> github.com/Chainlit/chainlit/issues
@Stephan808 หลายเดือนก่อน
wow amazing! But I didnt get it. you use flux for image creation which is absolute awesome! Do you run it locally? Im running it in forge on runpod. Do you have an idea, how I can connect that frum runpod to samantha?
@jesuscopado-en หลายเดือนก่อน
It's running through the TogetherAI API. Well, you could pull the repo from runpod and then open a ngrok tunnel to use it from your machine
@Stephan808 หลายเดือนก่อน
@@jesuscopado-en ahh thx! good idea!
@Throttlemouth001 หลายเดือนก่อน
Hi, getting stuck at Connected to OpenAI realtime: - the mic turns on and nothing else happens, no error codes even. API Keys set.
@Throttlemouth001 หลายเดือนก่อน
btw Together now only offers $1 of free credits.
@jesuscopado-en หลายเดือนก่อน ⁺¹
Ah dang... Dunno what to tell you. Annoying to not have the error trace yeah. Maybe the endpoint is not correctly set... I'll soon update the repo with the latest Chainlit version (not the alpha) and maybe that helps.
@Throttlemouth001 หลายเดือนก่อน
@@jesuscopado-en Cheers! :D
@soumikganguly8038 หลายเดือนก่อน
Looks like gpt-4o-realtime-preview, 2024-10-01 is available only in eastus2 and swedencentral
@KevinKreger หลายเดือนก่อน
More subs for Jesús
@jesuscopado-en หลายเดือนก่อน ⁺¹
Appreciate that! :)
@simont733 หลายเดือนก่อน
I have seen your TH-cam it's amazing. I have a question can you combine your all videos and make an AI Sementa live video when you asked her she responded that instead of audio make her a video with the expiration from the previous girl that you make emotion picture to video
@jesuscopado-en หลายเดือนก่อน
I'm glad you like my channel :)
That'd be complicated, cause the avatar video models and lip sync models are still very slow, but soon in for future for sure this will be possible!
@bouazzarochdi9676 หลายเดือนก่อน ⁺¹
Nice job! II hade an idea talking with my mother. What if we will try to improve and customize it focused on cell use for parent and grandparents to use it in their cells Mostly, don't know how to use a cell. But, with an AI named as they like. I think it will be great for them :)
@jesuscopado-en หลายเดือนก่อน
That's a great idea! And I think it'll come in the near future... Many companies are already betting on this kind of personalized AIs
@sakibshaikh429 หลายเดือนก่อน
great, exactly
@MMABeijing หลายเดือนก่อน
you look shy when you talk to her, that's cute
@jesuscopado-en หลายเดือนก่อน ⁺¹
hahahaha, wait till I configure her with Scarlett Johansson's voice :D
@SoloJetMan หลายเดือนก่อน
what's the cost breakdown for this demo?
@jesuscopado-en หลายเดือนก่อน
Zero, running on those free resources and trials I mention in the comments :D
@adrianpetrescu8583 หลายเดือนก่อน
At this time Samantha have memory ? can she remember what you talk in last chat or 3 days ago ?
@jesuscopado-en หลายเดือนก่อน
Noup, not yet, but I'll implement that soon! In a similar fashion of how ChatGPT does it now
@luisavila8421 หลายเดือนก่อน
Nice jobs ❤
@jesuscopado-en หลายเดือนก่อน
Thanks! :)
@toshailendra หลายเดือนก่อน
cool… i can see the agentic future is here
@jesuscopado-en หลายเดือนก่อน
Completely agree!!
@sambit5526 หลายเดือนก่อน
always getting "Failed to connect to OpenAI realtime:"
@jesuscopado-en หลายเดือนก่อน
Ah dang, everybody that I know has been able to replicate it, if you send me the full log trace of the error I can try to help
@motivation_guru_93 หลายเดือนก่อน
With this pace, I am sure AI Overtaking Humans era is very near.
@clapton79 หลายเดือนก่อน
Feature Request 01: Please give Scarlett Johansson's voice to it. Pretty pleeeeease.
@jesuscopado-en หลายเดือนก่อน
Absolutely, but that'd be the end of my in-person/human social life haha
@bhanukiran6262 หลายเดือนก่อน
imagine claude desktop and smantha ?
@jesuscopado-en หลายเดือนก่อน
Gotta work on that!!
@vivsh.1999 หลายเดือนก่อน
not her, the sound is a bit robotic, if you can work on voice texture this would be surreal
@jesuscopado-en หลายเดือนก่อน ⁺³
Yeah, definitely doesn't sound like Scarlett Johansson hahaha. But we'll get there!! :D
@mine4me403 หลายเดือนก่อน
well, its not going to be completely a human voice. Because it needs to wait and listen then process (NLP).. the movie Her, was just that a movie.
@vivsh.1999 หลายเดือนก่อน ⁺²
@@mine4me403 i think you haven't yet talked to openai's voice model.

ต่อไป

เล่นอัตโนมัติ

How to Build Samantha with OpenAI’s Realtime API for FREE! | Speech-to-speech AI Agent Explanation