Best of CES 2025

Transformers (how LLMs work) explained visually | DL5

AI Agents Explained Like You're 5 (Seriously, Easiest Explanation Ever!)

มายคราฟแต่ "น้ำกับลาวา" สลับกัน!?

ช้างศึกโดนก่อน ไล่ยิงคืนสิงคโปร์ ทะลุน็อคเอาท์

🔴LIVE กัมพูชา vs ติมอร์-เลสเต | ฟุตบอล ASEAN Mitsubishi Electric Cup™ 2024 | รอบแรก กลุ่ม A

5x NEW Voices for OpenAI Realtime API: More Dynamic, More Expressive, More Natural

Bart Slodyczka

มุมมอง 2 695

เพิ่มลงใน
- เพลย์ลิสต์ของฉัน
- ดูภายหลัง
แชร์

แชร์

ฝัง

ขนาดวิดีโอ:

แสดงแผงควบคุมโปรแกรมเล่น

เล่นอัตโนมัติ

เล่นใหม่

เผยแพร่เมื่อ 10 ม.ค. 2025

ความคิดเห็น • 18

@ClaudiusGramse หลายเดือนก่อน
Hey Bart, I appreciate the time and effort you put into this and the real-time api videos. I believe that the OpenAI voices are not build really for the customer service use case. Perhaps you could look into retellAI or BlandAi as a sort of continuation of you series on Real-time voice assistance..?! I believe that their voice would be much more suitable for the customer service use cases. Anyway thanks for all the effort. Great stuff!
@BartSlodyczka หลายเดือนก่อน
Thank you legend 🙏 Good suggestion on retellAI - I've been hearing more and more about this and could be a really interesting next step. Thanks for the support my man!
@ixtc4233 2 หลายเดือนก่อน
Thanks for covering the new voices Bart. I do like the Sage voice also. I'm testing to use for my business. I have different twilio numbers for each ad so a different prompt relative to that ad. I need to test asking the assistant to recognize the language of the caller and reply in that language back. Lots of Spanish speakers in my area.
@BartSlodyczka 2 หลายเดือนก่อน
My pleasure legend :) One thing I didn't show in my tutorials, but might be useful for you; go to this link: platform.openai.com/docs/api-reference/realtime-client-events/session/update
You'll see the configuration (the code on the RHS) and that is similar to what we set in the previous replit code. One thing I don't set in my code is the "instructions" - which by default are set to something like "Your knowledge cutoff is 2023-10. You are a helpful assistant." It would be good to test passing through custom instructions here too (seperate from the main prompt we use in the code)
Hope this helps and good luck! 💪
@Daniel530KTM 2 หลายเดือนก่อน
For business apps, I like the approach where the voice is synthetic enough for the user to instantly know they're talking to a computer. Else for real-human sounding voices (like on the NotebookLM level) there ought to be a disclosure. Or, make it a live call, like a contact center call, where it's AI lead but human supervised. Most businesses want that feature anyway, like, the ability to pick up a call mid-voicemail. But yeah, as for the currently available realtime api voices, they seem a bit too theatrical.
@BartSlodyczka 2 หลายเดือนก่อน
Excellent breakdown, I actually don't have much experience with other AI voices and was impressed by these v2 voices from the original 3x v1. I'll checkout NotebookLM to start building a better baseline. and I absolutely love the description of "theatrical" - hit the nail on the head! Thanks for the nuance and recommendation 💪
@harchitb 2 หลายเดือนก่อน
the only benefit of realtime is it streams audio inputs directly instead of text tokens. So it uses its own voice synthesis. Until they can match eleven labs' voices, theres not much utility. The whole point of this tech for it to sound as close to a natural human as possible.
And we gotta wait for prices to drop too. But i'm def gonna be selling this to businesses when it meets these things
@BartSlodyczka 2 หลายเดือนก่อน ⁺¹
Agree with your points, hopefully within the next 3-6 months the ai voice landscape picks up (especially for OpenAI as I love the ecosystem). Keep it up man!
@victorvanvas 2 หลายเดือนก่อน
😎
@toromanow 2 หลายเดือนก่อน
Pozdrowienia z Texasu
@BartSlodyczka 2 หลายเดือนก่อน
Pozdrawiam!
@musumo1908 2 หลายเดือนก่อน ⁺¹
They totally suck unless you’re making a cartoon! 😂
@BartSlodyczka 2 หลายเดือนก่อน
Yeah some are cartoon-y but that's a cool use case! What platform has good voices? I haven't explored much just yet 💪
@eatfrenchtoast 2 หลายเดือนก่อน
@@BartSlodyczkaelevenlabs using a custom made voice is probably best. But it's not realtime with OpenAI behind it.
@MaliRasko 2 หลายเดือนก่อน ⁺¹
Let me help you ... they all suck for the use case you were trying it on. They all sound like they are reading a book or ..like a voice over for the cartoon ..overly theatrical. This can't be used in any real world context.
@BartSlodyczka 2 หลายเดือนก่อน ⁺²
Fair enough, can you link voices that you think do a great job so that myself and other viewers can check them out?
@handfuloflight 2 หลายเดือนก่อน
@@BartSlodyczka is there no way to use our own voices?

ต่อไป

เล่นอัตโนมัติ

Best of CES 2025

Best of CES 2025

Transformers (how LLMs work) explained visually | DL5

Transformers (how LLMs work) explained visually | DL5

AI Agents Explained Like You're 5 (Seriously, Easiest Explanation Ever!)

AI Agents Explained Like You're 5 (Seriously, Easiest Explanation Ever!)

มายคราฟแต่ "น้ำกับลาวา" สลับกัน!?

มายคราฟแต่ "น้ำกับลาวา" สลับกัน!?

ช้างศึกโดนก่อน ไล่ยิงคืนสิงคโปร์ ทะลุน็อคเอาท์

ช้างศึกโดนก่อน ไล่ยิงคืนสิงคโปร์ ทะลุน็อคเอาท์

🔴LIVE กัมพูชา vs ติมอร์-เลสเต | ฟุตบอล ASEAN Mitsubishi Electric Cup™ 2024 | รอบแรก กลุ่ม A

🔴LIVE กัมพูชา vs ติมอร์-เลสเต | ฟุตบอล ASEAN Mitsubishi Electric Cup™ 2024 | รอบแรก กลุ่ม A

How Strong Is Tape?

How Strong Is Tape?

Microservices are Technical Debt

Microservices are Technical Debt

How might LLMs store facts | DL7

How might LLMs store facts | DL7

OpenAI's Realtime API Upgrades Siri with Cursor AI Integration

OpenAI's Realtime API Upgrades Siri with Cursor AI Integration

OpenAI Realtime API - The NEW ERA of Speech to Speech? - TESTED

OpenAI Realtime API - The NEW ERA of Speech to Speech? - TESTED

Claude has taken control of my computer...

Claude has taken control of my computer...

Moshi voice agent is finally open-sourced | *WEIRD but HUGE POTENTIAL*

Moshi voice agent is finally open-sourced | *WEIRD but HUGE POTENTIAL*

Generative Model That Won 2024 Nobel Prize

Generative Model That Won 2024 Nobel Prize

Build an AI Voice app in 15 min without coding. Step by Step Tutorial. Bubble OpenAI Realtime API

Build an AI Voice app in 15 min without coding. Step by Step Tutorial. Bubble OpenAI Realtime API

Run ALL Your AI Locally in Minutes (LLMs, RAG, and more)

Run ALL Your AI Locally in Minutes (LLMs, RAG, and more)

ถ้าม้าโดนแกล้งที่โรงเรียน ม้าจะฟ้องครูว่าอะไร #แต้มเซน #การ์ตูน #tamzen #ตลก #shortvideo #การ์ตูน

ถ้าม้าโดนแกล้งที่โรงเรียน ม้าจะฟ้องครูว่าอะไร #แต้มเซน #การ์ตูน #tamzen #ตลก #shortvideo #การ์ตูน

ช้างศึกโดนก่อน ไล่ยิงคืนสิงคโปร์ ทะลุน็อคเอาท์

ช้างศึกโดนก่อน ไล่ยิงคืนสิงคโปร์ ทะลุน็อคเอาท์

กินขนมมั้ยจ้ะน้อง หนมน้า😝

กินขนมมั้ยจ้ะน้อง หนมน้า😝

🔴 LIVE : ถ่ายทอดสด การออกรางวัลสลากกินแบ่งรัฐบาล งวดวันที่ 16 ธันวาคม 2567

🔴 LIVE : ถ่ายทอดสด การออกรางวัลสลากกินแบ่งรัฐบาล งวดวันที่ 16 ธันวาคม 2567

#อึ้ง!เหลือจะเชื่อ!ไทยพลิกนรกดับสิงคโปร์คาบ้าน ทะลุเข้ารอบรองชนะเลิศ! คารวะอิชิอิโคตรการเปลี่ยนแปลง!

#อึ้ง!เหลือจะเชื่อ!ไทยพลิกนรกดับสิงคโปร์คาบ้าน ทะลุเข้ารอบรองชนะเลิศ! คารวะอิชิอิโคตรการเปลี่ยนแปลง!

หัวหน้าแก๊งพาลูกสาวไปกินไก่ทอด เจอกลุ่มนักเลงหาเรื่อง เลยจัดการพวกนั้นจนพ่ายแพ้

หัวหน้าแก๊งพาลูกสาวไปกินไก่ทอด เจอกลุ่มนักเลงหาเรื่อง เลยจัดการพวกนั้นจนพ่ายแพ้

🔴LIVE โหนกระแส บาร์โฮสสะเทือน!!! "สุนิสา" อาละวาดไล่หลอกเงิน

🔴LIVE โหนกระแส บาร์โฮสสะเทือน!!! "สุนิสา" อาละวาดไล่หลอกเงิน

ทัวร์สตรีมเมอร์ ROV รอบชิงชนะเลิศ | ชิงเงินรางวัลรวม 25,000 บาท

ทัวร์สตรีมเมอร์ ROV รอบชิงชนะเลิศ | ชิงเงินรางวัลรวม 25,000 บาท