Bro, listen. I just jumped on the Ai space and learning, doing my thing you know. YOU ARE THE BEST CREATOR i have seen in this space, I am here before the 100k sub bro hear me out
I've just noticed in the demo the agent booked 9am instead of 5pm 😅 I had a look at the cause of this and did some more testing- Basically the Realtime API's TTS model can only be set to the whisper model currently. Which is not the best model at transcribing audio, so the agent must've read in 9 instead of 5 🤦♂ Some other notable issues with Realtime API: 1) The reasoning and decision making isn't the best either - even with good prompt eng, it is not following instructions and calling the tools correctly. 2) It is super expensive to run! but hopefully Like all the other openai models it would become alot cheaper over time.
Pretty amazing stuff. We started with Whisper, switched to Deepgram and ended with Assembly as it was the only one that did well with accents in my limited experience.
What's the over all costing of this set up, including api and other charges coz that the key, today , it a not a surprise or shock of what an AI and low code platforms can do. It's all about the final costing , This is the same reason that great invention never see the day light of success. So if possible, please do mention that all well in future video.
Hi Ahmed, great video! One question though. Is there any way to protect the system from the "timewasters" I mean people who just want to troll it and spend a lot of time on the phone wasting money?
its possible to prompt the Agent to professionally direct the conversation back if it goes off-course. However it's hard to mitigate against trolls that might just play along and pretend to want to schedule a call and keep going around in loops to just waste time. Another way is you can limit the call duration so it will automatically hang up after it reaches the limit.
@@gjsxnobody7534 oh if you mean just not using realtime api then yes, it would be possible to build it from scratch using deepgram groq and a voice model like 11labs. I'll try and see if I can build it
Hello, how are you? My name is Lucas and I'm from Brazil! We don't use Twilio here. Is it possible to adapt this agent to a WhatsApp agent that responds in text? And then save the information in the sheet and schedule it? I discovered your channel today, keep up the great work!
📚 Find all the Resources in my Skool community: linktw.in/QcndVy
Bro, listen. I just jumped on the Ai space and learning, doing my thing you know. YOU ARE THE BEST CREATOR i have seen in this space, I am here before the 100k sub bro hear me out
Thank you bro, I really appreciate it. Keep pushing and learning and I'll see you at the top!
Another banger. That overview chart giving me goosebumps.
My guy 👊🏽
I've just noticed in the demo the agent booked 9am instead of 5pm 😅 I had a look at the cause of this and did some more testing- Basically the Realtime API's TTS model can only be set to the whisper model currently. Which is not the best model at transcribing audio, so the agent must've read in 9 instead of 5 🤦♂
Some other notable issues with Realtime API:
1) The reasoning and decision making isn't the best either - even with good prompt eng, it is not following instructions and calling the tools correctly.
2) It is super expensive to run! but hopefully Like all the other openai models it would become alot cheaper over time.
Pretty amazing stuff. We started with Whisper, switched to Deepgram and ended with Assembly as it was the only one that did well with accents in my limited experience.
@@BennuBirdPred That's useful to know, I haven't came across assembly but will defo check it out. Thanks
What's the over all costing of this set up, including api and other charges coz that the key, today , it a not a surprise or shock of what an AI and low code platforms can do. It's all about the final costing , This is the same reason that great invention never see the day light of success. So if possible, please do mention that all well in future video.
Great Ahmed .. waiting for the next
The ending was funny, your shushing as if it’s someone lmao
great work! but still not clear why you don't do all of this in n8n? It seems perfectly capable of handling this.
I think it's because of the websocket protocol, which is needed for the openai realtime api.
Yes exactly, n8n only supports webhooks (request / response) so we aren’t able to have open live connections (websocket) for realtime communication
Hi Ahmed, great video! One question though. Is there any way to protect the system from the "timewasters" I mean people who just want to troll it and spend a lot of time on the phone wasting money?
its possible to prompt the Agent to professionally direct the conversation back if it goes off-course. However it's hard to mitigate against trolls that might just play along and pretend to want to schedule a call and keep going around in loops to just waste time. Another way is you can limit the call duration so it will automatically hang up after it reaches the limit.
can you show this with Deepgram? seems Open Ai Whisper is quite slow.
I believe currently openai only support whisper model for realtime API
@ correct. That’s why Groq and Deepgram would be faster. I just don’t know how.
@@gjsxnobody7534 oh if you mean just not using realtime api then yes, it would be possible to build it from scratch using deepgram groq and a voice model like 11labs. I'll try and see if I can build it
excellent explanation, thx
Hello, how are you? My name is Lucas and I'm from Brazil! We don't use Twilio here. Is it possible to adapt this agent to a WhatsApp agent that responds in text? And then save the information in the sheet and schedule it? I discovered your channel today, keep up the great work!
Hey man, yes that's possible with N8N too. Actually, my next video shows this, so keep an eye out in next couple days
Is there any alternative for realtime api?
Yeah you could use VAPI
I'd love to be able to get this working and get in your skool, but USD is very expensive for me, as I live in Brazil. Sad, but true.
I know, I wish skool did regional pricing. I just checked and it doesn’t seem to be the case unfortunately.
hey, i am a none coder, is it worth it for me to register to your community, or should i need coding knowledge? thanks
Hey, no coding experience needed. My aim is to help non technical people from 0 to building these kinda systems!
Can you provide code of this video ?
you'll find all the code and templates inside the skool community: www.skool.com/ai-business-accelerator/about
Phenomenal