OpenAI Realtime API: The future of Voice AI?

แชร์
ฝัง
  • เผยแพร่เมื่อ 3 ม.ค. 2025

ความคิดเห็น • 53

  • @patrickzupanc1795
    @patrickzupanc1795 2 หลายเดือนก่อน +1

    Great video, thank you, Jannis!

  • @LucasMarquesAI
    @LucasMarquesAI 3 หลายเดือนก่อน +1

    Great video as always Jannis, let's go 🔥

  • @mikearmstrong-ai
    @mikearmstrong-ai 2 หลายเดือนก่อน

    Very informative, will start jumping in, thanks for the free resources.

  • @BrockMesarich
    @BrockMesarich 3 หลายเดือนก่อน

    Was waiting for you to release this!

  • @HenrykAutomation
    @HenrykAutomation 3 หลายเดือนก่อน

    Love its speed, unmatched by anything else out there right now!

  • @clairedubiel1
    @clairedubiel1 2 หลายเดือนก่อน

    Thanks for the helpful video Jannis!

  • @mohammedzihan7382
    @mohammedzihan7382 3 หลายเดือนก่อน +3

    For Developers, feel voice providers like VAPI wouldn't be required in near future. Directly integrate the OpenAI API, and have components like WebRTC, real time streaming, client server connection mapping, DB connections & data mapping implemented. For handling workflow management, state management, could integrate certain frameworks on top like Langraph.

    • @jannismoore
      @jannismoore  3 หลายเดือนก่อน +2

      Those platforms are already not required anymore, but I believe the realtime API will be the reason they become even more popular. Will share more on that soon.

  • @7_Tom
    @7_Tom 3 หลายเดือนก่อน +4

    Great video as always! Since you are probably in contact with the Vapi team... Can you estimate how long it will take until this is implemented? Thanks.

    • @jannismoore
      @jannismoore  3 หลายเดือนก่อน +2

      I’m not quite sure, but I assume we should see something being released soon.

  • @_arav_patel_
    @_arav_patel_ 2 หลายเดือนก่อน +1

    Great video. I wonder what the future will be like with Voice AI becoming this realistic. How long do you think it will take for Vapi to implement this? (few days, weeks, months?)

    • @jannismoore
      @jannismoore  2 หลายเดือนก่อน +1

      I expect weeks max. :)

  • @naryanzaninja7367
    @naryanzaninja7367 2 หลายเดือนก่อน

    What are your plans Jannis? Run the agency long term, or switch completely to saas, or voice ai education, or something else?

    • @jannismoore
      @jannismoore  2 หลายเดือนก่อน

      I haven’t even started with voice AI education.
      Honestly, for now I’m happy helping others build out extremely powerful systems, but the educational route might certainly be interesting one I see the need for it

  • @moatazelkersh6129
    @moatazelkersh6129 2 หลายเดือนก่อน

    What a great video! Thanks so much for doing the work and providing us with the template for free. If you don’t mind me asking, how can I reduce my costs with Twilio and set up an open-source phone system to act as the call gateway? Another thing I was planning to implement WebRTC as it has the functionality to reduce Eco and noise reduction in case someone will call in a loud environment!

    • @jannismoore
      @jannismoore  2 หลายเดือนก่อน

      I think OpenAI handles the noise reduction part by themselves. If you're referring to SIP trunking, you most likely need to see how you can do the connection. Not every platform allows you to add a SIP URL to it, sometimes it's the other way around.
      If you want to try it, use something like Zoiper

  • @thereviewer5562
    @thereviewer5562 3 หลายเดือนก่อน

    You are as always authentic in your opinion. It is exciting thing for someone who is beng introduced to this voice stuff with ai for the first time. What do you thinkis the basic thing a beginner can learn in low code development ? What is the skill that moves the needle?

    • @jannismoore
      @jannismoore  3 หลายเดือนก่อน +1

      Understanding the concept and foundations.
      I think that’s the most important thing.
      Try some of my examples so you have a working solution, and then try to understand how it’s done.
      That’s a great point to start. 👍🏻

    • @thereviewer5562
      @thereviewer5562 2 หลายเดือนก่อน

      @@jannismoore that is good to hear.

  • @radoslav07
    @radoslav07 2 หลายเดือนก่อน

    Can you share your replit link? Thanks

    • @jannismoore
      @jannismoore  2 หลายเดือนก่อน

      I did! It’s in my resource hub which you’ll find in the description

  • @angeloh-u1q
    @angeloh-u1q 3 หลายเดือนก่อน +2

    I'm surprised that vapi isn't on top of this already.

    • @jannismoore
      @jannismoore  3 หลายเดือนก่อน +1

      They are :)

  • @greendsnow
    @greendsnow 3 หลายเดือนก่อน +9

    İt's just way too expensive. Some people payed $3 for 5 minutes, even though the pricing catalogue says it's around 30 cents a minute... Simply unacceptable

    • @alexxandermedeiros
      @alexxandermedeiros 3 หลายเดือนก่อน +4

      Cost will go down soon just like other API costs

    • @jannismoore
      @jannismoore  3 หลายเดือนก่อน +5

      You can achieve the same with Vapi by dropping 50k tokens into your master prompt :)
      Anyways, API costs will definitely come down, so that isn’t a concern in my opinion

    • @dazdazfzf
      @dazdazfzf 2 หลายเดือนก่อน

      ⁠@@jannismooreexactly. Just a way to raise the bar of the value of their product because they cannot already scale.

  • @pjm17
    @pjm17 2 หลายเดือนก่อน

    SO could I build a conversational chat app. Basically give someone a person to talk to as they walk around and chat with? are prices too limiting right now??

    • @jannismoore
      @jannismoore  2 หลายเดือนก่อน +1

      You can do that, but yes, prices are still limiting as of now.
      I do believe that those will come down quite rapidly.

  • @tuaitituaiti1565
    @tuaitituaiti1565 3 หลายเดือนก่อน

    Hey there. Thank you for tge value bombs you are dropping...Heads up the link to the resource seem to be broken...thanks again

    • @jannismoore
      @jannismoore  2 หลายเดือนก่อน

      Appreciate it! Both of the links work when opening them. What do you see once you click on them?

  • @8888-u6n
    @8888-u6n 3 หลายเดือนก่อน

    How do we get acces to the code you made? 👍

    • @jannismoore
      @jannismoore  3 หลายเดือนก่อน

      Via my resource hub - the links for that are in the description :)

  • @lakergreat1
    @lakergreat1 2 หลายเดือนก่อน

    could it work with Microsoft Teams Phone? I would like to use it in an IVR setup

    • @jannismoore
      @jannismoore  2 หลายเดือนก่อน

      We haven't tried that yet, but if you have a number, you can most likely make calls to it through a provider like Twilio. There are also other approaches that you might be able to leverage long term, such as daily.co

  • @pauledam2174
    @pauledam2174 3 หลายเดือนก่อน

    Can anyone suggest how this could be used for real-time translation? Actually it doesn't need to be voice to voice just voice to text

    • @jannismoore
      @jannismoore  2 หลายเดือนก่อน +1

      In that case you might just want to look at Deepgram

  • @jamesballantyne9214
    @jamesballantyne9214 3 หลายเดือนก่อน

    This seems as slow as vapi. What advantages does this, will this have, if it’s the same speed without and of the features of vapi?

    • @jannismoore
      @jannismoore  3 หลายเดือนก่อน

      Are you sure you watch your videos on normal playback speed? :D
      I've mentioned some of the benefits in the video. If that's not enough, I'll drop a more detailed one soon.

    • @rarf2142
      @rarf2142 3 หลายเดือนก่อน

      Bro this is not slow at all… You do realise it should sound human and not respond in 0.005 milliseconds? The delay makes it sound human smh

  • @jeelanshahtlyr6076
    @jeelanshahtlyr6076 3 หลายเดือนก่อน

    Jannis is the ONLY way to go when it comes to AI Voice and Automations.

  • @Kevinsmithns
    @Kevinsmithns 3 หลายเดือนก่อน +1

    How can we use it for ai call bots?

    • @jannismoore
      @jannismoore  3 หลายเดือนก่อน

      You can use the custom example I showed for Twilio, or you can give it another couple of days and Vapi will most likely have something available too

  • @shanes.6227
    @shanes.6227 2 หลายเดือนก่อน

    can't wait til this kills customer service phone jobs. calling my wireless carrier for something is often a big trouble, taking hours!

  • @NeuralDev
    @NeuralDev 2 หลายเดือนก่อน

    The cost is way too high, we need open source models

    • @jannismoore
      @jannismoore  2 หลายเดือนก่อน

      I don't think the price will be that high for long

  • @SzamBacsi
    @SzamBacsi 3 หลายเดือนก่อน

    Laughable. It works in English or German, with simple Indo-European languages. But it dies with Hungarian. instantly.

    • @jannismoore
      @jannismoore  3 หลายเดือนก่อน +2

      I can see what causes your disappointment.
      You'll always see major languages being implemented at a faster pace. Honestly, I'm already impressed it properly handles multilingual conversations as smooth as now, as this was already incredibly hard with the orchestration layers we've seen so far.
      We should be happy about those advancements and help them with enough input to make it even better, which on the other hand will also increase your chances of having better results for other languages.

    • @rarf2142
      @rarf2142 3 หลายเดือนก่อน

      @@jannismooreI hope Dutch works already, I really need a Dutch agent. VAPI starts hallucinating on Dutch and speaking half German after a while lol

    • @SzamBacsi
      @SzamBacsi 2 หลายเดือนก่อน

      @@jannismoore Indeed, I am disappointed, as I have experience applying language models in IVR systems since the 2000s, and I understand that implementing a new model in 2024 should not pose a problem. The underlying issue seems to be a lack of concern for anything outside a specific "cultural" circle. In summary, they simply don't care.
      But I do hope I am mistaken.
      I truly appreciate your videos; they bring a refreshing perspective to this emerging area .

  • @gslvqz8812
    @gslvqz8812 2 หลายเดือนก่อน

    You need to change your thumbnail. It looks evil

    • @jannismoore
      @jannismoore  2 หลายเดือนก่อน

      Seems like you clicked on it nevertheless