Build your own real-time voice command recognition model with TensorFlow

แชร์
ฝัง
  • เผยแพร่เมื่อ 16 ก.ย. 2024

ความคิดเห็น • 32

  • @cornpop3340
    @cornpop3340 ปีที่แล้ว +3

    This is an incredibly helpful video.

  • @donahue1187
    @donahue1187 ปีที่แล้ว +6

    This is fantastic. I’m a Newbie to Python and neural nets, but your explanations are great and pretty straightforward. Question - what additional steps would I take to run this on my own local device (pi 4)? And what else would I need to do to introduce new commands such as as trigger word and “turn off the lights”? Would I need to create my own audio samples, save them to new folders, and retrain to retrain the model? Thanks for any guidance! (if you couldn’t tell I’m DONE w Google Home latency, recreating my own. Ambitious! Need help!)

  • @oxydol3456
    @oxydol3456 6 หลายเดือนก่อน

    This tutorial is great. I find that the key to build accurate model is gathering quality data a lot. And that sounds arduous work. didn't get good result with 200 examples.
    Edit: I found the model's accuracy is the way poor than I expected. Maybe it's due to the microphone I'm using and it's needed to taken care of before predicting process.

  • @gokhanersoz5239
    @gokhanersoz5239 2 ปีที่แล้ว +3

    Thank you very much for the tranings. But I think there should be a more complex and more advanced voice recognition, voice classification and similar training series if you see fit. You know, trainings on sound are limited.

  • @nguyent3465
    @nguyent3465 ปีที่แล้ว +8

    The code on TensorFlow website was changed :(

  • @Cyka_Blyatus
    @Cyka_Blyatus ปีที่แล้ว +2

    What did you do so the program does not picks up ambient noise or actually works with the commands given? it seems the model lacks ambient noise data sets and whenever ran it only keeps spamming the first command, but yours works perfectly, how to achieve this?

  • @seanadin386
    @seanadin386 ปีที่แล้ว +5

    Can you do a video regarding the newer version? The run interface now has a different code

    • @tvartalk
      @tvartalk 4 หลายเดือนก่อน

      😊

  • @erickd4816
    @erickd4816 ปีที่แล้ว +1

    Good video, excellent explanation, I have a question, can the same program be trained to recognize only a specific voice? if so, could you explain it to me? I would be very grateful.

  • @geekyprogrammer4831
    @geekyprogrammer4831 2 ปีที่แล้ว +2

    Can you please post building text to speech models from scratch?

  • @clumsycoder1907
    @clumsycoder1907 ปีที่แล้ว +3

    its not working for me

  • @TheSaukkio
    @TheSaukkio 8 หลายเดือนก่อน

    How can it be that in the video it gives nothing with out speaking. While if i run the code from github it predicts random stuff when im not speaking.

  • @MrIlvis
    @MrIlvis 9 หลายเดือนก่อน

    On which Tensorflow version this was made? because Colab uses latest, but older one should work without problems.

  • @divyakhetan8754
    @divyakhetan8754 8 หลายเดือนก่อน

    Is it a customised model (designed for a single person) or it can work on anyone's command

  • @danielbogemann1598
    @danielbogemann1598 ปีที่แล้ว +1

    They changed the Code. Could u you do a quick update?

  • @swasthikk3655
    @swasthikk3655 7 หลายเดือนก่อน

    Can i get similar for English alphabets

  • @loydvincentbutron4345
    @loydvincentbutron4345 6 หลายเดือนก่อน

    is it for english voice only?

  • @tankado_ndakota
    @tankado_ndakota 3 หลายเดือนก่อน

    Got the error: "Could not import the PyAudio C module 'pyaudio._portaudio'."
    And couldn't find the solution...
    Macbook M1 Pro

    • @tankado_ndakota
      @tankado_ndakota 3 หลายเดือนก่อน +1

      I saw a note in other video for M1 :) let me try first :D

    • @tankado_ndakota
      @tankado_ndakota 3 หลายเดือนก่อน

      i did everything that I found from web. but still i got the error: "symbol not found in flat namespace '_PaMacCore_SetupChannelMap'"

  • @itsrairamones
    @itsrairamones 2 ปีที่แล้ว

    thankyou dude its a hundred percent work for me but after couple minutes it crashed :(

  • @sanjeetjha9177
    @sanjeetjha9177 8 หลายเดือนก่อน

    Please provide me the model i need argently I am stuck in it

  • @rediet.f261
    @rediet.f261 ปีที่แล้ว

    what is sample_file in here 8:38

    • @clumsycoder1907
      @clumsycoder1907 ปีที่แล้ว

      same doubt

    • @arqamrafay
      @arqamrafay ปีที่แล้ว

      exactly, i think their is file of recorded audio

    • @LukasKofler
      @LukasKofler 10 หลายเดือนก่อน

      See the first line at 5:38 🙂

  • @Yvtq8K3n
    @Yvtq8K3n ปีที่แล้ว +1

    Its a shame, you cant train your own model.

    • @threepe0
      @threepe0 ปีที่แล้ว +1

      of course you can

    • @Yvtq8K3n
      @Yvtq8K3n ปีที่แล้ว

      @@threepe0 The last time I used this, you were unable to create a custom model and use it.
      Tensorflow provided you with an already trained model (0-1, left, right) and thats exactly what most people use.