I've spent 5 days trying to figure this out knowing nothing literally about nothing and by this I mean virtual assistants you can talk to. With the learning curve I will say I wish I watched your video first. Your the best to learn from so far my man nice job.
yo this tutorial is great as I have tried to watch other tutorials as a noob but they talk like I know the stuff but I dont but you explain the stuff to me great
ok so it's a bit simple, the problem is if you stop talking and you have to make sure the text is justified and doesn't end up at the end of a line every time you interrupt yourself
nvm figured it out. for those of you asking about the last step for windows os, go to note pad and type in " C:\Users\[your computer]> " then the first prompt then do the same for the second. save the file as a .txt. it should work from there.
yo! I'm having the same issue you had. I'm new to pycharm so I'm not sure what you mean with notepad. Where can I find it in the UI? Did you mean the terminal? or is there somewhere else I'm supposed to find the 'notepad' window. sorry if this is a very dumb question lol
On the last step I'm hung up. I'm on the latest windows os, Are you running the touch output.txt and tail output on macs equivalent of command prompt? I ran it on mine and came up with " 'touch' is not recognized as an internal or external command, operable program or batch file." where did I go wrong?
Gm, great vid! I am using python 3.12 on Mac(M1) and I’m unable to import pyaudio! Error states, “ Failed to build installable wheels for some pyproject.toml based projects (pyaudio)” I have searched high n low to solve this issue? Any thoughts or direction? Thanks in advance
how do i see the appended msgs in the terminal? it worked and I could open the output txt file and see what i was saying, but I'm not sure how to see it realtime ig (using Windows 10 and the windows subsystem for linux to install kali linux, and then opened bash in cmd). also when i stop the script (using pycharm) it spits back a bunch of errors and I'm not sure why, or at least i think they're errors
I found the python library to be enough personally. Though, I would imagine whisper to be better given that it's made by openAI. I'd expect it to be a better trained AI
This is just working fine for the first run. After the first run the text generated takes about 10 mins to get to the output. How can i fix that thing and make it just as fast as the first run
My language Hmong is not available for voice to text and I have been searching for programs or ways to show me how it's done, but can't find any. Not sure if Python is the solution. Frustrating of not knowing where to start.
I need a thing that does a few steps less than what this does. I only want it to record phonetics, maybe spacing between words, maybe intonation, and maybe recognize when I'm pausing to remember a word.
how do you get terminal? When I clicked on terminal, new shows me and there was PS C:\Users\[my name]>, but as I can see, in ur it's not. Any way how to fix it?
At the top bar of the terminal where it says "problems" click on the drop down arrow next to the "+" in the further right of that bar, then click on "command prompt", you should be able to run the code through there. At least thats what I did and it worked for me.
Hi Coach! MrScgaming28 and I, from the comment section seem to have the same issue, could you re-explain how to do the last step of making that output file and tailing the outputs in a windows OS?
I have an error with the speech_recognition library: r.adjust_for_ambient_noise(source2, duration=0.2) TypeError: Recognizer.adjust_for_ambient_noise() missing 1 required positional argument: 'source' Any ideas?
i am getting trouble with this error: 1 error generated. error: command '/usr/bin/clang' failed with exit code 1 [end of output] note: This error originates from a subprocess, and is likely not a problem with pip. ERROR: Failed building wheel for pyaudio Failed to build pyaudio ERROR: Could not build wheels for pyaudio, which is required to install pyproject.toml-based projects I cant install the two last kind of stuffs you gave us, hope you reply soon, thank you for that nice work sir.
Did this. I have Mac OS so If you're using macOS, you can install the necessary tools using Homebrew. First, make sure you have Homebrew installed, then run: brew install portaudio After installing portaudio, you can try installing PyAudio again using pip3 install pyaudio.
I've spent 5 days trying to figure this out knowing nothing literally about nothing and by this I mean virtual assistants you can talk to. With the learning curve I will say I wish I watched your video first. Your the best to learn from so far my man nice job.
That means a lot man. Thanks so much for the feedback.
I had nooooo idea this was so straightforward! Thank you! I want to create a translator and I think this is the perfect base!
yo this tutorial is great as I have tried to watch other tutorials as a noob but they talk like I know the stuff but I dont but you explain the stuff to me great
this worked so well. i never knew it was that easy until you explained it. Hope to learn more from you thanks
Hey Oscar this was a great tutorial, keep doing the work you are doing and this was very clear and it made perfect sense.
Thanks for the feedback. It means a lot. Going to ramp up the channel with more content in the coming months :)
ok so it's a bit simple, the problem is if you stop talking and you have to make sure the text is justified and doesn't end up at the end of a line every time you interrupt yourself
nvm figured it out. for those of you asking about the last step for windows os, go to note pad and type in " C:\Users\[your computer]> " then the first prompt then do the same for the second. save the file as a .txt. it should work from there.
What second prompt are you talking about? I wrote that in the notepad and yet my program sometimes outright refuses to work.
yo! I'm having the same issue you had. I'm new to pycharm so I'm not sure what you mean with notepad. Where can I find it in the UI? Did you mean the terminal? or is there somewhere else I'm supposed to find the 'notepad' window. sorry if this is a very dumb question lol
Cool stuff! I have struggled so much with language as I’m extremely dyslexic. ChatGPT has opened up the door 🚪 for me! 😊
That's awesome :D So glad ChatGPT is able to help you :)
On the last step I'm hung up. I'm on the latest windows os, Are you running the touch output.txt and tail output on macs equivalent of command prompt? I ran it on mine and came up with " 'touch' is not recognized as an internal or external command, operable program or batch file."
where did I go wrong?
same here bruh
im using windows im not so sure if the problem is the commands "echo. > output.txt and type output.txt | more"
Gm, great vid! I am using python 3.12 on Mac(M1) and I’m unable to import pyaudio! Error states, “ Failed to build installable wheels for some pyproject.toml based projects (pyaudio)” I have searched high n low to solve this issue? Any thoughts or direction? Thanks in advance
Can you tell me how to do the last step of making that output file and tailing the outputs in a windows OS?
did you find a way
In windows it will record your audio & print it to the notepad, but you need to restart your notepad to see it.
@@arvindh13 Hey! I'm also stuck at this step. Can you pls explain like where do I've to put the commands to touch and tail the output?
Thank you for the tutorial. its works now 👋
how do i see the appended msgs in the terminal? it worked and I could open the output txt file and see what i was saying, but I'm not sure how to see it realtime ig (using Windows 10 and the windows subsystem for linux to install kali linux, and then opened bash in cmd). also when i stop the script (using pycharm) it spits back a bunch of errors and I'm not sure why, or at least i think they're errors
I did this by running the tail command in another terminal. Though, you could also change line 43 to be print(text) rather than print("Wrote text")
@@CSCoachwhats the name of the terminal you used?
This is great, thanks for sharing you tips and tricks.
You bet!
@@CSCoach what is the name of the app u used in the video ??
You are awesome! It would be great if you teach us how to send generated text from recognizer into "Text to speech" . thanks for the video!
he already did something like that in which he created a jarvis like program th-cam.com/video/BEw5EFqCCEI/w-d-xo.htmlsi=wEeSWa18kFPppBUF
Thanks for the course, but does anyone know how can i do it by using an audio file and not a voice recognition ?
It keep telling me Zach command not found when I try installing the dependency please help
Hi coach. do you think maybe using openai's whisper will have a more accurate outcome for transcribing speech?
I found the python library to be enough personally. Though, I would imagine whisper to be better given that it's made by openAI. I'd expect it to be a better trained AI
You can use whisper offline. I suggest faster-whisper or whisper.cpp
This is just working fine for the first run. After the first run the text generated takes about 10 mins to get to the output. How can i fix that thing and make it just as fast as the first run
its saying AttributeError: module 'speech_recognition' has no attribute 'Recognizer'
Any git repo link?
My language Hmong is not available for voice to text and I have been searching for programs or ways to show me how it's done, but can't find any. Not sure if Python is the solution. Frustrating of not knowing where to start.
I need a thing that does a few steps less than what this does. I only want it to record phonetics, maybe spacing between words, maybe intonation, and maybe recognize when I'm pausing to remember a word.
one question, this works only for english lenguage? or can be used with other languages?
For me whenever i say something, the terminal shows wrote text but idk where it writes the text(im on windows btw) can anyone plz help
it would be creating a text file with name "output.txt" and writing into it
Thank you
i am doing this on windows os and i cant seem to find where the text is saved.... can someone please help
Could you please post a link to the full file
hey great video! would this work for other languages?
I also hace the same question
how do you get terminal? When I clicked on terminal, new shows me and there was PS C:\Users\[my name]>, but as I can see, in ur it's not. Any way how to fix it?
At the top bar of the terminal where it says "problems" click on the drop down arrow next to the "+" in the further right of that bar, then click on "command prompt", you should be able to run the code through there. At least thats what I did and it worked for me.
Think of integrating it with chatgpt and getting answers for it just by using the text which we have received from speech to text
Haha, did you see this video I made :) I believe I do what you suggested in the comment.
th-cam.com/video/BEw5EFqCCEI/w-d-xo.html
what program did you use to do this?
Did it in python :)
He used "VS Code" and the language used for the program is Python.
@@Bartdu59Gamingwhats the name of the terminal he used?
Hi Coach! MrScgaming28 and I, from the comment section seem to have the same issue, could you re-explain how to do the last step of making that output file and tailing the outputs in a windows OS?
Create a text document & copy the source --> paste it in the program.
@@arvindh13 can you explain this? program?
hi it's only desktop solution ?
How can we add languages to it?
You need pre designed language libraries or you need to make a new one with a lot of data.
I still don’t understand how to download the libraries, I have a Mac m2 and documentation u gave didn’t help me. So can u or anyone else help me?
You have to type in the terminal "pip3 install ...."
I am going to work on adding a "trigger" word that will let the program know I want to act.
That's cool!! If you get that to work, I'd love to know how
Hey did you end up figuring this out?
How to stop recording the voice?
Hi oscar does it works for french
Does anyone know how to out it in a user-friendly app so anyone can use it easily?
if i want to do only speech to text, then I don't need to install pyaudio, right??
Yes, you need Pyaudio & Speech recognition fork library
cant get the code
Could this also work with a other Language?
Programming language? Or the text is outputted to a different language? Either way I'm pretty sure the answer would be yes :)
You did Not show much also where is the Git repository link. Smartie
why import pyttsx3? it is never used?
It's not required yes.
what app he used in this video??
Visual Studio Code
where are u writing the code
vscode
A real king
i couldnt import pyttsx3 it says unused and alot of error there 😂
you need to install it, it's not an inbuilt library.
thanks
I have an error with the speech_recognition library:
r.adjust_for_ambient_noise(source2, duration=0.2)
TypeError: Recognizer.adjust_for_ambient_noise() missing 1 required positional argument: 'source'
Any ideas?
You need to provie the source before that, please check the program again and if it does not work, install Speech recognition fork library.
got this error, fixed it by adding() to Recognizer
Does this work offline ?
^
i am getting trouble with this error:
1 error generated.
error: command '/usr/bin/clang' failed with exit code 1
[end of output]
note: This error originates from a subprocess, and is likely not a problem with pip.
ERROR: Failed building wheel for pyaudio
Failed to build pyaudio
ERROR: Could not build wheels for pyaudio, which is required to install pyproject.toml-based projects
I cant install the two last kind of stuffs you gave us, hope you reply soon, thank you for that nice work sir.
Getting same error. Hope someone can help
Did this. I have Mac OS so
If you're using macOS, you can install the necessary tools using Homebrew. First, make sure you have Homebrew installed, then run:
brew install portaudio
After installing portaudio, you can try installing PyAudio again using pip3 install pyaudio.
had this issue on widows fixed it by running as admin
Your screen not very clear
my name is oscar too haha
source code
I continue your work ! thank you! for your video was essential for my Idea. th-cam.com/video/W6JcI9Qw8aU/w-d-xo.html