Transcribe Audio Files in your Google Drive with OpenAI's Whisper
ฝัง
- เผยแพร่เมื่อ 14 ต.ค. 2022
- In this tutorial I show how to use OpenAI's Whisper automatic speech recognition model, Google Drive and Colab to transcribe all the audio files in a Google Drive folder for free.
Python notebooks for OpenAI's Whisper: github.com/AndrewMayneProject...
More about OpenAI's Whisper ASR model: openai.com/blog/whisper/
OpenAI's Whisper GitHub: github.com/openai/whisper
Thanks a ton, this is a wonderfully simple tutorial. When I looked up how to use OpenAI Whisper before, the instructions usually involved setting up a docker server and/or required a very powerful GPU.
Thank you Andrew! Very clear and works perfectly!
Thank you, Andrew! This was very helpful! You rock!
Thank you Andrew. I have been looking for this super simple solution
This is brilliant, thank you. Not going to lie, I didn't realise I should open the Python Notebook link and then open in Colaborate as this is a very new world to me, but once I finally figured it out and let the system do it's thing, it has been a game changer. I record audio notes on my watch (Samsung Classic Watch 6), share them to my Google Drive into the WhisperAudio folder, and your magic code does the rest. You rock!
Short slamming stuff. Thanks for getting straight to it with no nonsense.
Awesome, awesome video. Exactly what I was looking for. Thank you!!
Thanks so much for this! Running my first file now.
Thank you Andrew, a life saver!
This is great! Many thanks!
❤Great video! So many similar videos everywhere these days, but you were the one of the fists and easiest to follow. Thank you! I was looking for this! By the way, I'm trying to add "Summarize" function after this transcribing step. I'd appreciate it if you could give me (us viewers) some idea?💙
This is amazing!
What a great video! Thank you for all of your hard work. I was wondering if it is possible to set up some kind of trigger, maybe a time trigger, to run this script periodically, let's say every couple of hours.
Lifesaver, love you g
Hey Andrew, thank you for that wonderfull piece 🙂
Nice video sir! Will it work in other languages as well?
Really cool, it is possible to add speakers identification (diarization) in this model?
Hi, thiswas super helpful. However I had a query. I followed your process to begin transcribing my interviews. These are a mix of English and Hindi. As I ran the transcription, I realised that It was taking really long. A 45 min interview took more than 90 min. is this normal or is there a way I can improve the speed
Hey andrew, is there any way to use the new Whisperai using the API key, to transcribe from my google drive?
Thanks for your useful tutorial. The hardware accelerator list only includes CPU, T4, TPU, A100 GPU, and V100 GPU. How can I make use of M1 Max? Thank you.