Create a Transcription Service with Make.com and Open AI's Whisper API

Philipp Weiser

มุมมอง 3 236

เพิ่มลงใน
- เพลย์ลิสต์ของฉัน
- ดูภายหลัง
แชร์

แชร์

ฝัง

ขนาดวิดีโอ:

แสดงแผงควบคุมโปรแกรมเล่น

เล่นอัตโนมัติ

เล่นใหม่

เผยแพร่เมื่อ 25 ส.ค. 2024

ความคิดเห็น • 28

@automate_all_the_things 7 หลายเดือนก่อน ⁺²
If you found my video helpful, maybe also check out this one by Chais Meyer. In his automation, he creates a mini blog post and a Linkedin post from his transcription: th-cam.com/video/eRJSXiXv8oE/w-d-xo.html
@unlimited_leads 8 หลายเดือนก่อน ⁺³
I was running against a wall because I didn't download the file in the 2nd step. Appreciate your content - works like a charm now.
@automate_all_the_things 8 หลายเดือนก่อน ⁺²
Yes that took me some time to figure out as well 😃Glad you found this useful!
@skin.pharma 7 หลายเดือนก่อน ⁺¹
Thanks for the video, Philipp 😊👍
Based on your more recent video about string splitting, if you use 0CodeKit, you could produce a transcription service more cost effectively by using asynchronous python coding.
OpenAI developed Whisper as open source software, so it can be used from a python library within your own code & without charge.
It is also possible to split audio files larger than 25MB in to smaller pieces & concatenate the individual transcriptions 👍
Viv
@automate_all_the_things 7 หลายเดือนก่อน ⁺¹
Thanks for this great comment, Viv! It's certainly more cost effective to run the Whisper transcription locally. And a python script to do this wouldn't be terribly complicated. With my channel, I am trying to show ways to do this with no code at all, to make it more accessible to more people. I'm constantly impressed what is possible with tools like Make for non-programmers.
@user-pr7zf3nm4j หลายเดือนก่อน
Lieber Philipp - Vielen Dank für den Workflow. Im Video hast du selbst den Hinweis gegeben, dass die Begrenzung für transkribierbare Dateien bei 25 MB liegt. Häufig sind die Dateien jedoch größer. Zeig doch bitte einmal, wie die Einbindung weiterer Module in den Workflow aussieht, der auch größere Dateien zu einer brauchbaren Zusammenfassung bringt. Besten Dank im Voraus!
@saulosoares5636 6 หลายเดือนก่อน ⁺²
Great video!!! Thanks a lot!
@automate_all_the_things 6 หลายเดือนก่อน
Glad you liked it, thanks for your comment!
@chrisder1814 หลายเดือนก่อน
hello could you help me understand what result I could get if I add the jungle scout api for Amazon sellers to my scenario
@ricardotaipe 3 หลายเดือนก่อน
You are amazing. Great tut, perfectly explained!
@automate_all_the_things 3 หลายเดือนก่อน
Thanks! I'm glad you found this helpful!
@In2DeepDaDeejay 2 หลายเดือนก่อน
Hey Philipp! Love your content. Can you make a similar flow using Whisper, taking the transcript info and create headings for the main points and create variables to be captured into a Google Sheet. That would be great!
@chrisder1814 26 วันที่ผ่านมา ⁺¹
hello I had some ideas for automation can you tell me what you think of them
@automate_all_the_things 26 วันที่ผ่านมา
Sure, lets hear them :) You can also reach me via email: philipp@lazy-android.com
@redantegabanan1078 5 หลายเดือนก่อน
Transcription does not have timestamp when using Whisper module, whats the workaround for this?
@AIEntusiast_ 5 หลายเดือนก่อน ⁺¹
very nice, but when running this again with a different file same as first sucessfull test, meaning same format on files it failes saying invalid format ? any suggestions
@automate_all_the_things 5 หลายเดือนก่อน ⁺¹
Hmm, maybe the second file is too big? Whisper can only handle files up to 25 MB in size.
@AIEntusiast_ 5 หลายเดือนก่อน
@@automate_all_the_things it`s smaller and same format, for some reason deleting the first test file and also delete google doc file it worked with a new file
@JeffHarrison1960 6 หลายเดือนก่อน ⁺¹
Phillip, thank you for your tutorials. I find them extremely helpful, as I just getting started with Make and Bubble. Curious if there is a rapid API that could be inserted before the audio file hits whisper to cut it into smaller (25mb ) and then as it comes out of whisper to reassemble so one only has one transcription and one summary per audio file?
@automate_all_the_things 6 หลายเดือนก่อน ⁺²
Thank you, very glad to hear you found this helpful! If the 25MB filesize limit is a problem, you could use the "Cut Audio File" action from 1001fx to create smaller audio files. It's built into Make. If your source audio file is a giant .wav format, you could also try to convert it to a smaller .mp3 file with a CloudConvert "Convert a File" action.
@DrivenSpice 5 หลายเดือนก่อน
@@automate_all_the_things That's super helpful! I had the same questions. I was trying to transcribe a zoom recording and I kept getting the error of file size. Grateful for the content you created. Would you consider showing the process with a third party that will cut the file easily in a make automation? Many of the tutorials are showing transcription with short files.
@DrivenSpice 5 หลายเดือนก่อน
Also, will cutting the file effect the SRT file?
@jimcrow2000 6 หลายเดือนก่อน
hey great video, would it work from a video added to the google drive, or would we need to have the audio extracted first? thanks
@automate_all_the_things 6 หลายเดือนก่อน
The Whisper API can only handle these file formats (mp3, mp4, mpeg, mpga, m4a, wav, and webm) so you'd have to convert a video file, or detach the audio. There is a service called Transloadit integrated with Make. They have a free tier and offer a "Converts a video to WebM" action. Maybe you could use that to convert your video to webm and then send that to Whisper.
@MiniDolarB3 8 หลายเดือนก่อน ⁺¹
Make isn't good for large audio files > 25MB (Whisper limit) and long texts input. No chuncking option
@automate_all_the_things 8 หลายเดือนก่อน ⁺¹
Yes, the 25 MB file size limit of the Whisper API can become a problem. You could add a "Cut Audio" module from 1001fx into the scenario. Or convert a large audio into a smaller file format with a CloudConvert module. Is Zapier or some other option better at handling large audio files?
@MiniDolarB3 8 หลายเดือนก่อน
@@automate_all_the_things Large text is also an issue since the API can't process. Needs to chunk and combine.
@hownowLLC 4 หลายเดือนก่อน
@@automate_all_the_things Respell has a good template for splitting then recombining the audio files for that :)

ต่อไป

เล่นอัตโนมัติ

This Automation Writes Entire Ebooks with One Click!