F5-TTS & E2 TTS Google Colab Tutorial

Neural Falcon

มุมมอง 8 827

เพิ่มลงใน
- เพลย์ลิสต์ของฉัน
- ดูภายหลัง
แชร์

แชร์

ฝัง

ขนาดวิดีโอ:

แสดงแผงควบคุมโปรแกรมเล่น

เล่นอัตโนมัติ

เล่นใหม่

เผยแพร่เมื่อ 14 ม.ค. 2025

ความคิดเห็น • 106

@AbdullahJahangirr หลายเดือนก่อน ⁺¹
Happy 1k subs
@neuralfalcon หลายเดือนก่อน
Thank you bro
@angelochu3156 3 หลายเดือนก่อน ⁺⁵
I watched many videos about F5-TTS on youtube. You are the only one who can clearly compare the original sound and clone sound in a clear manner to the watcher. Keep up the good work!
@neuralfalcon 3 หลายเดือนก่อน
Glad I could help!
@dzrook 3 วันที่ผ่านมา
thank you thank you thank youuu.
Neural Falcon you saved me a lot of time THANK YOU SO MUCH
@neuralfalcon 3 วันที่ผ่านมา
Glad it helped!
@mekkicharfi5454 2 หลายเดือนก่อน ⁺¹
Thank you very much and especially for your patience
@QHawk7 2 หลายเดือนก่อน ⁺¹
*Great Video , thanks, Try dubbing a short documentary and import a deep voice, let's see what we can do with all available AI tools & colabs at this moment*
@MR.VAN1979 2 หลายเดือนก่อน ⁺²
Your videos bring a lot of value to the community and are worthy of 1 subscription, 1 like, and 1 comment. I wish you good health and make many valuable videos for everyone to learn and follow.
@dkerdnase 3 หลายเดือนก่อน ⁺¹
Thank you so much man! You're awesome!
@Dex383-d8d 2 หลายเดือนก่อน ⁺¹
I have already tried everything in the video and it is indeed very easy to use, the AI has its problems but I guess it will improve over time, the part of cloning the voices works 100 out of 10, I managed to confuse a friend with his own voice speaking in another language which was very funny.
Thank you very much for the video and for taking the time to respond to my first comment
@xenn2996 3 หลายเดือนก่อน ⁺¹
thanks for the tutorial
@neuralfalcon 3 หลายเดือนก่อน
Happy to help
@syntaxstreets 3 หลายเดือนก่อน
2nd audio and first model super
@lsgzmc5806 2 หลายเดือนก่อน ⁺¹
Pls make a video on how to use multi-speech option of this model, I'm having troubles using it
@neuralfalcon 2 หลายเดือนก่อน
11:18 watch this video th-cam.com/video/6i0cXSvyz98/w-d-xo.htmlsi=IZ8FKfAD7l0sqmgV
@neuralfalcon 2 หลายเดือนก่อน ⁺¹
Use the format {emotion_name} your_text.
For example:
If the emotion is "happy": {happy} I won a prize.
For multiple emotions: {happy} I'm happy. {angry} I'm angry. {sad} I'm sad.
There’s no set order. Just indicate the needed emotion in curly braces before each sentence, like {emotion} your_text.
Make sure you label those reference audio files the same as your emotion_name.
@lsgzmc5806 2 หลายเดือนก่อน ⁺¹
@neuralfalcon thx for helping me out
@411KJB 2 หลายเดือนก่อน ⁺¹
Excellent!
@gg69155 12 วันที่ผ่านมา
Can you make a colab version for Fish Speech tts?
@neuralfalcon 12 วันที่ผ่านมา ⁺¹
Someone already did it. Try this 😀
colab.research.google.com/drive/1trBvrdgyI-Ntd45ZnlT5lhGsI_HnKjC1?usp=sharing
@gg69155 11 วันที่ผ่านมา
@@neuralfalconI tried that one, but I can't seem to make it work. Could you...make a tutorial on how to run it? 🥺🙏
@neuralfalcon 11 วันที่ผ่านมา ⁺¹
@@gg69155 i will try
@gg69155 11 วันที่ผ่านมา
@@neuralfalconthank you 😊
@Jerometk หลายเดือนก่อน
Do you have the same but for lipsyn? Something on google collab or similar? I want to lipsync audio and a video, not an image.
@neuralfalcon หลายเดือนก่อน
Yes, we have Wav2Lip.
github.com/Rudrabha/Wav2Lip
My google Colab link:
github.com/NeuralFalconYT/wav2lip
@harshvaghanii 2 หลายเดือนก่อน
I've got an error in second step saying -> name 'base_path' is not defined
@neuralfalcon 2 หลายเดือนก่อน
Because, you forgot to run the cell above, where base_path = "/content".
Run the cell above first, then run the next one afterward.
@pneuma23093 2 หลายเดือนก่อน ⁺¹
2:57 That's Dva right?
@neuralfalcon 2 หลายเดือนก่อน ⁺¹
Yes
@Carlon15 2 หลายเดือนก่อน
Can you make a video about how to train your model in a different language, please?
@neuralfalcon 2 หลายเดือนก่อน
github.com/SWivid/F5-TTS/discussions/143
th-cam.com/video/RQXHKO5F9hg/w-d-xo.html
@neuralfalcon 2 หลายเดือนก่อน
Watch this video: th-cam.com/video/GmketyZW2c4/w-d-xo.html
@neuralfalcon หลายเดือนก่อน
Watch this video : th-cam.com/video/UO4usaOojys/w-d-xo.html
@vodkalikpatates หลายเดือนก่อน
Thank you for the video! It's really helpful! 🙌How can i use it with another model? like, i want to try with "F5-TTS-Turkish". how can i add it properly
@neuralfalcon หลายเดือนก่อน
Search on Google to find out if someone has trained an F5TTS model for the Turkish language or train your own model.
To learn how to train in different languages watch this video:
th-cam.com/video/UO4usaOojys/w-d-xo.htmlsi=uzMKfs6sdDloKU9a
@vodkalikpatates หลายเดือนก่อน
@@neuralfalcon There actually is a Turkish language model. I meant to ask how can I use that with your code, since it doesn't have custom model option in ui
@QHawk7 2 หลายเดือนก่อน ⁺¹
Can I get this to work on kaggle?
@neuralfalcon 2 หลายเดือนก่อน ⁺¹
Yes
@QHawk7 2 หลายเดือนก่อน
@@neuralfalcon
How?
@neuralfalcon 2 หลายเดือนก่อน
@@QHawk7
github.com/NeuralFalconYT/F5-TTS-Demo/blob/main/F5_TTS_Latest.ipynb
You may need to run:
!sudo apt install ffmpeg
Ensure you are connected to a GPU runtime.
You may also need to install torch if PyTorch is not pre-installed on Kaggle by default.
github.com/SWivid/F5-TTS
@snakezo4218 2 หลายเดือนก่อน
is there a way to speak with our voice and make a transfer to this voice to reproduce the emotions of tones you know
let's imagine that I play the game of an angry person can the cloned voice reproduce this angry voice ?
@neuralfalcon 2 หลายเดือนก่อน
Easy, Record a short, 15-second audio clip where you speak in a specific tone, like angry, sad, or happy. Use this audio as a reference in F5 TTS, and the output voice will match your chosen emotion, such as anger.
@EphemeralInferno หลายเดือนก่อน
When I do it, it says
"No module named onx"
@neuralfalcon หลายเดือนก่อน
yeap, new bug
@hiepinh5599 หลายเดือนก่อน
can i training with own voice, for example: optimus voice..
@neuralfalcon หลายเดือนก่อน
Yes 100%, copy this notebook and use F5-TTS colab.research.google.com/github/NeuralFalconYT/F5-TTS-Demo/blob/main/F5_TTS_Latest.ipynb
@hiepinh5599 หลายเดือนก่อน
I checked your collab, and it doesn't work
@neuralfalcon หลายเดือนก่อน
It's working
@hiepinh5599 หลายเดือนก่อน
@@neuralfalcon thank you, it worked. Now I have a checkpoint file trained through TTS-F5 but I don't know where to inference through, can you help me, I need python script
@411KJB 2 หลายเดือนก่อน
Link no longer works. Any new links?
@neuralfalcon 2 หลายเดือนก่อน
colab.research.google.com/github/NeuralFalconYT/F5-TTS-Demo/blob/main/F5_TTS_Latest.ipynb
Or follow official instructions:
github.com/SWivid/F5-TTS
@411KJB 2 หลายเดือนก่อน
It was PERFECT for that window though and I thank you so much.
@Deewayne94 หลายเดือนก่อน
Hello, can i also clone a voice in french?😊
@neuralfalcon หลายเดือนก่อน
Yes, but you either need to train the model in French yourself or wait for someone else to do it.
The best option right now is to pay for a service like ElevenLabs.io to clone your voice.
@neuralfalcon หลายเดือนก่อน
huggingface.co/RASPIAUDIO/F5-French-MixedSpeakers-reduced
@asfandsherazkhan9135 หลายเดือนก่อน
can we dubbed into other language like from english to hindi
@neuralfalcon หลายเดือนก่อน
It only supports English and Chinese , but you can train it in other languages. Watch this video to learn how:
th-cam.com/video/GmketyZW2c4/w-d-xo.html
@neuralfalcon หลายเดือนก่อน ⁺¹
huggingface.co/SPRINGLab/F5-Hindi-24KHz
@neuralfalcon หลายเดือนก่อน
F5-TTS Hindi: th-cam.com/video/Pb3Zx562Juw/w-d-xo.html
@snakezo4218 2 หลายเดือนก่อน
I tried, is it possible to make him speak with a French accent, he still has difficulty or can I speak to the creator to ask him the question?
@neuralfalcon 2 หลายเดือนก่อน
th-cam.com/video/RQXHKO5F9hg/w-d-xo.html
@neuralfalcon หลายเดือนก่อน
Watch this video : th-cam.com/video/UO4usaOojys/w-d-xo.html
@neuralfalcon หลายเดือนก่อน
huggingface.co/RASPIAUDIO/F5-French-MixedSpeakers-reduced
@kanavwastaken 3 หลายเดือนก่อน
Can you please make it work on LightningAI bro?
@neuralfalcon 3 หลายเดือนก่อน
github.com/NeuralFalconYT/F5-TTS-Demo/blob/main/F5-TTS-lightning-ai.ipynb
Download this notebook and upload it to lightning.ai/. Make sure to switch to GPU.
@PratikshaPatil-r9o 3 หลายเดือนก่อน
HEY.. IS THE PROCESS FOR E2 IS SAME?
@neuralfalcon 3 หลายเดือนก่อน
Yes same, just choose E2-TTS button
@deepakmannu1 7 วันที่ผ่านมา
how to install F5-TTS bro
@neuralfalcon 7 วันที่ผ่านมา
pip install git+github.com/SWivid/F5-TTS.git
f5-tts_infer-gradio
@neuralfalcon 7 วันที่ผ่านมา
For English: colab.research.google.com/github/NeuralFalconYT/F5-TTS-Demo/blob/main/F5_TTS_Latest.ipynb#scrollTo=dB3clv5KWtQG
@neuralfalcon 7 วันที่ผ่านมา
For Hindi:
colab.research.google.com/github/NeuralFalconYT/F5-TTS-Demo/blob/main/F5_TTS_Hindi_Small.ipynb
@priyakumari-ky4nn 2 หลายเดือนก่อน
F5 tts Can Support Hindi voice Give Answer ?
@neuralfalcon 2 หลายเดือนก่อน ⁺¹
It only supports English and Chinese , but you can train it in other languages. Watch this video to learn how:
th-cam.com/video/GmketyZW2c4/w-d-xo.html
@priyakumari-ky4nn 2 หลายเดือนก่อน
@@neuralfalcon Please you can make video realistic hindi tts voice
@neuralfalcon หลายเดือนก่อน
Watch this video : th-cam.com/video/UO4usaOojys/w-d-xo.html
@neuralfalcon หลายเดือนก่อน
huggingface.co/SPRINGLab/F5-Hindi-24KHz
@neuralfalcon หลายเดือนก่อน
F5-TTS Hindi: th-cam.com/video/Pb3Zx562Juw/w-d-xo.html
@tuannv9119 9 วันที่ผ่านมา
Làm sao để xóa chúng khỏi máy tính vậy?
@neuralfalcon 9 วันที่ผ่านมา
If you used a virtual environment delete the f5-tts folder and clean windows cache folder.
or, if you used just pip to install f5 tts
pip uninstall F5-TTS
and clean windows cache folder.
@Dex383-d8d 2 หลายเดือนก่อน
Why did the page ask me for permission to use my microphone? Do not enter the pinned link, you will probably be hacked... The video seemed useful but better not risk it
@neuralfalcon 2 หลายเดือนก่อน ⁺¹
Thank you for your comment! It sounds like you might not be familiar with how Gradio applications work. The page requests microphone permission because the app needs to record or upload audio in order to clone it. Our code prioritizes recording audio before launching the app, which is why microphone access is required. If you're interested, you can learn more about this in the Gradio documentation here: www.gradio.app/docs/gradio/audio .
@Dex383-d8d 2 หลายเดือนก่อน
@@neuralfalcon Thank you very much for replying to my comment, I will read the documentation, it is true that I am not familiar with the application
@RostinSino 2 หลายเดือนก่อน
does it work in indonesian language?🙏
@neuralfalcon 2 หลายเดือนก่อน
For now it's a big 'NO' . You need to train on Indonesian language from scratch. You can use elevenlabs but it's paid.
@neuralfalcon 2 หลายเดือนก่อน ⁺¹
Watch this video: th-cam.com/video/GmketyZW2c4/w-d-xo.html
@neuralfalcon หลายเดือนก่อน
Watch this video : th-cam.com/video/UO4usaOojys/w-d-xo.html
@abhishekkumar-bz1ql 2 หลายเดือนก่อน
Will it work with hindi language
@neuralfalcon 2 หลายเดือนก่อน ⁺¹
For now it's a big No, you need to train for other languages From scratch
@abhishekkumar-bz1ql 2 หลายเดือนก่อน
@@neuralfalcon do you know how to train it? Or any reference video of it?
@neuralfalcon 2 หลายเดือนก่อน
@@abhishekkumar-bz1ql github.com/SWivid/F5-TTS/issues/87
@neuralfalcon หลายเดือนก่อน
Watch this video : th-cam.com/video/UO4usaOojys/w-d-xo.html
@neuralfalcon หลายเดือนก่อน
F5-TTS Hindi: th-cam.com/video/Pb3Zx562Juw/w-d-xo.html
@QHawk7 2 หลายเดือนก่อน ⁺¹
*Is it Multi-language?*
@neuralfalcon 2 หลายเดือนก่อน ⁺¹
Only English and Chinese
@neuralfalcon หลายเดือนก่อน
Watch this video : th-cam.com/video/UO4usaOojys/w-d-xo.html
@weini-sf3pu 2 หลายเดือนก่อน
when use Generate TTS, get an error " FileNotFoundError: [Errno 2] No such file or directory: 'nvidia-smi' ", can you help me ?
@neuralfalcon 2 หลายเดือนก่อน
@@weini-sf3pu yes send screenshot at NeuralFalcon@proton.me
@neuralfalcon 2 หลายเดือนก่อน
@@weini-sf3pu first you need a GPU to use 'nvidia-smi' then if you are running in a jupyter notebook '!nvidia-smi'
Or if you are running in terminal just 'nvidia-smi'. Else you can skip this.
Use another way to find the cuda version to install the pytorch .
@Ice_camp 3 หลายเดือนก่อน
uncheck remove silence
@neuralfalcon 3 หลายเดือนก่อน
You can uncheck the silence option, which may create silence in the generated audio .

ต่อไป

เล่นอัตโนมัติ

F5-TTS! They DID IT! Perfect voice clone with Emotion with a 10-second sample!