Even FASTER Generation on Tortoise by Removing the Diffusion Model

Jarods Journey

มุมมอง 9 125

เพิ่มลงใน
- เพลย์ลิสต์ของฉัน
- ดูภายหลัง
แชร์

แชร์

ฝัง

ขนาดวิดีโอ:

แสดงแผงควบคุมโปรแกรมเล่น

เล่นอัตโนมัติ

เล่นใหม่

เผยแพร่เมื่อ 27 ม.ค. 2025

ความคิดเห็น • 60

@Jarods_Journey ปีที่แล้ว ⁺³
Timestamps:
What was changed? - 0:38
Debugging and getting it running on my system - 2:15
Testing Generation Speed - 9:45
@moenielsen3218 11 หลายเดือนก่อน
The deepspeed = True, parameter doesn't work for some reason (Windows 11).
@sownheard ปีที่แล้ว ⁺¹⁰
3 seconds of generations for
17 seconds audio 😮 makes it possible to have near real time text to voice conversion.
Thats huge.
@GrocksterRox ปีที่แล้ว ⁺¹⁴
That's amazing - for us non-coders, if there's a way to incorporate that into an automatic install and usable via the visual interface, that would be awesome!
@kshitijnakhashi9818 ปีที่แล้ว ⁺²
Awesome work Manmay Nakhashi
@dnero6911 ปีที่แล้ว ⁺⁵
I'm really looking forward to the next version of the Audiobook Maker now ;)
@Because_Reasons ปีที่แล้ว
link?
@jbeen8three ปีที่แล้ว ⁺¹
is there going to be a cleaned-up tutorial on how to make the updates?
@vishalkhombare ปีที่แล้ว ⁺²
Great News!! Wish you could build a code which can run through RVC model superfast
@Mowgi ปีที่แล้ว
Oooh, exciting. That's incredible!
@jazza422 ปีที่แล้ว ⁺²
I hope this gets added to the MRQ repo very soon, can you share your fork or something?
@Jarods_Journey ปีที่แล้ว
This is actually the main tortoise branch, all of the code it there
@jazza422 ปีที่แล้ว
@@Jarods_Journey yea i saw that now, I managed to get it working with mrq-tortoise repo with deepspeed and the results are pretty good, a lot better than vits especially with rvc and fine tuned model
@RobertJene ปีที่แล้ว
I have your tortoise GUI installation tutorial in a doc.
but didn't you also do a separate video on how to use it?
if so, please link it here or tell me the video name so I can search your channel
@_IamVoid_ ปีที่แล้ว ⁺²
Hi, could you make a video about how to use tortoise tts in other languages for example polish?
Because i’ve been using this tts and i must say it’s truly amazing. Keep putting in the work👍
@pedrazzi65 ปีที่แล้ว
That's a great idea!
@tommov2934 ปีที่แล้ว ⁺¹
any news on if and when this will be added to the audio book?
@Jarods_Journey ปีที่แล้ว
Probably 3 weeks out or so, depends on somethings that I'm working on!
@swannschilling474 ปีที่แล้ว
Thanks for showing the whole process, took a while to get all this to work for me too! How would you pass Samples, Iterations and Temperature in the Fast API...or is this not possible?
@Jarods_Journey ปีที่แล้ว ⁺¹
Well, with hifigan, there is no samples or iterations because those are features of the diffusion model. Temperate should still be possible, though, it's buried in the code somewhere.
@swannschilling474 ปีที่แล้ว
@@Jarods_Journey that glados-tts that I used a bit also had a hifigan vocoder, it was super fast! I am just getting started on tortoise, since the latest increase in interference time make it very interesting for real time voice generation!
@MinutoDirecto ปีที่แล้ว
Thanks for sharing.
@sfonetwo ปีที่แล้ว
I know I've got notifications on, but I keep on refreshing this channels home page, in the hope that there might be a new video.🤣 (thanks for you work)
@MarioGuimaraesAC ปีที่แล้ว
I don't know if you have any idea of the scale of your work. I'm in Brazil and following what you've been doing. Your work is amazing. I'm going to start testing your audiobook project, where do you recommend starting? Because I saw that at one point you used the ai voice clone and at another moment you used tortoise-tts or did I miss a point?
@Jarods_Journey ปีที่แล้ว ⁺¹
Appreciate it, and I'm glad my work is being seen from all across the globe :)! This video is me going over tortoise-tts using the main github repo which is from neonbjb. For all of my other videos, I use the AI Voice Cloning repository from mrq.
If you are going to try out the audiobook project, start with the AI Voice Cloning repository by mrq as that is what is compatible with my project.
In this video, I use the main repo because hifigan has not yet been incorporated into the mrq repo.
@jinkouproject ปีที่แล้ว ⁺¹
That is very very interesting. Just wonder how good this will perform for voice cloning? Have you tried it with a fine tuned model as well? I wonder how good it will be able to clone voices if there are no reference files and it purely relies on the fine tuned weights
@Jarods_Journey ปีที่แล้ว
It works well with custom models (I'm using one in the video). I believe you need the reference files for it to work, but haven't tried without them.
@aidanblah9646 ปีที่แล้ว
Thank you for your informative videos. I fallow you because you're clear and to the point. I just did another clone with your guidance. I could really use this update but I'm afraid of breaking my Tortoise. Is this as simple as doing the update or will I have all the issues you had? I don't code so it was confusing, I normally just fallow and do what a video says to do, and cut and paste code. I couldn't tell if you were having issues because you were doing the back a forth test or if the update wasn't good, or if it was just a simple update. I would try to fallow you, but I don't think this was a "how to", because you went so fast. Should I wait to update, considering my coding ignorance?
@Jarods_Journey ปีที่แล้ว
This wasn't a how to unfortunately, and did incorporate this into the authors latest repository, though, he hasn't been active. You might have to wait for a future video I do on it
@erikgoldman ปีที่แล้ว
can you explain how to take a trained voice from the MRQ ai-voice-cloning project and use it here?
@bunnystrasse ปีที่แล้ว
@Jarods_Journey do you know a software to change the voice in a voice recording to another voice? From male to female?
@kanavwastaken ปีที่แล้ว ⁺¹
Do you plan to add that to the Audiobook Maker?
@Jarods_Journey ปีที่แล้ว ⁺⁵
Yup :)
@kanavwastaken ปีที่แล้ว ⁺¹
@@Jarods_JourneyReal Time listening, huh? 👀 RVC is already quite fast, can't wait for it man.
@dnero6911 ปีที่แล้ว
awe! I love you @@Jarods_Journey
@joemartin2228 ปีที่แล้ว ⁺¹
You’re the best Jarod :)
@LakshyaKhanna-t1m ปีที่แล้ว
Hey, where can I refer the code for test.py to test api.py and api_fast.py.
Also, I want to fine tune the model. Which code should I use?
@Jarods_Journey ปีที่แล้ว
You can check out the original repo which is neonbjb's
@PwninMcduff ปีที่แล้ว
Will RVC or UVR fix that distortion or breathy sounds? If not how are you resolving those? That's become my biggest issue and have had to become a quasi sound engineer to editing audio files
@Jarods_Journey ปีที่แล้ว ⁺¹
RVC and UVR both do an OKish job at filtering them out, but if there's too much distortion, RVC can't handle it.
@PwninMcduff ปีที่แล้ว
@@Jarods_Journey appreciate the response, thank you
@antongritsyk3070 ปีที่แล้ว
I don't care about voice cloning or audio quality. How do I use it just to read text? Where do I find actual docs? is that readme really all that's available? bro...
@jakeparker918 ปีที่แล้ว
My guy opened Powershell instead of GIT Bash just to trigger people
@moenielsen3218 11 หลายเดือนก่อน
The deepspeed = True, parameter doesn't work for some reason (Windows 11).
@Murderface666 ปีที่แล้ว
I'm more confused than watching the ending of Inception.
@agenticmark 7 หลายเดือนก่อน
taking out the diffusion model speeds it up but it sounded awful
@sfonetwo ปีที่แล้ว
Amazing, Can this be intergraded into the Web UI?
@Jarods_Journey ปีที่แล้ว ⁺¹
If someone wants to integrate it :)!
@AI-wf9he ปีที่แล้ว
TIL you can get rid of the annoying warning of changes by typing git stash. Hours of work saved. Thank you.
@greenockscatman ปีที่แล้ว ⁺¹
This is a huge speed increase! They're gonna need to come up with a new name, man. Tortoise just isn't accurate any more lol
@speedeespeedboi9527 11 หลายเดือนก่อน
it works but it generates british accent. is there a way to not generate british accent? my sample voice are already not british
@zyxwvutsrqponmlkh ปีที่แล้ว
My uncle works at Nintendo and he got me a 7090 ti ultimate edition
@thecysserofan ปีที่แล้ว
Hi I left a comment on another video of yours tryna install Tensorboard please help
@Jarods_Journey ปีที่แล้ว ⁺¹
You may have to check Google on what your issue is to see if anyone has run into this before
@Because_Reasons ปีที่แล้ว
This is wild wtf...
@saplemyrup ปีที่แล้ว
Cool.
@aceflamez00 ปีที่แล้ว ⁺¹
Wtf!!!

ต่อไป

เล่นอัตโนมัติ

World’s Fastest Talking AI: Deepgram + Groq