Could you maybe explain what benefit this offers over F5-TTS? The quality doesn't seem to be better but the VRAM requirement is double or more than that of F5.
Only recordings. I didn't see a way of feeding in a stream real-time with the code they have right now, though it may be theoretically possible because it takes less time to run the conversion than the duration of the clip (on my machine anyway).
You deserve a bigger following - Love these videos!
pinokio install?
Great idea. I just published github.com/chameleon-ai/vevo-pinokio
@ChameleonAI thanks boss 🙏
it's confusing detecting u'r voice(does it exits !?), place a beep or other to know that's narrative
You mean my "real" voice? I always use a voice changer, so in a sense my real voice doesn't exist.
Could you maybe explain what benefit this offers over F5-TTS? The quality doesn't seem to be better but the VRAM requirement is double or more than that of F5.
There is a TTS option but this is primarily voice-to-voice, so it's a different application. Unless I missed something, F5 doesn't have an sts mode.
@@ChameleonAI Oh, sorry. I thought it was TTS. It makes sense now. Thanks
Nice, is this real time as well or only recordings?
Only recordings. I didn't see a way of feeding in a stream real-time with the code they have right now, though it may be theoretically possible because it takes less time to run the conversion than the duration of the clip (on my machine anyway).
that american guy is missing an alveolar flap in "water" americans say wadder not waTer
Probably over-annunciating because he was doing a reading. I do that myself sometimes.
sent a discord dm 😶🌫