OpenSource AI Voice Clone with Emotions & Accents (F5-TTS)

  • Published on 28 Jan 2025

Comments • 13

  • @digiart-cgi-ai-9152 several months ago +3

    Hi, thanks. I would like to see a local install.

    • @skillcurb several months ago +1

      We'll consider adding a local install tutorial to our future videos!

  • @minhazulmahmudriyad1559 several months ago +2

    Can I train it in my native language? Do you have any idea?

    • @ostelaymetaule several months ago

      Yes, you can. On YouTube, a guy did that for Japanese, but you will need some training data.

    • @minhazulmahmudriyad1559 several months ago

      @ostelaymetaule How much computational power do I need for that? And did he give any information about how the dataset should be prepared?
      Can you provide the video link here?

  • @ArulnidhiKarunanidhi several months ago +1

    What would be the use cases? We have been used to pre-built AI voices for so long. If it mimics the user's voice, wouldn't it be weird for the user to hear their own voice? The psychological trade-off has to be considered as well. Take the traditional interaction between a user and an AI voice agent: once the agent responds and the query requirements are met, the user tends to feel satisfied. With this approach, even if the agent gives the correct response, wouldn't it be weird to hear it and accept it? Users rely on an AI agent because of the "trust ability" emotion that the agent knows better. When the "mimicking" model mimics the user's voice, will the user get the same emotion they get from speaking with traditional voice agents? "This is rather a discussion than a directed question!" :)

  • @cr_cryptic several months ago +3

    Doesn’t seem so… natural. What exactly would be the use case for something like this?

    • @ostelaymetaule several months ago

      Seems way better than 80% of the human narrators or dubs I have encountered. I can still sometimes hear the AI trembling noise in the background, but imo it is already useful for, idk, audiobooks.

    • @victornem1926 several months ago +1

      The Hugging Face configuration he used is low quality.
      If you install it locally, download the high-quality model that is available, and configure it to generate a high-quality version, it generates very smooth, human-like voices without robotic trembles.
      P.S. Removing silence gives worse results than not removing it, so I recommend keeping it un-toggled.

  • @justsomeguy8982 several months ago +1

    You sound like a TTS too, dude.

    • @bowonetpreneur894 several months ago

      He's probably a robot. I mean... what makes you so sure you're not a robot? You think you're human? That's exactly how a robot/cyborg thinks!!

  • @cumbiainstrumental several months ago

    That’s not Spanish. I’m a Spanish guy.

  • @centerdevelopments3533 several months ago +1

    It sounds like shit.