MaskGCT: Zero-Shot TTS Google Colab Tutorial

  • Published on 14 Jan 2025

Comments • 25

  • @neuralfalcon
    @neuralfalcon  2 months ago

    04:39 I was mistaken here. It actually works by generating audio that matches the duration you specify. For example, if you set it to 10 seconds, it will produce 10 seconds of audio, adjusting the speaking speed as needed based on the length of your input text.
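
To see why the target duration changes the speed, here is a rough arithmetic sketch. It is not part of MaskGCT; the function name `implied_speaking_rate` and the word-count approach are assumptions made purely for illustration. It estimates how fast the voice has to speak to fit a given text into a fixed duration.

```python
def implied_speaking_rate(text: str, target_seconds: float) -> float:
    """Return the words per minute needed to fit `text` into `target_seconds`.

    Hypothetical helper for illustration only; it does not call MaskGCT.
    """
    if target_seconds <= 0:
        raise ValueError("target_seconds must be positive")
    word_count = len(text.split())
    return word_count * 60.0 / target_seconds


# The same 10-second target gives very different speaking rates
# depending on how much text has to fit into it.
short_text = "word " * 15
long_text = "word " * 50
print(implied_speaking_rate(short_text, 10))  # 90.0
print(implied_speaking_rate(long_text, 10))   # 300.0
```

If the implied rate is far from a natural speaking pace, picking a target duration closer to the length of the text will probably give more natural results.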

    • @Rightpath-dr7ys
      @Rightpath-dr7ys 2 months ago

      Please make a video on how to set up XTTS locally on a laptop. I don't have any knowledge of Python. Please make the video, I need it.

    • @neuralfalcon
      @neuralfalcon  2 months ago

      @Rightpath-dr7ys Just download the portable version:
      github.com/daswer123/xtts-webui

  • @beckbeckend7297
    @beckbeckend7297 2 months ago +1

    5:48 is creepy 😁😅

  • @PeterNwawuba
    @PeterNwawuba 2 months ago +1

    I just tested them both, and F5-TTS is way faster. I was able to generate 10 minutes of audio in about 1 minute and 30 seconds. MaskGCT could only generate 15 seconds of audio in 25 seconds.

    • @neuralfalcon
      @neuralfalcon  2 months ago

      Yes, F5-TTS is faster than MaskGCT, but it also depends on which GPU you are using.

  • @bharatk6790
    @bharatk6790 2 days ago

    This is better than F5-TTS.

    • @neuralfalcon
      @neuralfalcon  1 day ago

      Sometimes it works better 😀

  • @Obaitz
    @Obaitz 1 month ago

    I have a question (I love your videos, man!): whenever I write a paragraph of about 500 words, it keeps crashing and giving an error. Do you have a fix for it?

    • @neuralfalcon
      @neuralfalcon  1 month ago +1

      Enable debug mode in Gradio to identify the exact error.
      I will try it myself later and let you know.

    • @neuralfalcon
      @neuralfalcon  1 month ago

      I fixed the bug. Check the GitHub repository and click on the Google Colab link.
      I found an error related to NLTK in Google Colab. Make sure to set 'Long Text = True' in the Gradio app. Still, I suggest using smaller pieces of text and later merging all the cloned-voice clips with any editing software, as there is a chance of hitting a CUDA out-of-memory error with large text. You can try it anyway: I wrote some code to handle the CUDA out-of-memory error, but I have never tested it with large text.
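
The "smaller text, then merge" workflow suggested above can be sketched with the Python standard library alone. This is an illustrative sketch, not the code from the Colab notebook: the helper names `split_into_chunks` and `merge_wav_files`, the 40-word limit, and the assumption that each chunk is saved as a WAV file with the same format are all hypothetical.

```python
import re
import wave


def split_into_chunks(text, max_words=40):
    """Group whole sentences into chunks of at most `max_words` words.

    A single sentence longer than `max_words` is kept as its own chunk.
    """
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks, current, count = [], [], 0
    for sentence in sentences:
        n = len(sentence.split())
        if n == 0:
            continue
        # Start a new chunk when this sentence would exceed the limit.
        if current and count + n > max_words:
            chunks.append(" ".join(current))
            current, count = [], 0
        current.append(sentence)
        count += n
    if current:
        chunks.append(" ".join(current))
    return chunks


def merge_wav_files(paths, out_path):
    """Concatenate WAV files that share channels, sample width, and rate."""
    fmt = None
    frames = []
    for path in paths:
        with wave.open(path, "rb") as wav:
            this_fmt = (wav.getnchannels(), wav.getsampwidth(), wav.getframerate())
            if fmt is None:
                fmt = this_fmt
            elif this_fmt != fmt:
                raise ValueError(f"{path} has a different audio format")
            frames.append(wav.readframes(wav.getnframes()))
    if fmt is None:
        raise ValueError("no input files given")
    with wave.open(out_path, "wb") as out:
        out.setnchannels(fmt[0])
        out.setsampwidth(fmt[1])
        out.setframerate(fmt[2])
        out.writeframes(b"".join(frames))
```

Each chunk returned by `split_into_chunks` would be synthesized separately in the Gradio app and saved as its own file; `merge_wav_files` then joins the clips in order, which keeps each generation small enough to lower the risk of running out of GPU memory.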

  • @diogopereira0323
    @diogopereira0323 2 months ago

    Do you think it's possible to fine-tune it for Portuguese? I wanted another TTS model besides XTTS v2 for my research.

    • @neuralfalcon
      @neuralfalcon  2 months ago +2

      For now it only supports English and Chinese, but they have promised to upload the training code:
      github.com/open-mmlab/Amphion/issues/289
      I think we will be able to train the model for other languages in the future. For now, you need to use XTTS v2 for Portuguese.

    • @neuralfalcon
      @neuralfalcon  1 month ago +2

      Watch this video: th-cam.com/video/UO4usaOojys/w-d-xo.html

    • @diogopereira0323
      @diogopereira0323 1 month ago +1

      @neuralfalcon Thank you!! I'll take a look.

    • @JJ-vp3bd
      @JJ-vp3bd 1 month ago

      @diogopereira0323 Did you take a look? What's the best one right now? I use XTTS for now through Pinokio.

  • @RostinSino
    @RostinSino 2 months ago

    Does this support cloning Indonesian voices? Please answer 🙏

    • @neuralfalcon
      @neuralfalcon  2 months ago

      No, only English and Chinese.

    • @neuralfalcon
      @neuralfalcon  2 months ago

      You can try ElevenLabs, which is paid.

    • @neuralfalcon
      @neuralfalcon  2 months ago

      You could try using Coqui TTS. Someone has trained a model on the Indonesian language, available here: github.com/ZahrizhalAli/indonesian-tts-vits

    • @RostinSino
      @RostinSino 2 months ago

      @neuralfalcon Next time, please make a video on a model that supports all languages, including Indonesian 🙏😥

    • @neuralfalcon
      @neuralfalcon  1 month ago

      Watch this video: th-cam.com/video/UO4usaOojys/w-d-xo.html