EchoMimic Magic: Audio and Landmarks Bring Portraits to Life!

แชร์
ฝัง
  • เผยแพร่เมื่อ 21 ส.ค. 2024
  • Readme / Instructions
    drive.google.c...
    #AI #StableDiffusion #TechInnovation #ArtificialIntelligence #DeepLearning #AIExploration #TechEnthusiast #CreativityInAI #StableAIHub #AICommunity #InnovationHub #TechBreakthroughs #AIResearch #futuristictechnology #EchoMimic #audiotovideo #lipsync

ความคิดเห็น • 56

  • @StableAIHub
    @StableAIHub  หลายเดือนก่อน

    BAT file for launching
    @echo off
    REM Change to the directory of the batch file
    cd /d "%~dp0"
    REM Activate the EchoMimic environment
    call conda activate echomimic
    REM Launch WebUI
    python webgui.py --server_port=3000

    • @IdgrafixCh
      @IdgrafixCh หลายเดือนก่อน +1

      Hi there, I tried ton install it for ComfyUI, but could not do so successfully (via Manager + copy paste all missing files from the repo). Could you please make a tutorial for ComfyUI please? 😊

    • @StableAIHub
      @StableAIHub  หลายเดือนก่อน

      @@IdgrafixCh I am sorry, Comfy is not my cup of tea.
      It is a standalone install why use Comfyi. Comfy will occupy VRAM for it's own use and then this tool. Standalone means more VRAM available. That's my understanding.

  • @ChikadorangFrog
    @ChikadorangFrog หลายเดือนก่อน +4

    This AI is just the best out there in terms of quality compared to all others (Hedra, Liveportrait, Sadtalker, Hallo, V-express). It might be the right time to start saving for RTX 5090

    • @StableAIHub
      @StableAIHub  หลายเดือนก่อน +2

      Ha ha ha. True that, let me also start saving.
      Please could you check the teeth part. Are you happy?

    • @StableAIHub
      @StableAIHub  หลายเดือนก่อน +1

      I think eye blinking needs some improvement. Sometime only 1 eye blink.

    • @ChikadorangFrog
      @ChikadorangFrog หลายเดือนก่อน +2

      @@StableAIHub Minor imperfections are ok. It's easy to edit in capcut by applying some effects. What is important for me is the skin texture similar to sadtalker.

    • @StableAIHub
      @StableAIHub  หลายเดือนก่อน +2

      @@ChikadorangFrog Right. The quality is good. I wasn't expecting this good for AI.

  • @arron122
    @arron122 หลายเดือนก่อน +3

    👀Gonna test this one out

  • @Im_that_guy_man
    @Im_that_guy_man หลายเดือนก่อน +2

    I just wanna say thank you for your tutorials. great job

    • @StableAIHub
      @StableAIHub  หลายเดือนก่อน

      Thank you for your feedback

  • @Avalon19511
    @Avalon19511 หลายเดือนก่อน +5

    Thank you for the video, unfortunately I think the big roadblock with a lot of these talking head software is optimization, it took 17 minutes for a 5 second video, imagine if you had a 3 minute video, it would take 6hrs and 12 minutes which is just not a good use of your time, hopefully in the neat future they get better

    • @StableAIHub
      @StableAIHub  หลายเดือนก่อน +3

      The processing time can be significantly reduced if you use a 16GB or 24GB VRAM card. Using cloud services can further decrease the rendering time. A few months ago, the major issue was the quality, as the output would get distorted when using realistic images generated with SD. EchoMimic has surprised me with its improvements. I'm happy to see that the quality is getting better, and in due time, the speed will also improve.
      Unfortunately, I have the most basic laptop that only meets the minimum requirements, which explains the slow speed.

    • @Avalon19511
      @Avalon19511 หลายเดือนก่อน +3

      @@StableAIHub I think you'll agree, prices being what they are, most people either have a 12 or 8 gb and I think that is where the optimization focus should be:)

    • @StableAIHub
      @StableAIHub  หลายเดือนก่อน +3

      I agree what you said. I hope in due time the processing would be much faster on low VRAM cards.

    • @ChikadorangFrog
      @ChikadorangFrog หลายเดือนก่อน +2

      @@StableAIHub
      i think devs plan to release a faster version of this in 1 to 2 months

  • @TomiTom1234
    @TomiTom1234 หลายเดือนก่อน +2

    Good tool, better than HALLO which takes longer time to process.
    BTW, I created a bat file to start the program easier and faster.

    • @ChikadorangFrog
      @ChikadorangFrog หลายเดือนก่อน +1

      can you share the bat file?

    • @TomiTom1234
      @TomiTom1234 หลายเดือนก่อน +2

      @@ChikadorangFrog The video publisher added the code for bat file and pinned it, you can copy and paste it in a text file, then change the extension to "bat".

    • @ChikadorangFrog
      @ChikadorangFrog หลายเดือนก่อน +2

      @@TomiTom1234 thx

    • @TomiTom1234
      @TomiTom1234 หลายเดือนก่อน +2

      @@ChikadorangFrog You are welcome.
      Don't forget to change the paths that need to be changed to match your folders.

  • @ChikadorangFrog
    @ChikadorangFrog 25 วันที่ผ่านมา +1

    The quallity of the accelerated version is not good. I will just use the slower version for now

    • @StableAIHub
      @StableAIHub  25 วันที่ผ่านมา

      I noticed the same. Used the slower version for next video.
      Did you came across any tool for singing talking head.

    • @ChikadorangFrog
      @ChikadorangFrog 25 วันที่ผ่านมา +1

      @@StableAIHub next release of echomimic would have Pretrained models with better sing performance to be released

    • @StableAIHub
      @StableAIHub  23 วันที่ผ่านมา +2

      We need to keep a watch on
      ingrid789.github.io/MyTalk/
      Looks amazing

    • @ChikadorangFrog
      @ChikadorangFrog 22 วันที่ผ่านมา

      @@StableAIHub might be good to combine with Kling AI

  • @behrampatel4872
    @behrampatel4872 หลายเดือนก่อน +2

    hi this has come up in another youtubers video but is it necessary to use conda to create virtual environment's ?
    From all your videos i learned that we can create a venv on our own.
    So will this tutorial work if we don't use conda ?
    Thanks,
    b

    • @StableAIHub
      @StableAIHub  หลายเดือนก่อน +1

      The answer is long.
      Primarily we are using either PIP or CONDA to create virtual environment (VE). Sometimes the dependencies are very specific like some packages, python version... etc which can be easily done using conda.
      I don't know if this will work without conda. You need to try and let us know plz.

    • @behrampatel4872
      @behrampatel4872 หลายเดือนก่อน +1

      @@StableAIHub Got it. thanks for the info.
      Cheers,
      b

  • @VintageForYou
    @VintageForYou 5 วันที่ผ่านมา +1

    I have installed EchoMimic when I load an example image and audio I get an Error can you please help.🤔 Error code,,, cv2.error: OpenCV(4.10.0) :-1: error: (-5:Bad argument) in function 'resize'
    > - src is not a numerical tuple
    > - Expected Ptr for argument 'src'

    • @StableAIHub
      @StableAIHub  4 วันที่ผ่านมา

      Check if the solution posted here works?
      github.com/BadToBest/EchoMimic/issues/102

    • @VintageForYou
      @VintageForYou 4 วันที่ผ่านมา +1

      @@StableAIHub Got it working now from your link but it takes time to render for 5 Seconds of audio on a 12GB Graphics card and 32 GB of RAM over 20 minutes this app is similar to Hallo Time consuming.😥

    • @StableAIHub
      @StableAIHub  4 วันที่ผ่านมา

      @@VintageForYou Try the accelerated version which is very fast.

  • @ChikadorangFrog
    @ChikadorangFrog 27 วันที่ผ่านมา +2

    Is the new update working? Im having lots of errors

    • @StableAIHub
      @StableAIHub  27 วันที่ผ่านมา

      A2V with acceleration is working fine. Please could you share error screen using Drive.

    • @ChikadorangFrog
      @ChikadorangFrog 27 วันที่ผ่านมา +1

      @@StableAIHub Thanks its working fine now. The Gradio is the one that is not working

    • @StableAIHub
      @StableAIHub  25 วันที่ผ่านมา

      @@ChikadorangFrog If no one is gonna fix I will see if I can. I am not a programmer so gonna take help from AI.
      By any chance do you have the old version / earlier release of EchoMimic when it was working

    • @StableAIHub
      @StableAIHub  25 วันที่ผ่านมา

      @@ChikadorangFrog Please check the github, I posted the solution. If you can confirm on github, it can be merged in repo

    • @ChikadorangFrog
      @ChikadorangFrog 25 วันที่ผ่านมา

      @@StableAIHub i made a mistake by cloning the latest version and copy paste it to the original/old. I no longer have the old working version

  • @rahulkathuria8250
    @rahulkathuria8250 23 วันที่ผ่านมา +1

    output video isn't HD, blurry

    • @StableAIHub
      @StableAIHub  23 วันที่ผ่านมา +1

      It is trained on 512 x 512 dataset. Use upscaler to improve quality.

    • @StableAIHub
      @StableAIHub  23 วันที่ผ่านมา

      I always use 4xUltraSharp in Automatic1111. For that you need to extract all frames, upscale and then combine as video.
      You can refer the following on how to extract frames
      th-cam.com/video/2M6RC1kJeio/w-d-xo.html

    • @rahulkathuria8250
      @rahulkathuria8250 21 วันที่ผ่านมา +1

      @@StableAIHub beard is getting blurry and distorted

    • @StableAIHub
      @StableAIHub  20 วันที่ผ่านมา +1

      @@rahulkathuria8250 Do you have generated video. Please post on github

  • @rahulkathuria8250
    @rahulkathuria8250 21 วันที่ผ่านมา +1

    beard is getting blurry and distorted

    • @StableAIHub
      @StableAIHub  20 วันที่ผ่านมา

      Please post the output on github

    • @rahulkathuria8250
      @rahulkathuria8250 20 วันที่ผ่านมา +1

      @@StableAIHub you mean the video, okay but they haven't released the dataset which means they haven't trained bearded guys.