EchoMimic Magic: Audio and Landmarks Bring Portraits to Life! The BEST talking head generation app.

2,059 views

days ago

Readme / Instructions
drive.google.c...
#AI #StableDiffusion #TechInnovation #ArtificialIntelligence #DeepLearning #AIExploration #TechEnthusiast #CreativityInAI #StableAIHub #AICommunity #InnovationHub #TechBreakthroughs #AIResearch #futuristictechnology #EchoMimic #audiotovideo #lipsync

Comments: 70

@StableAIHub 3 months ago

BAT file for launching:
@echo off
REM Change to the directory of the batch file
cd /d "%~dp0"
REM Activate the EchoMimic environment
call conda activate echomimic
REM Launch WebUI
python webgui.py --server_port=3000
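For anyone not on Windows, roughly the same launch can be sketched in Python. This is only a sketch under assumptions: it relies on conda's `conda run` subcommand instead of `conda activate`, it assumes the environment is named `echomimic` as in the BAT file and that conda is on PATH, and the function names `build_launch_command` and `launch` are made up for illustration.

```python
import shutil
import subprocess
from pathlib import Path
from typing import List, Optional


def build_launch_command(conda_exe: str = "conda",
                         env_name: str = "echomimic",
                         port: int = 3000) -> List[str]:
    """Return a `conda run` command that starts webgui.py in the given environment."""
    return [
        conda_exe, "run", "--no-capture-output", "-n", env_name,
        "python", "webgui.py", f"--server_port={port}",
    ]


def launch(app_dir: Optional[str] = None) -> int:
    """Start the WebUI from app_dir (default: current directory); return its exit code."""
    # shutil.which honours PATHEXT on Windows, so it can also find conda.bat.
    conda_exe = shutil.which("conda")
    if conda_exe is None:
        raise FileNotFoundError("conda was not found on PATH")
    workdir = Path(app_dir) if app_dir else Path.cwd()
    return subprocess.call(build_launch_command(conda_exe), cwd=workdir)
```

Calling `launch("path/to/EchoMimic")` (the path is a placeholder) would then play the role of double-clicking the BAT file.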

@IdgrafixCh 3 months ago

Hi there, I tried to install it for ComfyUI but could not do so successfully (via Manager, plus copying all the missing files from the repo). Could you please make a tutorial for ComfyUI? 😊

@StableAIHub 3 months ago

@IdgrafixCh I am sorry, ComfyUI is not my cup of tea. This is a standalone install, so why use ComfyUI? ComfyUI will occupy VRAM for its own use, and then this tool needs VRAM on top of that. Standalone means more VRAM is available. That's my understanding.

@Im_that_guy_man 3 months ago

I just wanna say thank you for your tutorials. Great job!

@StableAIHub 3 months ago

Thank you for your feedback

@arron122 3 months ago

👀Gonna test this one out

@ChikadorangFrog 3 months ago

This AI is just the best out there in terms of quality compared to all the others (Hedra, LivePortrait, SadTalker, Hallo, V-Express). It might be the right time to start saving for an RTX 5090.

@StableAIHub 3 months ago

Ha ha ha. True that, let me also start saving. Could you please check the teeth? Are you happy with them?

@StableAIHub 3 months ago

I think eye blinking needs some improvement. Sometimes only one eye blinks.

@ChikadorangFrog 3 months ago

@StableAIHub Minor imperfections are OK. They are easy to fix in CapCut by applying some effects. What is important for me is that the skin texture is similar to SadTalker's.

@StableAIHub 3 months ago

@ChikadorangFrog Right. The quality is good. I wasn't expecting AI to be this good.

@madmad555 1 month ago

the auto lip sync is cap tho, better in sadtalker

@behrampatel4872 3 months ago

Hi, this has come up in another YouTuber's video, but is it necessary to use conda to create virtual environments? From all your videos I learned that we can create a venv on our own. So will this tutorial work if we don't use conda? Thanks, b

@StableAIHub 3 months ago

The answer is long. We mainly use either pip (with venv) or conda to create a virtual environment (VE). Sometimes the dependencies are very specific, such as particular packages or a particular Python version, and conda handles those easily. I don't know whether this will work without conda. You would need to try it; please let us know.
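For reference, creating a plain venv without conda could look like the sketch below. It uses Python's standard `venv` module; the function name `create_plain_venv` is made up for illustration. Whether EchoMimic then installs and runs correctly inside such an environment is untested here. One real difference from conda: a venv reuses the Python version of the interpreter that creates it, so a matching Python version has to be installed first.

```python
import venv
from pathlib import Path


def create_plain_venv(env_dir: str, with_pip: bool = True) -> Path:
    """Create a virtual environment with the stdlib venv module and return its path."""
    target = Path(env_dir).expanduser().resolve()
    # The new environment uses the same Python version as the running interpreter.
    venv.EnvBuilder(with_pip=with_pip).create(target)
    return target
```

After something like `create_plain_venv("echomimic-venv")`, the environment would be activated from its `Scripts` (Windows) or `bin` (Linux/macOS) folder and the project's requirements installed with pip.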

@behrampatel4872 3 months ago

@StableAIHub Got it. Thanks for the info. Cheers, b

@Avalon19511 3 months ago

Thank you for the video. Unfortunately, I think the big roadblock with a lot of this talking-head software is optimization. It took 17 minutes for a 5-second video; at that rate a 3-minute video would take 612 minutes, which is about 10 hours and 12 minutes and just not a good use of your time. Hopefully they get better in the near future.
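A quick way to check this kind of estimate, assuming render time scales linearly with clip length (an assumption, not something measured in this thread). The function names are made up for illustration.

```python
def estimate_render_minutes(clip_seconds: float,
                            sample_clip_seconds: float,
                            sample_render_minutes: float) -> float:
    """Scale a measured render time linearly to a clip of a different length."""
    return clip_seconds / sample_clip_seconds * sample_render_minutes


def format_hours_minutes(total_minutes: float) -> str:
    """Format a duration in minutes as hours and minutes."""
    hours, minutes = divmod(round(total_minutes), 60)
    return f"{hours} h {minutes} min"


# 17 minutes for 5 seconds of video, scaled to a 3-minute (180-second) clip.
total = estimate_render_minutes(180, 5, 17)
print(total)                        # 612.0
print(format_hours_minutes(total))  # 10 h 12 min
```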

@StableAIHub 3 months ago

The processing time can be significantly reduced if you use a 16GB or 24GB VRAM card. Using cloud services can further decrease the rendering time. A few months ago, the major issue was the quality, as the output would get distorted when using realistic images generated with SD. EchoMimic has surprised me with its improvements. I'm happy to see that the quality is getting better, and in due time, the speed will also improve. Unfortunately, I have the most basic laptop that only meets the minimum requirements, which explains the slow speed.

@Avalon19511 3 months ago

@StableAIHub I think you'll agree that, prices being what they are, most people have either a 12 GB or an 8 GB card, and I think that is where the optimization focus should be :)

@StableAIHub 3 months ago

I agree with what you said. I hope that in due time processing will be much faster on low-VRAM cards.

@ChikadorangFrog 3 months ago

@StableAIHub I think the devs plan to release a faster version of this in 1 to 2 months.

@TomiTom1234 3 months ago

Good tool, better than Hallo, which takes longer to process. BTW, I created a BAT file to start the program more easily and quickly.

@ChikadorangFrog 3 months ago

Can you share the BAT file?

@TomiTom1234 3 months ago

@ChikadorangFrog The video publisher added the code for the BAT file and pinned it. You can copy and paste it into a text file, then change the extension to ".bat".

@ChikadorangFrog 3 months ago

@TomiTom1234 thx

@TomiTom1234 3 months ago

@ChikadorangFrog You are welcome. Don't forget to change any paths that need changing to match your folders.

@VintageForYou 2 months ago

I have installed EchoMimic, but when I load an example image and audio I get an error. Can you please help? 🤔 Error:
cv2.error: OpenCV(4.10.0) :-1: error: (-5:Bad argument) in function 'resize'
> - src is not a numerical tuple
> - Expected Ptr<cv::UMat> for argument 'src'

@StableAIHub 2 months ago

Check whether the solution posted here works: github.com/BadToBest/EchoMimic/issues/102

@VintageForYou 2 months ago

@StableAIHub Got it working now from your link, but rendering 5 seconds of audio takes over 20 minutes on a 12 GB graphics card with 32 GB of RAM. This app is similar to Hallo: time-consuming. 😥

@StableAIHub 2 months ago

@VintageForYou Try the accelerated version, which is very fast.

@Nonewedone 1 month ago

I got the same issue.

@Nonewedone 1 month ago

File "webgui.py", line 169, in process_video
    face_img = cv2.resize(face_img, (width, height))
cv2.error: OpenCV(4.10.0) :-1: error: (-5:Bad argument) in function 'resize'
> Overload resolution failed:
> - src is not a numerical tuple
> - Expected Ptr<cv::UMat> for argument 'src'