F5-TTS and E2-TTS - AI Model That Fakes Fluent Speech

F5-TTS and E2-TTS - AI Model That Fakes Fluent Speech - Install Locally

Рет қаралды 9,961

Күн бұрын

Пікірлер: 46

@fahdmirza 11 күн бұрын

🔥Install F5-TTS on Windows Locally - kzbin.info/www/bejne/ZnmTZp14m7aeodUsi=oSqBtL-2MR_zBdOv 🔥Train F5-TTS on Your own voice data - kzbin.info/www/bejne/iIK7eX6Faqtsnsksi=9U4AEyrDKI8IwXkK 🔥F5-TTS and E2-TTS - AI Model That Fakes Fluent Speech - Install Locally - kzbin.info/www/bejne/bGqcoamKjLOAidEsi=d7QIITlqxgHMNI7C

@AdamNewcombMumbleTrash 12 күн бұрын

My dude, you have got to start selling "Let me clear the screen" T-shirts. YOU MUST!! I appreciate your work. Very informative.

@fahdmirza 12 күн бұрын

Thank you

@badrinarayanans355 12 күн бұрын

Tried it out, it's giving pretty good results. Looking forward to learn about finetuning this TTS models. Expecting a video from you.

@fahdmirza 12 күн бұрын

Great to hear!

@siddhubhai2508 Күн бұрын

Sir there is a problem, now in the f5 repo there is no test inference python file, now what to do? Please make a new video no the updated version.

@Homosapien77 8 күн бұрын

Depends if you give it isolated audio of a person talking like forty minutes worth of it the results are solid. If you are getting bad results just try getting longer footage removing backround noise and cleanup before inputting the mp3.

@fahdmirza 8 күн бұрын

Sure, thanks.

@john_blues 13 күн бұрын

They use a lot of non technical words to describe this(Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching) but it seems like it's just a voice clone app. It's good. But, just voice cloning.

@oskar4239 12 күн бұрын

Show me you have only cursory experience in TTS models without telling me you have only cursory experience in TTS models

@john_blues 12 күн бұрын

@@oskar4239 And what's wrong with that? I'm here to learn.

@fahdmirza 12 күн бұрын

Thanks for the feedback.

@slamdunk0113 11 күн бұрын

Thanks a lot ~~ Does this F5-TTS have tutorials for generating other languages like Japanese?

@fahdmirza 11 күн бұрын

Not yet, but watch this one for trying it out 🔥Install F5-TTS on Windows Locally - kzbin.info/www/bejne/ZnmTZp14m7aeodUsi=oSqBtL-2MR_zBdOv

@ZergRadio 12 күн бұрын

Is it only for English voices or any language?

@fahdmirza 12 күн бұрын

Chinese and English

@ZergRadio 11 күн бұрын

@@fahdmirza wrote "Chinese and English", Thankz mate

@TheGodParticles 10 күн бұрын

once you have done one process can this "model" or "dict" be used with open-webui for tts?

@SiD-hq2fo 12 күн бұрын

will there be guide we expecting for finetuning this model ?

@fahdmirza 12 күн бұрын

Noted.

@richardrispoli4508 13 күн бұрын

Pretty good result.

@fahdmirza 13 күн бұрын

I think so too!

@vishalkhombare 12 күн бұрын

Thanks a lot!!

@fahdmirza 12 күн бұрын

You're welcome!

@theh1ve 12 күн бұрын

The generation times are quite slow, although i guess that is use case specific! I assume it can run on GPU and crankmout faster generations?

@fahdmirza 12 күн бұрын

yes thats correct

@NickMak-m2c 10 күн бұрын

The F5/E2 is too choppy and oddly broken up, it sounds like another language trying to be English. Thank you for the demo, mate!

@fahdmirza 10 күн бұрын

Thanks for watching!

@siddarth26 11 күн бұрын

Bro. Did it work without gpu

@fahdmirza 11 күн бұрын

yes

@sidarth404 11 күн бұрын

hey bro. can you run this in google colab.

@fahdmirza 11 күн бұрын

No, but you can run it on Windows easily : 🔥Install F5-TTS on Windows Locally - kzbin.info/www/bejne/ZnmTZp14m7aeodUsi=oSqBtL-2MR_zBdOv

@siddhubhai2508 3 күн бұрын

Do you know hindi?

@fahdmirza 3 күн бұрын

No.

@LeeBrenton 11 күн бұрын

seems like it's on pinokio

@fahdmirza 11 күн бұрын

yes and I already did the video on it.

@mahaltech Күн бұрын

hello i try to flow you on video , first repo is change its not the same even i recognize i have to run speech_edit.py the output is intersection its not clear voice its give half from original voice and contain small piece of my text that i want to generate to voice