F5-TTS and E2-TTS - AI Model That Fakes Fluent Speech - Install Locally

9,961 views

Fahd Mirza


1 day ago

Comments: 46
@fahdmirza 11 days ago
🔥Install F5-TTS on Windows Locally - kzbin.info/www/bejne/ZnmTZp14m7aeodUsi=oSqBtL-2MR_zBdOv
🔥Train F5-TTS on Your own voice data - kzbin.info/www/bejne/iIK7eX6Faqtsnsksi=9U4AEyrDKI8IwXkK
🔥F5-TTS and E2-TTS - AI Model That Fakes Fluent Speech - Install Locally - kzbin.info/www/bejne/bGqcoamKjLOAidEsi=d7QIITlqxgHMNI7C
@AdamNewcombMumbleTrash 12 days ago
My dude, you have got to start selling "Let me clear the screen" T-shirts. YOU MUST!! I appreciate your work. Very informative.
@fahdmirza 12 days ago
Thank you
@badrinarayanans355 12 days ago
Tried it out, and it's giving pretty good results. Looking forward to learning about fine-tuning these TTS models. Expecting a video from you.
@fahdmirza 12 days ago
Great to hear!
@siddhubhai2508 1 day ago
Sir, there is a problem: the F5 repo no longer has a test inference Python file. What should I do now? Please make a new video on the updated version.
@Homosapien77 8 days ago
It depends. If you give it isolated audio of a person talking, like forty minutes' worth, the results are solid. If you are getting bad results, try using longer footage, removing background noise, and cleaning up the audio before inputting the MP3.
@fahdmirza 8 days ago
Sure, thanks.
@john_blues 13 days ago
They use a lot of non-technical words to describe this (Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching), but it seems like it's just a voice clone app. It's good. But, just voice cloning.
@oskar4239 12 days ago
Show me you have only cursory experience in TTS models without telling me you have only cursory experience in TTS models
@john_blues 12 days ago
@oskar4239 And what's wrong with that? I'm here to learn.
@fahdmirza 12 days ago
Thanks for the feedback.
@slamdunk0113 11 days ago
Thanks a lot ~~ Does this F5-TTS have tutorials for generating other languages like Japanese?
@fahdmirza 11 days ago
Not yet, but watch this one for trying it out 🔥Install F5-TTS on Windows Locally - kzbin.info/www/bejne/ZnmTZp14m7aeodUsi=oSqBtL-2MR_zBdOv
@ZergRadio 12 days ago
Is it only for English voices or any language?
@fahdmirza 12 days ago
Chinese and English
@ZergRadio 11 days ago
@fahdmirza wrote "Chinese and English". Thanks, mate.
@TheGodParticles 10 days ago
Once you have done one process, can this "model" or "dict" be used with open-webui for TTS?
@SiD-hq2fo 12 days ago
Will there be a guide for fine-tuning this model? We're expecting one.
@fahdmirza 12 days ago
Noted.
@richardrispoli4508 13 days ago
Pretty good result.
@fahdmirza 13 days ago
I think so too!
@vishalkhombare 12 days ago
Thanks a lot!!
@fahdmirza 12 days ago
You're welcome!
@theh1ve 12 days ago
The generation times are quite slow, although I guess that is use-case specific! I assume it can run on a GPU and crank out faster generations?
@fahdmirza 12 days ago
Yes, that's correct.
@NickMak-m2c 10 days ago
The F5/E2 is too choppy and oddly broken up, it sounds like another language trying to be English. Thank you for the demo, mate!
@fahdmirza 10 days ago
Thanks for watching!
@siddarth26 11 days ago
Bro, did it work without a GPU?
@fahdmirza 11 days ago
yes
@sidarth404 11 days ago
Hey bro, can you run this in Google Colab?
@fahdmirza 11 days ago
No, but you can run it on Windows easily: 🔥Install F5-TTS on Windows Locally - kzbin.info/www/bejne/ZnmTZp14m7aeodUsi=oSqBtL-2MR_zBdOv
@siddhubhai2508 3 days ago
Do you know Hindi?
@fahdmirza 3 days ago
No.
@LeeBrenton 11 days ago
Seems like it's on Pinokio.
@fahdmirza 11 days ago
Yes, and I already did a video on it.
@mahaltech 1 day ago
Hello, I tried to follow along with your video. First, the repo has changed and is no longer the same. I worked out that I have to run speech_edit.py, but the output is a mix and not a clear voice: it gives about half of the original voice plus a small piece of the text that I wanted to turn into speech.
@Yuuyu_play 12 days ago
😇😇😇😇
@fahdmirza 12 days ago
Thanks for the support.
@mal-avcisi9783 12 days ago
Hey bro, can you give me a 99% discount coupon for an H100?
@fahdmirza 12 days ago
Sure, why not.
@mal-avcisi9783 11 days ago
@fahdmirza Thanks man, very kind.