I've watched many videos about F5-TTS on YouTube. You are the only one who clearly compares the original and cloned audio for the viewer. Keep up the good work!
@neuralfalcon 3 months ago
Glad I could help!
@dzrook 12 days ago
thank you thank you thank youuu. Neural Falcon you saved me a lot of time THANK YOU SO MUCH
@neuralfalcon 12 days ago
Glad it helped!
@mekkicharfi5454 3 months ago
Thank you very much and especially for your patience
@Dex383-d8d 2 months ago
I have already tried everything in the video and it is indeed very easy to use. The AI has its problems, but I guess it will improve over time. The voice cloning part works 100 out of 10: I managed to confuse a friend with his own voice speaking in another language, which was very funny. Thank you very much for the video and for taking the time to respond to my first comment.
@MR.VAN1979 2 months ago
Your videos bring a lot of value to the community and are worthy of 1 subscription, 1 like, and 1 comment. I wish you good health and hope you make many more valuable videos for everyone to learn from and follow.
@QHawk7 2 months ago
*Great video, thanks. Try dubbing a short documentary and importing a deep voice; let's see what we can do with all the AI tools and Colabs available at this moment.*
@dkerdnase 3 months ago
Thank you so much man! You're awesome!
@Jerometk 1 month ago
Do you have the same thing but for lip sync? Something on Google Colab or similar? I want to lip-sync audio to a video, not an image.
@neuralfalcon 1 month ago
Yes, there is Wav2Lip: github.com/Rudrabha/Wav2Lip My Google Colab link: github.com/NeuralFalconYT/wav2lip
Please make a video on how to use the multi-speech option of this model. I'm having trouble using it.
@neuralfalcon 2 months ago
11:18 watch this video kzbin.info/www/bejne/bJqTlIuJq96tb5osi=IZ8FKfAD7l0sqmgV
@neuralfalcon 2 months ago
Use the format {emotion_name} your_text. For example: If the emotion is "happy": {happy} I won a prize. For multiple emotions: {happy} I'm happy. {angry} I'm angry. {sad} I'm sad. There’s no set order. Just indicate the needed emotion in curly braces before each sentence, like {emotion} your_text. Make sure you label those reference audio files the same as your emotion_name.
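The `{emotion} your_text` format described above can be sketched as a small parser. This is only an illustration of the tagging scheme, not the notebook's actual code: the helper names `split_by_emotion` and `missing_references` are hypothetical, and the pattern assumes emotion names contain only letters, digits, and underscores.

```python
import re

# Matches "{emotion} text" chunks; the text runs until the next brace or the end.
SEGMENT_RE = re.compile(r"\{(\w+)\}\s*([^{}]*)")


def split_by_emotion(script):
    """Return a list of (emotion, text) pairs in the order they appear."""
    segments = []
    for emotion, text in SEGMENT_RE.findall(script):
        text = text.strip()
        if text:  # skip tags that have no text after them
            segments.append((emotion, text))
    return segments


def missing_references(segments, reference_names):
    """Return emotions used in the script that have no reference audio with that label."""
    return sorted({emotion for emotion, _ in segments} - set(reference_names))


script = "{happy} I'm happy. {angry} I'm angry. {sad} I'm sad."
segments = split_by_emotion(script)
print(segments)
# Labels of the uploaded reference clips (hypothetical values for this demo).
print(missing_references(segments, ["happy", "sad"]))
```

Running a check like `missing_references` before generating is one way to catch a reference clip whose label does not match the emotion name used in the text.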
@lsgzyt 2 months ago
@neuralfalcon thx for helping me out
@harshvaghanii 2 months ago
I got an error in the second step: name 'base_path' is not defined
@neuralfalcon 2 months ago
That's because you skipped the cell above, which sets base_path = "/content". Run that cell first, then run the next one.
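A minimal sketch of why the order matters, assuming two notebook cells. Only `base_path = "/content"` comes from the reply above; the `outputs` folder name is hypothetical and used purely for illustration.

```python
# Cell 1: defines the variable that later cells depend on.
base_path = "/content"

# Cell 2: builds on base_path. Running this cell alone in a fresh runtime
# raises NameError: name 'base_path' is not defined.
output_dir = f"{base_path}/outputs"  # "outputs" is a hypothetical folder name
print(output_dir)
```

The same error comes back after a runtime restart, because every variable is cleared and the first cell has to be run again.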
@gg69155 21 days ago
Can you make a Colab version for Fish Speech TTS?
@neuralfalcon 20 days ago
Someone already did it. Try this 😀 colab.research.google.com/drive/1trBvrdgyI-Ntd45ZnlT5lhGsI_HnKjC1?usp=sharing
@gg69155 20 days ago
@neuralfalcon I tried that one, but I can't seem to make it work. Could you... make a tutorial on how to run it? 🥺🙏
@neuralfalcon 19 days ago
@gg69155 I will try
@gg69155 19 days ago
@neuralfalcon thank you 😊
@EphemeralInferno 1 month ago
When I do it, it says "No module named onx"
@neuralfalcon 1 month ago
Yep, new bug.
@syntaxstreets 3 months ago
The 2nd audio and the first model are super.
@snakezo4218 2 months ago
Is there a way to speak in our own voice and transfer its emotion and tone to the cloned voice? For example, if I act out an angry person, can the cloned voice reproduce that angry tone?
@neuralfalcon 2 months ago
Easy. Record a short, 15-second audio clip in which you speak in a specific tone, such as angry, sad, or happy. Use this clip as the reference audio in F5-TTS, and the output voice will match the emotion you chose.
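If you want to confirm that a recorded reference clip is close to the suggested 15 seconds before uploading it, here is a rough sketch using only Python's standard `wave` module. It assumes the clip is an uncompressed WAV file; the helper name `wav_duration_seconds` and the demo file are hypothetical, and the demo writes 2 seconds of silence only so the example runs on its own.

```python
import os
import tempfile
import wave


def wav_duration_seconds(path):
    """Return the length of a WAV file in seconds."""
    with wave.open(path, "rb") as wav_file:
        return wav_file.getnframes() / wav_file.getframerate()


# Demo: write 2 seconds of 16-bit mono silence at 16 kHz, then measure it.
demo_path = os.path.join(tempfile.mkdtemp(), "reference.wav")
with wave.open(demo_path, "wb") as wav_file:
    wav_file.setnchannels(1)      # mono
    wav_file.setsampwidth(2)      # 2 bytes per sample = 16-bit audio
    wav_file.setframerate(16000)  # 16,000 frames per second
    wav_file.writeframes(b"\x00\x00" * 16000 * 2)

duration = wav_duration_seconds(demo_path)
print(f"{duration:.1f} s")
```

For a real recording, pass the path of your own file to `wav_duration_seconds` instead of the generated demo file.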
@vodkalikpatates 1 month ago
Thank you for the video! It's really helpful! 🙌 How can I use it with another model? For example, I want to try "F5-TTS-Turkish". How can I add it properly?
@neuralfalcon 1 month ago
Search on Google to find out whether someone has trained an F5-TTS model for Turkish, or train your own model. To learn how to train in a different language, watch this video: kzbin.info/www/bejne/i4CXpqaXhNSdr9Usi=uzMKfs6sdDloKU9a
@vodkalikpatates 1 month ago
@neuralfalcon There actually is a Turkish language model. I meant to ask how I can use it with your code, since there is no custom model option in the UI.
@411KJB 2 months ago
Link no longer works. Any new links?
@neuralfalcon 2 months ago
colab.research.google.com/github/NeuralFalconYT/F5-TTS-Demo/blob/main/F5_TTS_Latest.ipynb Or follow official instructions: github.com/SWivid/F5-TTS
@411KJB 2 months ago
It was PERFECT for that window though and I thank you so much.
@QHawk7 2 months ago
Can I get this to work on Kaggle?
@neuralfalcon 2 months ago
Yes
@QHawk7 2 months ago
@neuralfalcon How?
@neuralfalcon 2 months ago
@QHawk7 Use this notebook: github.com/NeuralFalconYT/F5-TTS-Demo/blob/main/F5_TTS_Latest.ipynb You may need to run: !sudo apt install ffmpeg Make sure you are connected to a GPU runtime. You may also need to install torch if PyTorch is not pre-installed on Kaggle. Official repository: github.com/SWivid/F5-TTS
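Before running the notebook on Kaggle, a quick check like the one below can tell you whether the two prerequisites mentioned above are already present. It is a sketch using only the Python standard library; the function name `check_environment` is hypothetical, and it only detects whether `ffmpeg` is on the PATH and whether a `torch` package is installed, not whether a GPU is attached.

```python
import importlib.util
import shutil


def check_environment():
    """Report whether ffmpeg and PyTorch are available, without importing torch."""
    return {
        # shutil.which returns the executable's path, or None if it is not on PATH.
        "ffmpeg": shutil.which("ffmpeg") is not None,
        # find_spec returns None when the package is not installed.
        "torch": importlib.util.find_spec("torch") is not None,
    }


status = check_environment()
for name, available in status.items():
    print(f"{name}: {'found' if available else 'missing'}")
```

If `ffmpeg` is reported missing, that is when the `!sudo apt install ffmpeg` step from the reply applies.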
@hiepinh5599 1 month ago
Can I train it with my own voice? For example: an Optimus voice.
@neuralfalcon 1 month ago
Yes, 100%. Copy this notebook and use F5-TTS: colab.research.google.com/github/NeuralFalconYT/F5-TTS-Demo/blob/main/F5_TTS_Latest.ipynb
@hiepinh5599 1 month ago
I checked your Colab, and it doesn't work.
@neuralfalcon 1 month ago
It's working
@hiepinh5599 1 month ago
@neuralfalcon Thank you, it worked. Now I have a checkpoint file trained with F5-TTS, but I don't know how to run inference with it. Can you help me? I need a Python script.
@Carlon15 2 months ago
Can you make a video about how to train your model in a different language, please?
Yes, but you either need to train the model in French yourself or wait for someone else to do it. The best option right now is to pay for a service like ElevenLabs.io to clone your voice.
If you used a virtual environment, delete the f5-tts folder and clear the Windows cache folder. Or, if you installed F5-TTS with plain pip, run pip uninstall F5-TTS and clear the Windows cache folder.
@kanavwastaken 3 months ago
Can you please make it work on LightningAI bro?
@neuralfalcon 3 months ago
github.com/NeuralFalconYT/F5-TTS-Demo/blob/main/F5-TTS-lightning-ai.ipynb Download this notebook and upload it to lightning.ai/. Make sure to switch to GPU.
Why did the page ask me for permission to use my microphone? Do not enter the pinned link, you will probably be hacked... The video seemed useful but better not risk it
@neuralfalcon 2 months ago
Thank you for your comment! It sounds like you might not be familiar with how Gradio applications work. The page requests microphone permission because the app needs to record or upload audio in order to clone it. Our code prioritizes recording audio before launching the app, which is why microphone access is required. If you're interested, you can learn more about this in the Gradio documentation here: www.gradio.app/docs/gradio/audio .
@Dex383-d8d 2 months ago
@neuralfalcon Thank you very much for replying to my comment. I will read the documentation; it is true that I am not familiar with the application.
@weini-sf3pu 3 months ago
When I use Generate TTS, I get an error: "FileNotFoundError: [Errno 2] No such file or directory: 'nvidia-smi'". Can you help me?
@neuralfalcon 3 months ago
@weini-sf3pu Yes, send a screenshot to NeuralFalcon@proton.me
@neuralfalcon 3 months ago
@weini-sf3pu First, you need a GPU to use 'nvidia-smi'. If you are running in a Jupyter notebook, use '!nvidia-smi'; if you are running in a terminal, just use 'nvidia-smi'. Otherwise, you can skip this step and find the CUDA version another way before installing PyTorch.
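The FileNotFoundError above is what Python raises when it tries to launch `nvidia-smi` and the executable is not on the PATH. As a rough sketch (the function name `nvidia_smi_output` is hypothetical and this is not the notebook's own code), the call can be guarded so that a machine without the NVIDIA tools skips the check instead of crashing:

```python
import shutil
import subprocess


def nvidia_smi_output():
    """Return the text printed by nvidia-smi, or None if it is unavailable or fails."""
    if shutil.which("nvidia-smi") is None:
        # Not on PATH: calling it directly would raise FileNotFoundError.
        return None
    result = subprocess.run(["nvidia-smi"], capture_output=True, text=True)
    return result.stdout if result.returncode == 0 else None


output = nvidia_smi_output()
print(output if output else "nvidia-smi not available; skipping GPU check")
```

When the command is available, its output includes the driver and CUDA versions, which is the information needed to pick a matching PyTorch build.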
@QHawk7 2 months ago
*Is it Multi-language?*
@neuralfalcon 2 months ago
Only English and Chinese
@neuralfalcon 2 months ago
Watch this video: kzbin.info/www/bejne/i4CXpqaXhNSdr9U
@Ice_camp 3 months ago
uncheck remove silence
@neuralfalcon 3 months ago
You can uncheck the "remove silence" option, though the generated audio may then contain silences.