ChatTTS - Best Quality Open Source Text-to-Speech Model? | Tutorial + Ollama Setup

  32,598 views

All About AI

1 day ago

Comments: 88
@neolynxer 3 months ago
Great stuff. Please implement an "a... yeah" counter in your videos! :D Should be fun.
@sophiedelavelle5958 3 months ago
26 only, according to the transcript
@john5s 3 months ago
I found that by setting a seed I can keep the voice sounding the same.
chat = ChatTTS.Chat()
chat.load_models(compile=True)  # Set to True for better performance
torch.manual_seed(seedNumber)
@2099EK 3 months ago
You're a hero! I wish more people would check out this comment. Thank you for posting this!
@mariaalhosni9982 2 months ago
I keep trying this and it's not working. Do you have any specific tips?
@2099EK 2 months ago
@mariaalhosni9982 When I do this, it only works for the same phrase. If I change the phrase, it uses a different voice. Do you have that issue?
@mariaalhosni9982 2 months ago
@2099EK Totally, and I can't comprehend why. What I noticed is that some characteristics stayed the same, such as gender. Overall I don't think this TTS is ready to be used yet; it's very difficult to deal with the lack of proper documentation.
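One way to act on the seed idea in this thread, sketched under assumptions: rather than reseeding before every call, sample a speaker embedding once and pass that same embedding to each inference call. The `sample_random_speaker()` method, the `params_infer_code` argument, and its `spk_emb`, `temperature`, `top_P` and `top_K` keys are assumed from the ChatTTS README as of mid-2024 and may differ in other releases; `build_infer_params` and `speak_with_fixed_voice` are hypothetical helper names, not part of the library.

```python
def build_infer_params(spk_emb, temperature=0.3, top_p=0.7, top_k=20):
    """Bundle a fixed speaker embedding with sampling settings.

    The key names are assumed from the mid-2024 ChatTTS README.
    """
    return {
        "spk_emb": spk_emb,
        "temperature": temperature,
        "top_P": top_p,
        "top_K": top_k,
    }


def speak_with_fixed_voice(chat, texts, spk_emb):
    """Run every text through chat.infer() with the same speaker embedding."""
    params = build_infer_params(spk_emb)
    # One call per text, always reusing the same params.
    return [chat.infer([text], params_infer_code=params) for text in texts]


# Assumed usage with the real library (not executed here):
#   import torch, ChatTTS
#   chat = ChatTTS.Chat()
#   chat.load_models()
#   torch.manual_seed(42)               # makes the sampled speaker repeatable
#   spk = chat.sample_random_speaker()  # sample once, reuse for every phrase
#   wavs = speak_with_fixed_voice(chat, ["First phrase.", "Second phrase."], spk)
```

If this matches the installed version, it may explain the behaviour described above: the seed alone only pins the random stream for one identical run, while a reused embedding stays constant whatever the text is.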
@legendarystuff6971 3 months ago
By default it generates a voice randomly from Gaussian noise. You can definitely choose the voice somehow; look over the example they have in their repo and ask Opus for help. Their materials combine English and Chinese, which makes it a bit annoying. In their bilibili video, whatever bilibili is, they even clone Steve Jobs's and Taylor Swift's voices. Nice find, thank you.
@2099EK 3 months ago
You'd've been a hero if you just put that here.
@SyamsQbattar 2 months ago
Please make a tutorial on ChatTTS + LM Studio, AnythingLLM or GPT4All
@glikoz 3 months ago
How about speech-to-text?
@joachimschoder 3 months ago
The project libukai/Awesome-ChatTTS has more extensive documentation. It is in Chinese, but Google Chrome can automatically translate at least the text elements. It doesn't replace good English documentation, but it is a good starting point.
@Ms.Robot. 3 months ago
Totally nailed this tut. This was very well explained. It was organized in order, with easy-to-understand step-by-step instructions, and it addressed important points in case we run into obstacles.❤ (Still waiting for GPT-4o voice too. I hope customizable voices are an option.)
@AllAboutAI 3 months ago
thanks a lot :D really appreciate it
@afaha2214 2 months ago
@AllAboutAI What GPU are you using, and how much VRAM do you need to follow along with this tutorial? Can you share some details about your hardware?
@limebulls 3 months ago
Can it speak German etc. as well?
@mendthedivide 3 months ago
It supports both Chinese and English
@davidtindell950 7 days ago
Thank You! Hope U R Well and KEEP Producing Great Tutorial Vids !!!
@EricB1 3 months ago
Great find. One suggestion: remove all [yeah] from your script.
@siddhubhai2508 15 days ago
Moral of the video - yeah
@rohanrjoshiimakemyvid7285 6 days ago
Can you suggest some totally free APIs that I can integrate on my website? I am looking for a "videos"-related API for commercial use.
@ArseniyPotapov 3 months ago
XTTSv2 is also a very good model, highly recommended
@Army_76-g4s 3 months ago
Hi! If English is still experimental, then French or German...
@Project_SaveTheWorld 3 months ago
It'd be nice if you could have it read Spanish. If it could, you'd pretty much have a translator.
@DihelsonMendonca 3 months ago
No Brazilian Portuguese, unfortunately 😮😮
@AllAboutAI 3 months ago
yeah, only English and Chinese for now, I think
@AEnoob 3 months ago
I think there is a text seed that lets you choose your voice
@AllAboutAI 3 months ago
oh nice, will look for it
@korni5149 3 months ago
How can I achieve this gradient text color and text appearing animation in Windows Terminal?
@VaibhavShewale 3 months ago
can't set the specific speaker
@AllAboutAI 3 months ago
yeah, that's the issue, right
@mendthedivide 3 months ago
Works well, but only with shorter wording/sentences. Almost sounds real at times, nice find!
@AllAboutAI 3 months ago
thanks :) yeah, I think the token limit is around 380
@rirelaughplus 12 days ago
@AllAboutAI so it's not working fully locally?
@catherinesalomon6342 23 days ago
Hall Betty White Elizabeth White Larry
@rirelaughplus 12 days ago
Which version of Python is needed?
@Ginto_O 3 months ago
Not good
@m3chaniewazne750 1 month ago
Can I run it on my AI server and access it from my computer? Is there an API for this?
@Tofu3435 3 months ago
I want to make audiobooks from light novels. My phone's built-in reader is too robotic, and NaturalReader needs internet access and is too expensive. Maybe a program built on this can help.
@MLDQ 3 months ago
Is it better than RealtimeTTS XTTS with Coqui?
@AllAboutAI 3 months ago
like I said in the vid, I don't think I could use this for real-time STS because of the compute time
@OliNorwell 3 months ago
It's interesting how the best quality settings aren't used in the basic advanced demo... I thought those later samples were very good. Not sure how it got so many stars so quickly on GitHub though; I mean, there are excellent alternatives that didn't grow nearly as fast.
@zachary3603 3 months ago
What are the better alternatives everyone is using? Using ElevenLabs atm, but the speech comes out so robotic half the time.
@snatvb 3 months ago
Similar to "bark" from Suno. It would be interesting to see which is better, and how the performance compares.
@AllAboutAI 3 months ago
noted, thanks :)
@christiandarkin 3 months ago
bark's problem - when I tested it at least - was that it didn't always say what you told it to say. Often it just made stuff up.
@snatvb 3 months ago
@christiandarkin I ran into this when I passed text that was too long
@stanTrX 3 months ago
Does it speak Turkish?
@AllAboutAI 3 months ago
I think it's only English and Chinese atm
@darcwader 3 months ago
very nice
@kamalkamals 3 months ago
but it doesn't support most popular languages
@Edward_ZS 3 months ago
Did anyone find a way to select the voice?
@easternwind4435 2 months ago
yeah yeah yeah
@threepe0 3 months ago
Turtle is better I think
@elgodric 3 months ago
Is there a web UI or something for non-coders?
@mrpro7737 3 months ago
very nice base voices, I can voice-change them to any model I want
@_zproxy 3 months ago
how
@AllAboutAI 3 months ago
do you know how?
@mrpro7737 3 months ago
@AllAboutAI I am using this open source project h§ttps§://youtu.§be/nXpBlC6OBw4?si=UOQ86u97CLp0BgJN for training models and voice changing, because it only supports audio-to-audio, and applio§.org for text to speech, because it supports so many accents and languages. Remove the § from the URLs; I posted this comment 3 times and YT keeps deleting it 😑
@minimin-wj8vp 3 days ago
how ??
@micbab-vg2mu 3 months ago
the quality is great - I have to try it
@AllAboutAI 3 months ago
yes give it a go :)
@DaCashRap 3 months ago
Uh yeah, great video overall!
@beliebigerusername 3 months ago
How can I train it on a different language? Or connect it to another model?
@JNET_Reloaded 3 months ago
Nice. Can you add the requirements.txt file?
ChatTTS
IPython
requests
openai
torchaudio
torch
numpy
omegaconf
vocos
vector_quantize_pytorch
transformers
@AllAboutAI 3 months ago
I'll try to do that :)
@OUTLANDAH 3 months ago
@AllAboutAI I'm having the same issue: when I get to adding the requirements I get an error.
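When installing the requirements fails, it can help to see which of the listed packages are actually missing from the current environment. The following is a minimal sketch using only the standard library; it assumes each package name in the list above is also its top-level import name, which is not guaranteed for every package, and `missing_packages` is a hypothetical helper name.

```python
import importlib.util

# Package names as listed in the comment above; assumed to match import names.
REQUIRED = [
    "ChatTTS", "IPython", "requests", "openai", "torchaudio", "torch",
    "numpy", "omegaconf", "vocos", "vector_quantize_pytorch", "transformers",
]


def missing_packages(names):
    """Return the names that cannot be found as top-level modules."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages(REQUIRED)
    if missing:
        print("Missing:", ", ".join(missing))
    else:
        print("All listed packages are importable.")
```

Running it inside the same virtual environment used for the tutorial narrows the error down to specific packages before retrying the install.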
@8eck 3 months ago
Don't see any weights on Hugging Face, so I don't understand all the hype... But the code looks real.
@agatitytube 3 months ago
I assume it is available only in English, right?
@mendthedivide 3 months ago
It supports both Chinese and English
@dumbol8126 3 months ago
Chinese ML engineers are goated
@gustavheinrich5565 3 months ago
Just make sure you're not falling for LLMs with Chinese propaganda and false information baked in when using that stuff.
@luisvictorf 3 months ago
cool story about the underwater cats, would've liked to hear a bit more; it would make a good kids' story! =D
@AllAboutAI 3 months ago
haha yes
@nexuslux 3 months ago
What's the best real-time equivalent of this?
@Resursator 3 months ago
GPT-4o, probably.
@BORCHLEO 3 months ago
coqui/tts with the xtts model
@NathanChambers 3 months ago
I've been really happy with alltalk_tts with DeepSpeed enabled. Its API worked well for my personal needs too. I'm no pro with TTS stuff, but it's been great for me. I have it reading/speaking in English, German, and Russian, and it does great. The only issue I've ever really had with it is that if it's set to "ru" but reading "en" text... punctuation can become demonic sounds :P Its Russian accent speaking English words is a real bonus to me too. :P
@AllAboutAI 3 months ago
yeah xtts is great
@Sanguen666 3 months ago
tortoise is still better
@NirmalEleQtra 3 months ago
Can we have an Indian English accent or any other Indian language dialect here? If yes, how can we do it?
@kobe81 3 months ago
ChatTTS.model.gpt:Incomplete result. hit max_new_token: 384
too bad...
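A common workaround for a per-call generation limit like the one in this log is to split long text into sentence-sized chunks and synthesize each chunk separately. The sketch below is one way to do that; `split_into_chunks` is a hypothetical helper, and the `max_chars` budget is only a rough stand-in for the model's real token limit, so the value would need tuning by ear.

```python
import re


def split_into_chunks(text, max_chars=200):
    """Greedily pack whole sentences into chunks of at most max_chars.

    A single sentence longer than max_chars is kept as its own chunk.
    """
    # Split after sentence-ending punctuation followed by whitespace.
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]
    chunks, current = [], ""
    for sentence in sentences:
        candidate = f"{current} {sentence}".strip()
        if current and len(candidate) > max_chars:
            # Adding this sentence would overflow: close the current chunk.
            chunks.append(current)
            current = sentence
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks
```

Each chunk could then be passed to the TTS call on its own and the resulting audio concatenated, at the cost of possible small prosody breaks at chunk boundaries.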