RIP ELEVENLABS! Create BEST TTS AI Voices LOCALLY For FREE!

  Рет қаралды 176,325

Aitrepreneur

Aitrepreneur

Күн бұрын

Пікірлер: 522
@Aitrepreneur
@Aitrepreneur 6 ай бұрын
HELLO HUMANS! Thank you for watching & do NOT forget to LIKE and SUBSCRIBE For More Ai Updates. Thx
@Refinement99
@Refinement99 6 ай бұрын
How come you are not using ai speech in your videos? I'm not trying to be mean or judge or anything, but you have a strong accent and it's sometimes hard to understand. Even though I love your content
@Sanguen666
@Sanguen666 6 ай бұрын
not bad, but tortoise is much more accurate imho. check out the video 'how charsi became a blacksmith' it was made with tortoise TTS.
@trezero
@trezero 6 ай бұрын
Would love to see how this can all be done programmatically as a next step.
@germanher7528
@germanher7528 6 ай бұрын
MEET LOCAL TTs NEAR YOU!!!! 1-800-HUGE-BOOBAS
@zirufe
@zirufe 5 ай бұрын
Hi. If I would like to add another language for training the voice for TTS. Is there any workaround or other model to be used like in this video?
@heady2905
@heady2905 6 ай бұрын
XTTS has such a good voice generation. If you repeat a sentence it sounds everytime different and if you combine it with your RVC voice model you got the best thing ever. The future of AI must be open source and it is good you show everybody how to use this powerful AI technology. Greetings from Germany 🙂
@v11cu96
@v11cu96 6 ай бұрын
Yeah I wouldnt say the 'sounding different each time', is a good thing when you want consistency though,Sorry for being negative, So far my open source TTS journey has not been great. I feel like im lucky if can generate 2 sentences in a consistent pitch and accent, maybe I need to try with an RVC model like you suggest?. Or just wait for the tech to impove a bit more.
@MrRaja
@MrRaja 5 ай бұрын
​@@v11cu96RVC method would be you recording the sentences how you'd like or using Xtts to generate a Audiofile to be used in RVC... My problem is how do I make it emotional like for example: What![Angry] Why didn't you tell me?[Desperate]
@LJames-ez9lr
@LJames-ez9lr 5 ай бұрын
@heady2905 oes this tts actually work for you? I got errors during installation and it wont launch.
@MAST
@MAST 5 ай бұрын
This comment sound like AI.
@LJames-ez9lr
@LJames-ez9lr 5 ай бұрын
@@MAST i figured it out! i only had Visual studio installed and not the tools
@GGfrostZ
@GGfrostZ Ай бұрын
Have fun in dependency hell trying to install this!
@SuperFurias
@SuperFurias 6 ай бұрын
for everyone having setup py errors: run the install.bat file wait for everything to be installed see error close the cmd open a new cmd inside the folder type "call venv\Scripts\activate" type "pip install tts" wait for everything to be installed, close cmd then run again the install.bat file no more setup py errors. don't ask me why, because i don't know
@StainCorb
@StainCorb 6 ай бұрын
This finally solved my Finetune install, now I just need a couple of days to figure out RVC version, installing things through the command prompt is a party... ... lol
@Rambo....
@Rambo.... 6 ай бұрын
👍Thanks man, you solved my problem here.
@petepablogaming243
@petepablogaming243 5 ай бұрын
What the hell. That fixed it. I also have no idea why though
@diehgo_sp
@diehgo_sp 5 ай бұрын
totally not working for me
@SuperFurias
@SuperFurias 5 ай бұрын
@@diehgo_sp sounds strange, did you do it correctly? i know for sure that installing the tts package fixes setup py errors. but maybe you are having a different error, or simply did the procedure wrong, so could you tell me exactly what error are you having, and where?
@OctoberFox
@OctoberFox 3 ай бұрын
There's a lot of information he leaves out of his instructions in many of his videos (like missing requirements). Thanks to the other commenters for some of their input, as it's been really helpful.
@ItsBrody
@ItsBrody 6 ай бұрын
Got hella complicated by the half way point. I wish you used a different persons voice, Obama's voice almost already sounds like a robot and the last sample you showed us honestly didnt sound that great.
@derekthemagician
@derekthemagician 6 ай бұрын
Whenever I see his logo image I know I want to do what he's doing but I'm not going to be able to.. Lol
@sir_no_name1478
@sir_no_name1478 5 ай бұрын
Lol for real? I mean you can stop the video when it is complicated and follow it step by step. And if it is really unclear just google the words he says. I would not use audacity to append the voice to a 2 minute clip though. There is a reason they want 2 minutes.
@jvdome
@jvdome 5 ай бұрын
Yea, i think this still too hard to do, i could follow step by step but dealing with errors is not worth the hassle if i am doing this for fun
@zachary3603
@zachary3603 5 ай бұрын
@@derekthemagician This is the only thing that makes it worth doing rn. As soon as it becomes easy, it won't stand out much.
@PACOBRYAN-cj9gf
@PACOBRYAN-cj9gf 3 ай бұрын
just summarize the vid with AI
@vi6ddarkking
@vi6ddarkking 6 ай бұрын
Between this in six months, The SD3 Fintunes, The tools that finally are getting us to consistent characters, and the Lama 3 Fintunes. This year's Sillytavern video Is going to be bonkers.
@user-jk9zr3sc5h
@user-jk9zr3sc5h 6 ай бұрын
SD3 sadly won't work with controlnet due to its lack of UNET architecture, but hopefully something similar is shipped soon
@GengoSenmon
@GengoSenmon 6 ай бұрын
How do we integrate this with SillyTavern so we can speak back and forth with the voice we generated?
@wakegary
@wakegary 6 ай бұрын
i dont know what ure talking about but i cloned this depos and so um now I need to know what ure talking about. cuz this is rad and does tortoise level at bark speeds. god we're nerds. (sonic checking watch gif) also expecting open source udio clone (odel-may eak-lay) to get git got soon. hoping! is musicgen still the top dog?
@olucassantos
@olucassantos 4 ай бұрын
I couldn't get past 2 minutes of tutorial, there were so many errors, some I solved, others were impossible to solve. 4 hours trying. Still, thank you very much for your efforts, unfortunately I gave up trying
@intelligenceservices
@intelligenceservices 2 ай бұрын
you're right, this is really badly maintained.
@siddhantshahi5027
@siddhantshahi5027 14 күн бұрын
took me 4 days to solve all the errors
@Ull3Rnet
@Ull3Rnet 6 ай бұрын
Hey @Aitrepreneur - Quick heads up on the XTTS-RVC-UI install on Win 10. It installs protobuf 5.26.1 by default, which didn't work for me. Downgrading to protobuf 3.20.0 fixed the issue. Just thought this might help others running into the same problem!
@yoniwoker
@yoniwoker 5 ай бұрын
I installed it, but the cmd window keeps closing. I press est, and cmd appears and closes quickly.
@Ull3Rnet
@Ull3Rnet 5 ай бұрын
@@yoniwoker try editing the bat file, add a new line at the bottom saying Pause , save, then run it again. Then you should be able to see the error before the window closes :)
@audiogus2651
@audiogus2651 6 ай бұрын
Woah, perfect timing on the vid, was just looking this stuff up today. Thanks homie!
@carlosgonbr
@carlosgonbr 5 ай бұрын
Unfortunately, it cannot be used, many errors occur during installation following the steps given. Code errors, things not found, different versions. things that are difficult for non-programmers to understand. what a shame.
@futaa34
@futaa34 5 ай бұрын
you are correct buddy. only a handful could have it running
@zytoh
@zytoh 5 ай бұрын
i started debugging with chatgpt, followed every step and error, and after 7 hours got it working, just dont give up, especially if its for a buisness which is why im using iut
@carlosgonbr
@carlosgonbr 5 ай бұрын
I discovered the main problem with my installation and after solving it, I installed everything without any errors. It's the Microsoft Visual C++ 14 package. It's not enough to just install Visual Studio, you have to install the package along with it, but it's not that intuitive. Look for a video "Fix: Microsoft Visual C++ 14.0 or greater is required in Python" from the "Hey, Let's Learn Something" channel that helped me. It's very simple. Then come back here and thank our friend from the channel who introduced us to this wonder.The three programs installed and are working without errors!
@popipopi3126
@popipopi3126 4 ай бұрын
took me hours to fix everything even knowing what im doing
@ChasingStars7111
@ChasingStars7111 4 ай бұрын
@@popipopi3126 how did you fix it ?
@coloryvr
@coloryvr 6 ай бұрын
Cool! I was waiting for this! Happy colored Greetinx!
@nodewizard
@nodewizard 6 ай бұрын
Goodbye Eleven Labs. They were overpriced and closed source. This XTTS model is amazing. Merci Monsieur Aitrepreneur. As a little goodbye kiss to Eleven Labs, I'm going to clone my favourite voices that they have. Lol.
@Aitrepreneur
@Aitrepreneur 6 ай бұрын
Have fun ;)
@AGI-Bingo
@AGI-Bingo 6 ай бұрын
Aaaaand 11Labs dropped SOTA SongGen
@GengoSenmon
@GengoSenmon 6 ай бұрын
How do we integrate this with SillyTavern so we can speak back and forth with the voice we generated?
@wakegary
@wakegary 6 ай бұрын
Sucks that they didn't put One Lab aside for something more traditional like textile manufacturing.
@zachary3603
@zachary3603 5 ай бұрын
Which python version did you use for this? Trying to get deepspeed to work, but it's saying Python 3.9 might be too high of a version xD
@creed4788
@creed4788 5 ай бұрын
Xtts fine tuning not working for all people! This tutorial is breake
@spejarn
@spejarn 6 ай бұрын
Having a 8GB card running LLM, SD and this thing for some roleplaying in Silly Tavern is getting silly. I hope next gen low-mid range cards has a decent amount of VRAM.
@ItsOk-mq9ex
@ItsOk-mq9ex 6 ай бұрын
4060 has 16gb variant, 5060 will probably be the same just higher cost.
@snark567
@snark567 2 ай бұрын
Got a refurbished 3090 just for the ram.
@PD-THANH
@PD-THANH 3 ай бұрын
This is a game changer! Training my own TTS model locally seemed impossible before, but this makes it surprisingly achievable. Has anyone tried using this with longer audio samples, like an audiobook narration? Curious to see how the quality scales.
@intelligenceservices
@intelligenceservices 2 ай бұрын
did you get it to install and run?
@Neyokah94
@Neyokah94 6 ай бұрын
This video is 17 minutes of "But wait, there's more!" and it's SOO good. Thanks!
@the-papaw
@the-papaw Ай бұрын
I can't get the second one to work (xtts-finetune-webui). I keep getting "error Connection errored out." when I try to do the "Step 1 Create dataset" Shows this in the DOS prompt "ERROR: Exception in ASGI application Traceback (most recent call last):
@vi6ddarkking
@vi6ddarkking 6 ай бұрын
Say Our AI overlord. A mad lad that goes by gradientai on huggingface. Made Llama 3 work with a 1 million+ token context size. Since you have a 4090 now. I am curious if you could see how far it can go on our local machines.
@geneanthony3421
@geneanthony3421 2 ай бұрын
Been loving your videos. AI is moving so fast anymore and I like to keep up.
@metanulski
@metanulski 6 ай бұрын
I like the Videos but the instalations never work. :-(
@JRis44
@JRis44 4 ай бұрын
What system you using and where you having issues?
@metanulski
@metanulski 4 ай бұрын
@@JRis44 windows, but I dont remember the exact error.
@JRis44
@JRis44 4 ай бұрын
@@metanulski have you attempted to try any of the ai apps lately? if you have brave browser we could do a walk through. I have some time today and dont feel like doing anything overly productive and dont mind helping someone today or this weekend maybe.
@varun3771
@varun3771 4 ай бұрын
@@JRis44hello if you could help me it would be good. I have alr torch 3 or higher installed but teh program insists on having torch 2.1.1
@redmatwinch
@redmatwinch 6 ай бұрын
12:45 where didi you take the index and pth file?
@Van_of_the_lake
@Van_of_the_lake 5 ай бұрын
You are the AI mentor of this generation, thanks for the hard work
@TomiTom1234
@TomiTom1234 6 ай бұрын
Awesome.. I always appreciate TTS and Voice cloning videos
@LJames-ez9lr
@LJames-ez9lr 5 ай бұрын
@TomiTom1234 did this actually work for you? none of them work for me after installing.
@TomiTom1234
@TomiTom1234 5 ай бұрын
@@LJames-ez9lr Two did work, but XTTS-RVC-UI didn't, I get a red error after installing it, sadly.
@zachary3603
@zachary3603 5 ай бұрын
@@LJames-ez9lr What errors are you getting?
@intelligenceservices
@intelligenceservices 2 ай бұрын
@@zachary3603 try installing it yourself?
@swannschilling474
@swannschilling474 6 ай бұрын
Thanks so much for this one! Cannot wait to try!! 😊
@lucifer9814
@lucifer9814 2 ай бұрын
After watching tons of such videos about these AI tools, I realized the ones making these tutorials, especially if they're programmers assume the whole world to be programmers as well. " Install this, install that, install this fucking shit ", he says, half of them don't even end up bloody installing. ALL I REALIZED IS THAT CODERS AND PROGRAMMERS HAVE NO PATIENCE WHATSOEVER AND DON'T TRY TO UNDERSTAND OTHERS FROM A LAYMAN TERM.
@Darfail
@Darfail Ай бұрын
eh believe me programmers struggle too to install anything, it's a perpetual hell
@lucifer9814
@lucifer9814 Ай бұрын
@@Darfail LoL
@yngeneer
@yngeneer 6 ай бұрын
sooo.....can it be stitched to the silytavern somehow?
@42ndMoose
@42ndMoose 6 ай бұрын
sillytavern already has a way of adding xtts extention, which has live realtime streaming. you can find that in the plugins tab in sillytavern. but then you'd have to go through a complicated process, at least for me. to put ST in staging etc
@duck-tube6786
@duck-tube6786 6 ай бұрын
I wish K covered off exactly this. How do you take your Uber TTS model and then run it in Sillytavern
@yngeneer
@yngeneer 6 ай бұрын
@@42ndMoose is there a tutorial for that?
@errorgradov8050
@errorgradov8050 6 ай бұрын
damn,fine-tuning was really heavy problem for me,now i got it thanks
@davidsmith-lv4kq
@davidsmith-lv4kq 6 ай бұрын
how much vram needed?
@tapikoBlends
@tapikoBlends Ай бұрын
amazing!
@Lil_Shoosh
@Lil_Shoosh Ай бұрын
3:23 1st Method 5:44 2nd Method 10:15 3rd method
@anoirbentanfous
@anoirbentanfous 4 ай бұрын
Now, develop this into an API for generating real-time text suitable for browser reading and audiobook listening. Ensure multilingual support, accurate number pronunciation, and handle various cases like omitting annotations and URLs.
@DarthyMaulocus
@DarthyMaulocus 3 ай бұрын
tried this got it working and i know how to get it to work on pretty much any machine now. must say you provide useful information however you miss out so many details that are necessary(you make no mention of the python versions or cuda for example but again anyone actually interested in this will persevere for a while), I managed to get it up and running partially due to your help much appreciated anyways
@magenta6
@magenta6 6 ай бұрын
Kudos to Coqui TTS for making this available!
@DiffusionStudio4k
@DiffusionStudio4k 4 ай бұрын
They are out of business unfortunately 😢
@thomasroyer5017
@thomasroyer5017 5 ай бұрын
is there a difference between your github and the original xtts-webui ?
@planetmuskvlog3047
@planetmuskvlog3047 6 ай бұрын
Yeah, but can it do foreign languages as easily as Eleven labs ‘multilingual v2 or v3?
@FluorescentApe
@FluorescentApe 6 ай бұрын
Is there a V3 for elevenlabs? I only se V2.
@planetmuskvlog3047
@planetmuskvlog3047 6 ай бұрын
@@FluorescentApe there’s “multilingual v3” now
@FluorescentApe
@FluorescentApe 6 ай бұрын
@@planetmuskvlog3047 what's weird. Can't see it. Maybe only a select portion of people can use it?
@jurandfantom
@jurandfantom 6 ай бұрын
Damn, I was hoping to hear/see something better in terms of quality since ByCloud/Jared videos. I hate to see such stagnation :/ Thanks AiT for update on the topic
@arete_
@arete_ 4 ай бұрын
15:48 for end result
@latemanparodius5133
@latemanparodius5133 6 ай бұрын
As I'm chatting in SillyTavern, I notice that the command windows try to reference emotion voice models, such as joy.pth with joy.index or surprise.pth with surprise.index. Sure, it still works without them, but do you know if those will have to be custom trained models for that character in that emotion, or is there some generalized emotion model somewhere that can be copy/pasted to multiple characters?
@Alice_Fumo
@Alice_Fumo 6 ай бұрын
Hell yes, this is exactly what I wanted!
@kylegeib9161
@kylegeib9161 6 ай бұрын
Wasn't a very good idea to repeat 37 seconds of reference audio at the start. With all the time I've used 11Labs' solution, even the ultimate version you have here doesn't sound as good as even their English V1 model.
@Aitrepreneur
@Aitrepreneur 6 ай бұрын
yeah it wasn't a good idea, just me being lazy but it still worked ok. Not sure I agree with the final result, it's very similar to an elevenlabs quality and it's free and unlimited, if you want to pay to use 11labs it's your choice, I'm giving another possibility to people who can't afford it or just want to save money for a very similar level of quality
@tanjabeckers9478
@tanjabeckers9478 6 ай бұрын
Merci ! Genial pour cette version Open ❤source
@MrPer4illo
@MrPer4illo 6 ай бұрын
Great job 👍 How about customizing LLM next?
@rachkaification
@rachkaification 6 ай бұрын
8:40 The reference audio sounds way better and closest to Obama's voice than the generated audio from it.
@MrRaja
@MrRaja 5 ай бұрын
I think someone installed a A.I. brain chip without me knowing like while i was asleep... For some reason i understood every single word in this video...😂😂
@thanesbusiness5001
@thanesbusiness5001 5 ай бұрын
after two hours, i still can't get it to launch. it opens then closes
@DaveGamesVT
@DaveGamesVT 5 ай бұрын
I'm guessing this doesn't work on AMD cards?
@LeCamionDAmar
@LeCamionDAmar 5 ай бұрын
Unfortunately the tutorial is outdated and nothing works anymore. sad :( Ok : Still works but i have to install dependency manually dont know why
@grayhamgrayhamson1466
@grayhamgrayhamson1466 6 ай бұрын
Perfect timing!
@ArabianShark
@ArabianShark 6 ай бұрын
Awesome video! I had been waiting for just this for ages! Thank you very much!
@ArmoredAnubis
@ArmoredAnubis 6 ай бұрын
Does this work in Silly tavern?
@GengoSenmon
@GengoSenmon 6 ай бұрын
This is what I wanted to know the entire video. How do we integrate this with SillyTavern so we can speak back and forth with the voice we generated?
@mauricioermel
@mauricioermel 6 ай бұрын
Why erverytime when I install XTTS FineTune it does not create the two folders base_models and finetune_models? When I run the start.bat it opens, but obviously I am not able to train any model.
@Mikes-Code
@Mikes-Code 5 ай бұрын
Requested float16 compute type, but the target device or backend do not support efficient float16 computation.
@andro1234567890100
@andro1234567890100 Ай бұрын
1. Open the xtts_demo python file using an IDE 2. change: if torch.cuda.is_available(): compute_type = "float16" else: compute_type = "float32" to: if torch.cuda.is_available(): compute_type = "int8" else: compute_type = "float32" 3. save Took me a whole day to figure out. Just needed some sleep and then went and read through the git repository's reported issues and found it in there. Seems to be working for me now.
@Ryzza5
@Ryzza5 5 ай бұрын
Any good tutorial video needs links to the required downloads.
@futaa34
@futaa34 5 ай бұрын
exactly
@DanielPartzsch
@DanielPartzsch 6 ай бұрын
Could you increase the tts quality and likeness of the second model even more with a longer audio clip than 2 minutes? Or doesn't it make any difference above this length?
@Jorvanius
@Jorvanius 5 ай бұрын
I'm wondering the same thing. Did you test it? 👀
@-Burs
@-Burs 4 ай бұрын
Cool stuff, I just wish more languages are supported.
@rickyparker2943
@rickyparker2943 6 ай бұрын
What do I do if I get this error message? ERROR: Could not find a version that satisfies the requirement torch==2.1.0 (from versions: 2.2.0+cu118, 2.2.1+cu118, 2.2.2+cu118, 2.3.0+cu118) ERROR: No matching distribution found for torch==2.1.0
@annonymat
@annonymat 6 ай бұрын
Same here!
@kevins_campfire
@kevins_campfire 6 ай бұрын
I get this error as well. google isn't being helpful so far
@vaughanbury1
@vaughanbury1 6 ай бұрын
@@kevins_campfire change 2.1.0 to 2.2.0 seems to be installing then
@AltMarc
@AltMarc 6 ай бұрын
@@kevins_campfire it probably has to do with the cu118, do you have Cuda version 11.8 installed? you can try to cheat it, by modifying the requirement file...
@FluorescentApe
@FluorescentApe 6 ай бұрын
@@AltMarc I have Cuda 11.8 installed and added to my path, but still doesn't work :/
@loszhor
@loszhor 6 ай бұрын
Thank you for the information.
@ssw4m
@ssw4m 5 ай бұрын
It couldn't be too difficult to find two minutes of Obama speaking. Why not spend a few minutes getting a longer sample, and presumably get even better results? Thanks for the demo, anyway, it's awesome tech.
@obamagaming7909
@obamagaming7909 6 ай бұрын
Would it be possible to integrate this into a python script?
@juleslincredule
@juleslincredule 6 ай бұрын
Great stuff! Anything for Mac users? Just asking... 😃
@geneanthony3421
@geneanthony3421 2 ай бұрын
Not sure why, but finetune seems to silently drop when trying to create a dataset. It runs until 100% and the process just closes without an error. Anyone else run into this issue?
@DestinyFaux
@DestinyFaux Ай бұрын
Couldn't even get it to run lol
@geneanthony3421
@geneanthony3421 Ай бұрын
@@DestinyFaux found out that my issue was that I was running in Conda not venv. Seems to work that way
@Kujamon
@Kujamon 6 ай бұрын
I get "ERROR: No matching distribution found for torch==2.1.1+cu118" when running the install.bat, despite intalling the pre-requisities
@LuminRL
@LuminRL 5 ай бұрын
ever find a fix??
@Kujamon
@Kujamon 5 ай бұрын
@@LuminRL Nope
@DarthyMaulocus
@DarthyMaulocus 3 ай бұрын
you need python 3.10 and set up path. ive got it all working any questions ask me. Its also in the issues page of github number 8 I also made a thread thats completed.
@LuminRL
@LuminRL 3 ай бұрын
@@DarthyMaulocus goat. once I get home I'll try again and see if I can work it out
@andro1234567890100
@andro1234567890100 Ай бұрын
@@DarthyMaulocus This worked. Thank you! For anyone going through the same struggle as me, don't try to download the zip files for < Py3.10.11. Py3.10.11 was the last release with a download link. Find that and you'll find the installer link.
@ROUNAK275
@ROUNAK275 3 ай бұрын
Can it work without gpu
@falco2911
@falco2911 4 ай бұрын
3:27 no module named 'requests'
@wedding_photography
@wedding_photography 5 ай бұрын
Definitely not as good as ElevenLabs, but not a bad result. Wish you had more examples, different speakers.
@Mateo906
@Mateo906 4 ай бұрын
When trying to install torch i get this error: ERROR: Could not find a version that satisfies the requirement torch==2.1.0 (from versions: none) ERROR: No matching distribution found for torch==2.1.0 Does someone knows why this happens?
@sneett7670
@sneett7670 3 ай бұрын
Thats because the python version is too new. just download the 3.10 one. I found this out the hard way.
@NT-eh6om
@NT-eh6om 2 ай бұрын
I was able to do it by installing torch and torchaudio seperately.
@intelligenceservices
@intelligenceservices 2 ай бұрын
​@@NT-eh6om where did you install those? globally, to an anaconda venv, or to the xtts venv? the installer itself is a mess which takes the confidence down a notch.
@InvadeNormandy
@InvadeNormandy 6 ай бұрын
Mines not working and keeps spitting out gradio errors despite following the instructions to the letter. webui and finetune both.
@blackpantherAI
@blackpantherAI 6 ай бұрын
where i can find AI voices already made by the community?
@Aitrepreneur
@Aitrepreneur 6 ай бұрын
google rvc models
@dumbsurvivor1
@dumbsurvivor1 4 ай бұрын
@@Aitrepreneur why don't you give the link in the description it's like you're trying to finess a little bit
@GraveUypo
@GraveUypo 5 ай бұрын
i had a setup with tortoise tts + rvc, but this seems better. thankfully it also works on linux, form just watching the video i thought it might not. my tortoise thing doesn't. i'll try it later.
@alexanderg8466
@alexanderg8466 5 ай бұрын
I had an error in the last step of downloading: "ImportError: DLL load failed while importing transformer_inference_op: The specified module could not be found."
@zSiuu
@zSiuu 4 ай бұрын
Were you able to solve it?
@streamy73
@streamy73 2 ай бұрын
Same issue
@SmexMyPocky
@SmexMyPocky 10 күн бұрын
I don't understand the point of the last step? Why are we going through RVC when it doesn make a model, just to download a reference wav?
@studiomusicflow4644
@studiomusicflow4644 3 ай бұрын
so. it needs NVidia to run? any way to run on cpu or amd?
@claytaan
@claytaan 5 ай бұрын
Do i need to install CUDA on my pc as well?
@mariokotlar303
@mariokotlar303 6 ай бұрын
Is this approach scalable to large text sizes? Like, if I tried to TTS an entire book, would that take infinite VRAM or endless dealing with 2 minute chunks or something, or would it just work?
@dziku2222
@dziku2222 4 ай бұрын
Doesn't work. At 2:21 call install.bat I get this message: Nie mo'venv' is not recognized as an internal or external command, operable program or batch file. 'pip' is not recognized as an internal or external command, operable program or batch file. 'pip' is not recognized as an internal or external command, operable program or batch file. Install deepspeed for windows for python 3.10.x and CUDA 11.8 Nie moInstall complete. Press any key to continue . . .
@GragSpcX_AI
@GragSpcX_AI 4 ай бұрын
Same here! I don’t know how to fix this issue. I’m a beginner!
@PresidentofAntifa
@PresidentofAntifa 3 ай бұрын
You in the correct folder?
@regeneric59
@regeneric59 6 ай бұрын
Does this work with any gpu? Or just nvidia
@ArthurHuizar
@ArthurHuizar 5 ай бұрын
I have AMD and CUDA is not having it.
@cavacino
@cavacino 6 ай бұрын
not working for me
@__syzygy__
@__syzygy__ 2 ай бұрын
fyi doesn't work with 3.12. If your system depends on 3.12, then you'll have to either downgrade or use pyenv. Certain dependencies, such as the torch dependency and tts itself does not work with 3.12
@noobicorn_gamer
@noobicorn_gamer 6 ай бұрын
For once, I’m not clickbaited and I’m happy i dropped by
@iamnobody-001
@iamnobody-001 13 күн бұрын
1:54 , Hi, you said to install the c++ visual studio but after i download and try to install, it offers so many option, which one do i have to install? or just the visual studio core editor only ? thanks for your help
@komakaze1
@komakaze1 4 ай бұрын
I'd like TTS voices to act. Imagine giving it a long story and it would whisper, shout, laugh, cry, express surprise, embarrassment, bravado, fear, courage, disgust, curiosity and interest through voice. Can any TTS AI do this yet?
@DarthyMaulocus
@DarthyMaulocus 3 ай бұрын
yes im working precisely on this, it requires using different models from memory, i mean sure you can switch out models hence have emotions, its just delay of switching which is the worry
@Darfail
@Darfail Ай бұрын
new advanced voice mode on chatgpt does that and...it's amazing
@benedictsforester7045
@benedictsforester7045 5 ай бұрын
finetune spits out errors like crazy. wasn't able to finish a single training
@anagnorisis2024
@anagnorisis2024 3 ай бұрын
Can this generate emotional voices like angry, sad, happy? Or it's basically just different voices but the intonations are more or less fixed?
@adriancoleman2876
@adriancoleman2876 6 ай бұрын
I cant wait till AI can recognize heavy reverb. i have ripped all of Dr Brackmans voice files from Supreme Commander in anticipation for just that day.
@zealgaming8161
@zealgaming8161 5 ай бұрын
I've been waiting to resurrect the late Tony Jay's work on The Transcendent One from Planescape Torment since forever. Highly recommend you check him out if you want a really scratchy, dark, evil god voice.
@katze_ksb
@katze_ksb Ай бұрын
fine tuning xtts encoder unfortunately throws errors ( Library cublas64_12.dll is not found or cannot be loaded)
@syedrahman1714
@syedrahman1714 Ай бұрын
same happening
@thays182
@thays182 3 ай бұрын
Are the outputs from these methods something that can be used for speech to speech in something like W-Okada? Or is that a different process?
@Tokaint
@Tokaint 3 ай бұрын
Do these have python integration? Like instead of using and downloading the webui could I jusr use code to tell it what to generate and save the file somewhere? And if yes where do I go to find out how to do that?
@mehmeterenkose3018
@mehmeterenkose3018 3 ай бұрын
When I try to fine tune xtts it gives me error: FileNotFound metadata_train.csv. How can I solve this issue?
@DarthyMaulocus
@DarthyMaulocus 3 ай бұрын
i tried with a different file and it worked i had same issue i refreshed and used a wav file
@mehmeterenkose3018
@mehmeterenkose3018 3 ай бұрын
@@DarthyMaulocus thanks.
@digitalface9055
@digitalface9055 6 ай бұрын
now I would really like to see tutorial how to fine tune your own language model and utilize it in LLMs.
@nixaristix1819
@nixaristix1819 4 ай бұрын
Thanks! Can I use these methods for audiobooks with synthetic voices?
@the-papaw
@the-papaw Ай бұрын
great info, very poor instructions. thx
@silentrobcanada
@silentrobcanada 5 ай бұрын
Thoughts on how this compares to StyleTTS 2? And can you capture / translate emotions like *sigh*, laughter and sarcasm?
@VaibhavShewale
@VaibhavShewale 5 ай бұрын
minimum system requirements?
@DanielPartzsch
@DanielPartzsch 5 ай бұрын
Why do you not just use the rvc enhancement option in the xtts WebUI directly? Is it slower or of lesser quality compared to the full RVC version?
@Shizaho
@Shizaho 2 ай бұрын
I already created a voice model with RVC. Why do I have to use a voice sample to create TTS? What sample should I use?
@redt1903
@redt1903 6 ай бұрын
Nah bro ima use this for brainrot edits🔥🔥🔥🔥🔥
@TengriTürktenYana
@TengriTürktenYana Ай бұрын
Where is the download link for FFMPEG and AUTOINSTALLER that you show in the video?
@SouthbayCreations
@SouthbayCreations 6 ай бұрын
Difficult to follow the installation instructions
@fjccommish
@fjccommish 6 ай бұрын
It's a video about sound. You have bad background music playing throughout, obscuring the sound.
Best AI Voice Generator | 2024.08
44:54
Thorsten-Voice
Рет қаралды 16 М.
INSTALL BEST UNCENSORED Roleplay TextGen UI LOCALLY in 1 CLICK!
27:49
ЛУЧШИЙ ФОКУС + секрет! #shorts
00:12
Роман Magic
Рет қаралды 29 МЛН
amazing#devil #lilith #funny #shorts
00:15
Devil Lilith
Рет қаралды 18 МЛН
Players vs Pitch 🤯
00:26
LE FOOT EN VIDÉO
Рет қаралды 120 МЛН
КОГДА К БАТЕ ПРИШЕЛ ДРУГ😂#shorts
00:59
BATEK_OFFICIAL
Рет қаралды 7 МЛН
10 AI Animation Tools You Won’t Believe are Free
16:02
Futurepedia
Рет қаралды 319 М.
ULTIMATE FREE UNCENSORED AI Model Workflow Is HERE! Start HERE!
26:17
Lm Studio Offline Voice Assistant + Applio. 100% Free
30:23
MushieKings
Рет қаралды 3,2 М.
Run your own AI (but private)
22:13
NetworkChuck
Рет қаралды 1,6 МЛН
How To Clone ANY Voice In Under 5 MIN w/ Eleven Labs AI
14:09
The Joe Rogan AI Experience
Рет қаралды 41 М.
INSTALL UNCENSORED TextGen Ai WebUI LOCALLY in 1 CLICK!
20:52
Aitrepreneur
Рет қаралды 454 М.
Free AI Audio Tools You Won't Believe Exist
17:22
Mike Russell
Рет қаралды 613 М.
ULTIMATE SDXL LORA Training! Get THE BEST RESULTS!
52:11
Aitrepreneur
Рет қаралды 210 М.
Learn RUBI: The Basics 101
34:15
Rubi Revolution
Рет қаралды 24
ЛУЧШИЙ ФОКУС + секрет! #shorts
00:12
Роман Magic
Рет қаралды 29 МЛН