HELLO HUMANS! Thank you for watching & do NOT forget to LIKE and SUBSCRIBE For More Ai Updates. Thx
@Refinement996 ай бұрын
How come you are not using ai speech in your videos? I'm not trying to be mean or judge or anything, but you have a strong accent and it's sometimes hard to understand. Even though I love your content
@Sanguen6666 ай бұрын
not bad, but tortoise is much more accurate imho. check out the video 'how charsi became a blacksmith' it was made with tortoise TTS.
@trezero6 ай бұрын
Would love to see how this can all be done programmatically as a next step.
@germanher75286 ай бұрын
MEET LOCAL TTs NEAR YOU!!!! 1-800-HUGE-BOOBAS
@zirufe5 ай бұрын
Hi. If I would like to add another language for training the voice for TTS, is there any workaround or other model that can be used like in this video?
@GGfrostZ2 ай бұрын
Have fun in dependency hell trying to install this!
@heady29056 ай бұрын
XTTS has such a good voice generation. If you repeat a sentence it sounds everytime different and if you combine it with your RVC voice model you got the best thing ever. The future of AI must be open source and it is good you show everybody how to use this powerful AI technology. Greetings from Germany 🙂
@v11cu966 ай бұрын
Yeah, I wouldn't say the 'sounding different each time' is a good thing when you want consistency though. Sorry for being negative. So far my open source TTS journey has not been great. I feel like I'm lucky if I can generate 2 sentences in a consistent pitch and accent. Maybe I need to try with an RVC model like you suggest? Or just wait for the tech to improve a bit more.
@MrRaja6 ай бұрын
@@v11cu96 The RVC method would be you recording the sentences how you'd like, or using XTTS to generate an audio file to be used in RVC... My problem is how do I make it emotional, like for example: What![Angry] Why didn't you tell me?[Desperate]
@LJames-ez9lr5 ай бұрын
@heady2905 does this TTS actually work for you? I got errors during installation and it won't launch.
@MAST5 ай бұрын
This comment sounds like AI.
@LJames-ez9lr5 ай бұрын
@@MAST i figured it out! i only had Visual studio installed and not the tools
@OctoberFox3 ай бұрын
There's a lot of information he leaves out of his instructions in many of his videos (like missing requirements). Thanks to the other commenters for some of their input, as it's been really helpful.
@Ull3Rnet6 ай бұрын
Hey @Aitrepreneur - Quick heads up on the XTTS-RVC-UI install on Win 10. It installs protobuf 5.26.1 by default, which didn't work for me. Downgrading to protobuf 3.20.0 fixed the issue. Just thought this might help others running into the same problem!
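The protobuf report above can be checked before launching anything. This is only a rough sketch: the cutoff of "major version 3" is an assumption drawn from this one comment (5.26.1 failed, 3.20.0 worked), and the helper names are made up for illustration. Run it with the project's own venv interpreter so it inspects the right environment.

```python
from importlib import metadata
from typing import Optional


def protobuf_needs_downgrade(installed: str, max_major: int = 3) -> bool:
    """Return True when the installed major version is above max_major.

    max_major=3 is an assumption based on the comment above, not a
    documented requirement of XTTS-RVC-UI.
    """
    major = int(installed.split(".")[0])
    return major > max_major


def installed_protobuf_version() -> Optional[str]:
    """Return the protobuf version in the active environment, or None."""
    try:
        return metadata.version("protobuf")
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    version = installed_protobuf_version()
    if version is None:
        print("protobuf is not installed in this environment")
    elif protobuf_needs_downgrade(version):
        print(f"protobuf {version} found; try: pip install protobuf==3.20.0")
    else:
        print(f"protobuf {version} matches what the commenter reported working")
```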
@yoniwoker6 ай бұрын
I installed it, but the cmd window keeps closing. I press est, and cmd appears and closes quickly.
@Ull3Rnet6 ай бұрын
@@yoniwoker try editing the bat file, add a new line at the bottom saying Pause , save, then run it again. Then you should be able to see the error before the window closes :)
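An alternative to adding Pause is to launch the script from Python and keep everything it prints in a log file, so the error survives after the window closes. A minimal sketch, assuming the launcher is called `start.bat` (the thread does not name the file) and using a hypothetical helper `run_and_log`:

```python
import subprocess
from pathlib import Path
from typing import Sequence


def run_and_log(command: Sequence[str], log_path: Path) -> int:
    """Run command, write its stdout and stderr to log_path, return exit code."""
    result = subprocess.run(
        list(command),
        stdout=subprocess.PIPE,
        stderr=subprocess.STDOUT,  # merge errors into the same stream
        text=True,
    )
    log_path.write_text(result.stdout, encoding="utf-8")
    return result.returncode


# Example on Windows (file name is an assumption):
#   code = run_and_log(["cmd", "/c", "start.bat"], Path("start_log.txt"))
#   print("exit code:", code, "- see start_log.txt for the full output")
```

The log approach is handy when the traceback is longer than the console buffer; the Pause trick is simpler if you only need the last few lines.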
@SuperFurias6 ай бұрын
For everyone having setup.py errors:
1. Run the install.bat file and wait for everything to be installed.
2. See the error, then close the cmd.
3. Open a new cmd inside the folder.
4. Type "call venv\Scripts\activate".
5. Type "pip install tts", wait for everything to be installed, then close the cmd.
6. Run the install.bat file again.
No more setup.py errors. Don't ask me why, because I don't know.
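The activate-then-pip part of that workaround can also be done in one go by calling the venv's own interpreter, which avoids depending on which environment the cmd window happens to have active. This is a sketch only: the `venv` folder name comes from the comment above, while the helper names and the idea of scripting it are assumptions.

```python
import subprocess
from pathlib import Path
from typing import List


def venv_pip_command(project_dir: Path, package: str, windows: bool = True) -> List[str]:
    """Build the command that installs package using the venv's interpreter."""
    # Windows venvs keep python.exe under Scripts; Linux/macOS use bin/python.
    sub = Path("Scripts") / "python.exe" if windows else Path("bin") / "python"
    return [str(project_dir / "venv" / sub), "-m", "pip", "install", package]


def install_in_venv(project_dir: Path, package: str, windows: bool = True) -> None:
    """Run the install and raise CalledProcessError if pip fails."""
    subprocess.run(venv_pip_command(project_dir, package, windows), check=True)


# Example, run from inside the project folder (then run install.bat again):
#   install_in_venv(Path("."), "tts")
```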
@StainCorb6 ай бұрын
This finally solved my Finetune install, now I just need a couple of days to figure out RVC version, installing things through the command prompt is a party... ... lol
@Rambo....6 ай бұрын
👍Thanks man, you solved my problem here.
@petepablogaming2436 ай бұрын
What the hell. That fixed it. I also have no idea why though
@diehgo_sp6 ай бұрын
totally not working for me
@SuperFurias6 ай бұрын
@@diehgo_sp Sounds strange, did you do it correctly? I know for sure that installing the tts package fixes setup.py errors. But maybe you are having a different error, or simply did the procedure wrong, so could you tell me exactly what error you are having, and where?
@vi6ddarkking6 ай бұрын
Between this in six months, the SD3 finetunes, the tools that are finally getting us to consistent characters, and the Llama 3 finetunes, this year's SillyTavern video is going to be bonkers.
@user-jk9zr3sc5h6 ай бұрын
SD3 sadly won't work with controlnet due to its lack of UNET architecture, but hopefully something similar is shipped soon
@GengoSenmon6 ай бұрын
How do we integrate this with SillyTavern so we can speak back and forth with the voice we generated?
@wakegary6 ай бұрын
i dont know what ure talking about but i cloned this depos and so um now I need to know what ure talking about. cuz this is rad and does tortoise level at bark speeds. god we're nerds. (sonic checking watch gif) also expecting open source udio clone (odel-may eak-lay) to get git got soon. hoping! is musicgen still the top dog?
@ItsBrody6 ай бұрын
Got hella complicated by the half way point. I wish you used a different persons voice, Obama's voice almost already sounds like a robot and the last sample you showed us honestly didnt sound that great.
@derekthemagician6 ай бұрын
Whenever I see his logo image I know I want to do what he's doing but I'm not going to be able to.. Lol
@sir_no_name14786 ай бұрын
Lol for real? I mean you can stop the video when it is complicated and follow it step by step. And if it is really unclear just google the words he says. I would not use audacity to append the voice to a 2 minute clip though. There is a reason they want 2 minutes.
@jvdome5 ай бұрын
Yea, i think this still too hard to do, i could follow step by step but dealing with errors is not worth the hassle if i am doing this for fun
@zachary36035 ай бұрын
@@derekthemagician This is the only thing that makes it worth doing rn. As soon as it becomes easy, it won't stand out much.
@PACOBRYAN-cj9gf3 ай бұрын
just summarize the vid with AI
@Neyokah946 ай бұрын
This video is 17 minutes of "But wait, there's more!" and it's SOO good. Thanks!
@carlosgonbr6 ай бұрын
Unfortunately, it cannot be used, many errors occur during installation following the steps given. Code errors, things not found, different versions. things that are difficult for non-programmers to understand. what a shame.
@futaa346 ай бұрын
you are correct buddy. only a handful could have it running
@zytoh5 ай бұрын
I started debugging with ChatGPT, followed every step and error, and after 7 hours got it working. Just don't give up, especially if it's for a business, which is why I'm using it.
@carlosgonbr5 ай бұрын
I discovered the main problem with my installation and after solving it, I installed everything without any errors. It's the Microsoft Visual C++ 14 package. It's not enough to just install Visual Studio; you have to install the package along with it, but it's not that intuitive. Look for a video "Fix: Microsoft Visual C++ 14.0 or greater is required in Python" from the "Hey, Let's Learn Something" channel that helped me. It's very simple. Then come back here and thank our friend from the channel who introduced us to this wonder. The three programs installed and are working without errors!
@popipopi31265 ай бұрын
took me hours to fix everything even knowing what im doing
@ChasingStars71114 ай бұрын
@@popipopi3126 how did you fix it ?
@olucassantos4 ай бұрын
I couldn't get past 2 minutes of tutorial, there were so many errors, some I solved, others were impossible to solve. 4 hours trying. Still, thank you very much for your efforts, unfortunately I gave up trying
@intelligenceservices2 ай бұрын
you're right, this is really badly maintained.
@siddhantshahi502725 күн бұрын
took me 4 days to solve all the errors
@sz95152 күн бұрын
@@siddhantshahi5027 do you have a simple steps process?
@PD-THANH3 ай бұрын
This is a game changer! Training my own TTS model locally seemed impossible before, but this makes it surprisingly achievable. Has anyone tried using this with longer audio samples, like an audiobook narration? Curious to see how the quality scales.
@intelligenceservices2 ай бұрын
did you get it to install and run?
@the-papaw2 ай бұрын
I can't get the second one to work (xtts-finetune-webui). I keep getting "error Connection errored out." when I try to do the "Step 1 Create dataset" Shows this in the DOS prompt "ERROR: Exception in ASGI application Traceback (most recent call last):
@audiogus26516 ай бұрын
Woah, perfect timing on the vid, was just looking this stuff up today. Thanks homie!
@nodewizard6 ай бұрын
Goodbye Eleven Labs. They were overpriced and closed source. This XTTS model is amazing. Merci Monsieur Aitrepreneur. As a little goodbye kiss to Eleven Labs, I'm going to clone my favourite voices that they have. Lol.
@Aitrepreneur6 ай бұрын
Have fun ;)
@AGI-Bingo6 ай бұрын
Aaaaand 11Labs dropped SOTA SongGen
@GengoSenmon6 ай бұрын
How do we integrate this with SillyTavern so we can speak back and forth with the voice we generated?
@wakegary6 ай бұрын
Sucks that they didn't put One Lab aside for something more traditional like textile manufacturing.
@zachary36035 ай бұрын
Which python version did you use for this? Trying to get deepspeed to work, but it's saying Python 3.9 might be too high of a version xD
@coloryvr6 ай бұрын
Cool! I was waiting for this! Happy colored Greetinx!
@geneanthony34212 ай бұрын
Been loving your videos. AI is moving so fast anymore and I like to keep up.
@Mikes-Code5 ай бұрын
Requested float16 compute type, but the target device or backend do not support efficient float16 computation.
@andro1234567890100Ай бұрын
1. Open the xtts_demo python file using an IDE.
2. Change:
    if torch.cuda.is_available():
        compute_type = "float16"
    else:
        compute_type = "float32"
   to:
    if torch.cuda.is_available():
        compute_type = "int8"
    else:
        compute_type = "float32"
3. Save.
Took me a whole day to figure out. Just needed some sleep and then went and read through the git repository's reported issues and found it in there. Seems to be working for me now.
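Instead of hard-coding "int8", the choice could be made from the GPU's compute capability, so newer cards keep float16. A hedged sketch: the 7.0 cutoff is an assumption (a common rule of thumb for efficient float16; for reference, a GTX 1070 reports 6.1), `pick_compute_type` and `detect_compute_type` are hypothetical names, and this is not code from the xtts_demo file.

```python
from typing import Optional, Tuple


def pick_compute_type(cuda_available: bool,
                      capability: Optional[Tuple[int, int]]) -> str:
    """Choose a compute type from CUDA availability and (major, minor)."""
    if not cuda_available or capability is None:
        return "float32"
    # Assumption: cards below 7.0 lack efficient float16, so fall back to int8.
    return "float16" if tuple(capability) >= (7, 0) else "int8"


def detect_compute_type() -> str:
    """Query torch if it is installed; otherwise behave as a CPU-only setup."""
    try:
        import torch
    except ImportError:
        return pick_compute_type(False, None)
    if not torch.cuda.is_available():
        return pick_compute_type(False, None)
    return pick_compute_type(True, torch.cuda.get_device_capability())


if __name__ == "__main__":
    print("suggested compute_type:", detect_compute_type())
```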
@spejarn6 ай бұрын
Having a 8GB card running LLM, SD and this thing for some roleplaying in Silly Tavern is getting silly. I hope next gen low-mid range cards has a decent amount of VRAM.
@ItsOk-mq9ex6 ай бұрын
4060 has 16gb variant, 5060 will probably be the same just higher cost.
@snark5672 ай бұрын
Got a refurbished 3090 just for the ram.
@TomiTom12346 ай бұрын
Awesome.. I always appreciate TTS and Voice cloning videos
@LJames-ez9lr5 ай бұрын
@TomiTom1234 did this actually work for you? none of them work for me after installing.
@TomiTom12345 ай бұрын
@@LJames-ez9lr Two did work, but XTTS-RVC-UI didn't, I get a red error after installing it, sadly.
@zachary36035 ай бұрын
@@LJames-ez9lr What errors are you getting?
@intelligenceservices2 ай бұрын
@@zachary3603 try installing it yourself?
@Van_of_the_lake5 ай бұрын
You are the AI mentor of this generation, thanks for the hard work
@vi6ddarkking6 ай бұрын
Say Our AI overlord. A mad lad that goes by gradientai on huggingface. Made Llama 3 work with a 1 million+ token context size. Since you have a 4090 now. I am curious if you could see how far it can go on our local machines.
@DarthyMaulocus4 ай бұрын
Tried this, got it working, and I know how to get it to work on pretty much any machine now. I must say you provide useful information; however, you leave out many necessary details (you make no mention of the Python versions or CUDA, for example, but anyone actually interested in this will persevere for a while). I managed to get it up and running partially thanks to your help. Much appreciated anyway.
@magenta66 ай бұрын
Kudos to Coqui TTS for making this available!
@DiffusionStudio4k4 ай бұрын
They are out of business unfortunately 😢
@errorgradov80506 ай бұрын
damn,fine-tuning was really heavy problem for me,now i got it thanks
@redmatwinch6 ай бұрын
12:45 Where did you get the index and pth files?
@metanulski6 ай бұрын
I like the videos but the installations never work. :-(
@JRis444 ай бұрын
What system you using and where you having issues?
@metanulski4 ай бұрын
@@JRis44 windows, but I dont remember the exact error.
@JRis444 ай бұрын
@@metanulski have you attempted to try any of the ai apps lately? if you have brave browser we could do a walk through. I have some time today and dont feel like doing anything overly productive and dont mind helping someone today or this weekend maybe.
@varun37714 ай бұрын
@@JRis44 Hello, if you could help me it would be good. I already have torch 3 or higher installed but the program insists on having torch 2.1.1
@swannschilling4746 ай бұрын
Thanks so much for this one! Cannot wait to try!! 😊
@noobicorn_gamer6 ай бұрын
For once, I’m not clickbaited and I’m happy i dropped by
@lucifer98142 ай бұрын
After watching tons of such videos about these AI tools, I realized the ones making these tutorials, especially if they're programmers assume the whole world to be programmers as well. " Install this, install that, install this fucking shit ", he says, half of them don't even end up bloody installing. ALL I REALIZED IS THAT CODERS AND PROGRAMMERS HAVE NO PATIENCE WHATSOEVER AND DON'T TRY TO UNDERSTAND OTHERS FROM A LAYMAN TERM.
@DarfailАй бұрын
eh believe me programmers struggle too to install anything, it's a perpetual hell
@lucifer9814Ай бұрын
@@Darfail LoL
@juleslincredule6 ай бұрын
Great stuff! Anything for Mac users? Just asking... 😃
@latemanparodius51336 ай бұрын
As I'm chatting in SillyTavern, I notice that the command windows try to reference emotion voice models, such as joy.pth with joy.index or surprise.pth with surprise.index. Sure, it still works without them, but do you know if those will have to be custom trained models for that character in that emotion, or is there some generalized emotion model somewhere that can be copy/pasted to multiple characters?
@anoirbentanfous5 ай бұрын
Now, develop this into an API for generating real-time text suitable for browser reading and audiobook listening. Ensure multilingual support, accurate number pronunciation, and handle various cases like omitting annotations and URLs.
@tanjabeckers94786 ай бұрын
Merci ! Genial pour cette version Open ❤source
@creed47886 ай бұрын
XTTS fine-tuning is not working for everyone! This tutorial is broken
@MrPer4illo6 ай бұрын
Great job 👍 How about customizing LLM next?
@Lil_ShooshАй бұрын
3:23 1st Method 5:44 2nd Method 10:15 3rd method
@planetmuskvlog30476 ай бұрын
Yeah, but can it do foreign languages as easily as Eleven labs ‘multilingual v2 or v3?
@FluorescentApe6 ай бұрын
Is there a V3 for elevenlabs? I only se V2.
@planetmuskvlog30476 ай бұрын
@@FluorescentApe there’s “multilingual v3” now
@FluorescentApe6 ай бұрын
@@planetmuskvlog3047 That's weird. Can't see it. Maybe only a select portion of people can use it?
@tapikoBlends2 ай бұрын
amazing!
@jurandfantom6 ай бұрын
Damn, I was hoping to hear/see something better in terms of quality since ByCloud/Jared videos. I hate to see such stagnation :/ Thanks AiT for update on the topic
@DanielPartzsch6 ай бұрын
Could you increase the tts quality and likeness of the second model even more with a longer audio clip than 2 minutes? Or doesn't it make any difference above this length?
@Jorvanius5 ай бұрын
I'm wondering the same thing. Did you test it? 👀
@MrRaja6 ай бұрын
I think someone installed a A.I. brain chip without me knowing like while i was asleep... For some reason i understood every single word in this video...😂😂
@Alice_Fumo6 ай бұрын
Hell yes, this is exactly what I wanted!
@mauricioermel6 ай бұрын
Why is it that every time I install XTTS FineTune it does not create the two folders base_models and finetune_models? When I run the start.bat it opens, but obviously I am not able to train any model.
@yngeneer6 ай бұрын
sooo.....can it be stitched to SillyTavern somehow?
@42ndMoose6 ай бұрын
SillyTavern already has a way of adding an XTTS extension, which has live realtime streaming. You can find that in the plugins tab in SillyTavern. But then you'd have to go through a complicated process, at least for me, to put ST in staging etc.
@duck-tube67866 ай бұрын
I wish K covered off exactly this. How do you take your Uber TTS model and then run it in Sillytavern
@yngeneer6 ай бұрын
@@42ndMoose is there a tutorial for that?
@LeCamionDAmar5 ай бұрын
Unfortunately the tutorial is outdated and nothing works anymore. sad :( Ok : Still works but i have to install dependency manually dont know why
@Tokaint4 ай бұрын
Do these have Python integration? Like instead of using and downloading the webui, could I just use code to tell it what to generate and save the file somewhere? And if yes, where do I go to find out how to do that?
@obamagaming79096 ай бұрын
Would it be possible to integrate this into a python script?
@davidsmith-lv4kq6 ай бұрын
how much vram needed?
@rachkaification6 ай бұрын
8:40 The reference audio sounds way better and closest to Obama's voice than the generated audio from it.
@kritischinteressiert6 ай бұрын
Thank you for the great video ! Is there a proper ATS as well? Or an app, that does Dubbing like 11Labs?
@redt19036 ай бұрын
Nah bro ima use this for brainrot edits🔥🔥🔥🔥🔥
@Ryzza56 ай бұрын
Any good tutorial video needs links to the required downloads.
@futaa346 ай бұрын
exactly
@loszhor6 ай бұрын
Thank you for the information.
@Kujamon6 ай бұрын
I get "ERROR: No matching distribution found for torch==2.1.1+cu118" when running the install.bat, despite installing the prerequisites
@LuminRL5 ай бұрын
ever find a fix??
@Kujamon5 ай бұрын
@@LuminRL Nope
@DarthyMaulocus4 ай бұрын
You need Python 3.10 and to set up the PATH. I've got it all working; any questions, ask me. It's also on the GitHub issues page, number 8. I also made a thread that's completed.
@LuminRL4 ай бұрын
@@DarthyMaulocus goat. once I get home I'll try again and see if I can work it out
@andro12345678901002 ай бұрын
@@DarthyMaulocus This worked. Thank you! For anyone going through the same struggle as me, don't try to download the zip files for < Py3.10.11. Py3.10.11 was the last release with a download link. Find that and you'll find the installer link.
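Since the fix reported in this thread is "use Python 3.10", a quick check of which interpreter is actually being picked up can save a failed install. A small sketch, assuming 3.10 is the target purely because the commenters above say so; `python_matches` is a hypothetical helper.

```python
import sys
from typing import Tuple


def python_matches(version: Tuple[int, ...],
                   wanted: Tuple[int, int] = (3, 10)) -> bool:
    """Return True when the (major, minor) part of version equals wanted."""
    return tuple(version[:2]) == wanted


if __name__ == "__main__":
    current = tuple(sys.version_info[:3])
    if python_matches(current):
        print("Python", ".".join(map(str, current)), "matches 3.10")
    else:
        print("Python", ".".join(map(str, current)),
              "is not 3.10; the pinned torch build may not be found")
    # Shows which interpreter is first on PATH for this shell.
    print("interpreter:", sys.executable)
```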
@thomasroyer50176 ай бұрын
is there a difference between your github and the original xtts-webui ?
@nicotvupa6 ай бұрын
Thanks! I found the colab for the first one. Are there colabs for the other 2?
@ash38444 ай бұрын
colab link pls
@adriancoleman28766 ай бұрын
I cant wait till AI can recognize heavy reverb. i have ripped all of Dr Brackmans voice files from Supreme Commander in anticipation for just that day.
@zealgaming81616 ай бұрын
I've been waiting to resurrect the late Tony Jay's work on The Transcendent One from Planescape Torment since forever. Highly recommend you check him out if you want a really scratchy, dark, evil god voice.
@ArabianShark6 ай бұрын
Awesome video! I had been waiting for just this for ages! Thank you very much!
@thanesbusiness50016 ай бұрын
after two hours, i still can't get it to launch. it opens then closes
@mariokotlar3036 ай бұрын
Is this approach scalable to large text sizes? Like, if I tried to TTS an entire book, would that take infinite VRAM or endless dealing with 2 minute chunks or something, or would it just work?
@DaveGamesVT6 ай бұрын
I'm guessing this doesn't work on AMD cards?
@nixaristix18194 ай бұрын
Thanks! Can I use these methods for audiobooks with synthetic voices?
@-Burs4 ай бұрын
Cool stuff, I just wish more languages are supported.
@ssw4m5 ай бұрын
It couldn't be too difficult to find two minutes of Obama speaking. Why not spend a few minutes getting a longer sample, and presumably get even better results? Thanks for the demo, anyway, it's awesome tech.
@arete_4 ай бұрын
15:48 for end result
@JohnRiley-r7j5 ай бұрын
Wow a finetuning WEBui is awesome,in all other apps my 8gb Vram GPU was not nearly enough for training but with this Vram usage is like you said minimum,training is pretty fast and quality is amazing! One question,what if you want to train one model to be good with multiple voices,is that even possible or you need to train new model with every new voice you are using? Thanks!
@GraveUypo5 ай бұрын
i had a setup with tortoise tts + rvc, but this seems better. thankfully it also works on linux, form just watching the video i thought it might not. my tortoise thing doesn't. i'll try it later.
@kylegeib91616 ай бұрын
Wasn't a very good idea to repeat 37 seconds of reference audio at the start. With all the time I've used 11Labs' solution, even the ultimate version you have here doesn't sound as good as even their English V1 model.
@Aitrepreneur6 ай бұрын
yeah it wasn't a good idea, just me being lazy but it still worked ok. Not sure I agree with the final result, it's very similar to an elevenlabs quality and it's free and unlimited, if you want to pay to use 11labs it's your choice, I'm giving another possibility to people who can't afford it or just want to save money for a very similar level of quality
@InvadeNormandy6 ай бұрын
Mines not working and keeps spitting out gradio errors despite following the instructions to the letter. webui and finetune both.
@andrerd9911 күн бұрын
Great video man! is there anyway possible that i can tts and transform the audio directly to my mic on discord for example with this?
@claytaan6 ай бұрын
Do i need to install CUDA on my pc as well?
@digitalface90556 ай бұрын
now I would really like to see tutorial how to fine tune your own language model and utilize it in LLMs.
@DanielPartzsch6 ай бұрын
Why do you not just use the rvc enhancement option in the xtts WebUI directly? Is it slower or of lesser quality compared to the full RVC version?
@thays1824 ай бұрын
Are the outputs from these methods something that can be used for speech to speech in something like W-Okada? Or is that a different process?
@rickyparker29436 ай бұрын
What do I do if I get this error message? ERROR: Could not find a version that satisfies the requirement torch==2.1.0 (from versions: 2.2.0+cu118, 2.2.1+cu118, 2.2.2+cu118, 2.3.0+cu118) ERROR: No matching distribution found for torch==2.1.0
@annonymat6 ай бұрын
Same here!
@kevins_campfire6 ай бұрын
I get this error as well. google isn't being helpful so far
@vaughanbury16 ай бұрын
@@kevins_campfire change 2.1.0 to 2.2.0 seems to be installing then
@AltMarc6 ай бұрын
@@kevins_campfire it probably has to do with the cu118, do you have Cuda version 11.8 installed? you can try to cheat it, by modifying the requirement file...
@FluorescentApe6 ай бұрын
@@AltMarc I have Cuda 11.8 installed and added to my path, but still doesn't work :/
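The "change 2.1.0 to 2.2.0" suggestion in this thread amounts to editing one pin in the requirements file. A sketch of that edit, with the caveat that nobody here confirmed the rest of the project works with the newer torch, and that the file name `requirements.txt` and the helper `repin` are assumptions:

```python
import re
from pathlib import Path


def repin(requirements_text: str, package: str, new_version: str) -> str:
    """Replace an exact 'package==version' pin, leaving other lines alone."""
    # Anchored at line start so 'torch' does not also match 'torchaudio'.
    pattern = re.compile(rf"^{re.escape(package)}==\S+", flags=re.MULTILINE)
    return pattern.sub(lambda _m: f"{package}=={new_version}", requirements_text)


def repin_file(path: Path, package: str, new_version: str) -> None:
    """Rewrite the pin in place; keep a backup of the file before using this."""
    original = path.read_text(encoding="utf-8")
    path.write_text(repin(original, package, new_version), encoding="utf-8")


# Example (file name is an assumption):
#   repin_file(Path("requirements.txt"), "torch", "2.2.0")
```

If the error lists no available versions at all ("from versions: none"), changing the pin will not help; other commenters report that the Python version is the cause in that case.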
@j0hngk1396 ай бұрын
Hi, do you plan to make a video on how to use LLMs and other AI things using ROCm for Radeon and Windows users?
@Aitrepreneur6 ай бұрын
well I don't have any AMD GPU so can't really show that
@ts757arse5 ай бұрын
I've settled on the fact that I either have to wait a few months for AMD (or quants to enable CPU) support or get an Nvidia card. Nvidia is still on my shit list and I won't be doing that. The company's practices haven't changed.
@ArmoredAnubis6 ай бұрын
Does this work in Silly tavern?
@GengoSenmon6 ай бұрын
This is what I wanted to know the entire video. How do we integrate this with SillyTavern so we can speak back and forth with the voice we generated?
@grayhamgrayhamson14666 ай бұрын
Perfect timing!
@iamnobody-00124 күн бұрын
1:54 , Hi, you said to install the c++ visual studio but after i download and try to install, it offers so many option, which one do i have to install? or just the visual studio core editor only ? thanks for your help
@katze_ksbАй бұрын
fine tuning xtts encoder unfortunately throws errors ( Library cublas64_12.dll is not found or cannot be loaded)
@syedrahman1714Ай бұрын
same happening
@Mateo9064 ай бұрын
When trying to install torch i get this error: ERROR: Could not find a version that satisfies the requirement torch==2.1.0 (from versions: none) ERROR: No matching distribution found for torch==2.1.0 Does someone knows why this happens?
@sneett76703 ай бұрын
That's because the Python version is too new. Just download the 3.10 one. I found this out the hard way.
@NT-eh6om2 ай бұрын
I was able to do it by installing torch and torchaudio separately.
@intelligenceservices2 ай бұрын
@@NT-eh6om where did you install those? globally, to an anaconda venv, or to the xtts venv? the installer itself is a mess which takes the confidence down a notch.
@regeneric596 ай бұрын
Does this work with any gpu? Or just nvidia
@ArthurHuizar5 ай бұрын
I have AMD and CUDA is not having it.
@falco29114 ай бұрын
3:27 no module named 'requests'
@rhythmradius6 ай бұрын
I get lost at the 6:00 point when you start talking about xtts-finetune-webui. Where is that supposed to be?
@rhythmradius5 ай бұрын
Never mind. I got it! :)
@anagnorisis20243 ай бұрын
Can this generate emotional voices like angry, sad, happy? Or it's basically just different voices but the intonations are more or less fixed?
@alexanderg84665 ай бұрын
I had an error in the last step of downloading: "ImportError: DLL load failed while importing transformer_inference_op: The specified module could not be found."
@zSiuu4 ай бұрын
Were you able to solve it?
@streamy732 ай бұрын
Same issue
@SmexMyPocky21 күн бұрын
I don't understand the point of the last step. Why are we going through RVC when it doesn't make a model, just to download a reference wav?
@jello195Ай бұрын
@Aitrepreneur This makes me wonder, if it is possible to for length and if there are shortcuts to auto translate voice to voice (in the optics of easy dubbing). I'd be interested in a video about that if ever such app exists.
@blackpantherAI6 ай бұрын
where i can find AI voices already made by the community?
@Aitrepreneur6 ай бұрын
google rvc models
@dumbsurvivor15 ай бұрын
@@Aitrepreneur Why don't you give the link in the description? It's like you're trying to finesse a little bit
@Qubot6 ай бұрын
Thanks for the tutorial. However, there are 3 envs here; is it possible to put them all in the same env?
@geneanthony34212 ай бұрын
Not sure why, but finetune seems to silently drop when trying to create a dataset. It runs until 100% and the process just closes without an error. Anyone else run into this issue?
@DestinyFaux2 ай бұрын
Couldn't even get it to run lol
@geneanthony34212 ай бұрын
@@DestinyFaux found out that my issue was that I was running in Conda not venv. Seems to work that way
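To tell whether a script is running from a venv or from Conda, as in the report above, the interpreter prefixes can be compared. A minimal sketch: `environment_kind` is a hypothetical helper, and treating a set `CONDA_PREFIX` variable as "running in Conda" is a simplifying assumption that can misfire if Conda is merely activated in the shell.

```python
import os
import sys
from typing import Optional


def environment_kind(prefix: str, base_prefix: str,
                     conda_prefix: Optional[str]) -> str:
    """Classify the interpreter as 'venv', 'conda', or 'system'."""
    # In a venv, sys.prefix points at the venv and differs from base_prefix.
    if prefix != base_prefix:
        return "venv"
    # Assumption: CONDA_PREFIX set means a Conda environment is active.
    if conda_prefix:
        return "conda"
    return "system"


if __name__ == "__main__":
    kind = environment_kind(sys.prefix, sys.base_prefix,
                            os.environ.get("CONDA_PREFIX"))
    print("running from:", kind, "-", sys.prefix)
```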
@wedding_photography5 ай бұрын
Definitely not as good as ElevenLabs, but not a bad result. Wish you had more examples, different speakers.
@sohaibmoussaidelidrissi24310 күн бұрын
Can you do a method for live voice change ? From voice to voice, thank you for the great video.
@dziku22224 ай бұрын
Doesn't work. At 2:21 call install.bat I get this message: Nie mo'venv' is not recognized as an internal or external command, operable program or batch file. 'pip' is not recognized as an internal or external command, operable program or batch file. 'pip' is not recognized as an internal or external command, operable program or batch file. Install deepspeed for windows for python 3.10.x and CUDA 11.8 Nie moInstall complete. Press any key to continue . . .
@GragSpcX_AI4 ай бұрын
Same here! I don’t know how to fix this issue. I’m a beginner!
@PresidentofAntifa3 ай бұрын
You in the correct folder?
@studiomusicflow46443 ай бұрын
so. it needs NVidia to run? any way to run on cpu or amd?
@komakaze15 ай бұрын
I'd like TTS voices to act. Imagine giving it a long story and it would whisper, shout, laugh, cry, express surprise, embarrassment, bravado, fear, courage, disgust, curiosity and interest through voice. Can any TTS AI do this yet?
@DarthyMaulocus4 ай бұрын
yes im working precisely on this, it requires using different models from memory, i mean sure you can switch out models hence have emotions, its just delay of switching which is the worry
@DarfailАй бұрын
new advanced voice mode on chatgpt does that and...it's amazing
@landyandy5 ай бұрын
Hi alien. Do you have any tips for making XTTS WebUI voice2voice work on an older 1070 GPU? I load the voice model, choose the language and then generate. Error --> ValueError: Requested float16 compute type, but the target device or backend do not support efficient float16 computation. Bye and ty
@andro1234567890100Ай бұрын
1. Open the xtts_demo python file using an IDE.
2. Change:
    if torch.cuda.is_available():
        compute_type = "float16"
    else:
        compute_type = "float32"
   to:
    if torch.cuda.is_available():
        compute_type = "int8"
    else:
        compute_type = "float32"
3. Save.
Took me a whole day to figure out. Just needed some sleep and then went and read through the git repository's reported issues and found it in there. Seems to be working for me now.
@ROUNAK2754 ай бұрын
Can it work without gpu
@benedictsforester70455 ай бұрын
finetune spits out errors like crazy. wasn't able to finish a single training