No video

(COLAB PRO ONLY) AI Voice Cloning with RVC in GOOGLE COLAB - Guide and Setup

  Рет қаралды 142,270

Jarods Journey

Jarods Journey

Күн бұрын

Пікірлер: 351
@Jarods_Journey
@Jarods_Journey Жыл бұрын
CURRENT Issue 9-15-2023: I was made aware that Google has started banning RVC usage on free accounts, similar to what it did to stable diffusion. There is no fix for this ATM other than to get the PRO version of Collab. IMPORTANT: You MUST click "Train feature index" at 12:07 in order to get the IVF index file you'll need later. As noted by another comment, this can be done before or after training. Sorry about that guys!
@dylonkejhu
@dylonkejhu Жыл бұрын
Thanks !
@jessie24031
@jessie24031 Жыл бұрын
what is the difference between the two? is there anything different about it?
@ALFTHADRADDAD
@ALFTHADRADDAD Жыл бұрын
good lookin out
@user-wy7qe9zh2z
@user-wy7qe9zh2z Жыл бұрын
what is the consequence? Is it train model -> train feature index -> one-click training?
@forest1605
@forest1605 Жыл бұрын
@@user-wy7qe9zh2z whats the diff between them
@realjgerard
@realjgerard Жыл бұрын
I just want to say publicly, that I appreciate you Jarod for creating all of these guides. I can tell that you’re not doing this for views and that you truly have a generous spirit. These videos will create businesses..generate revenue. I explore anyone that does so pays it forward or pays it back. I know I will… ☯️🥂🚀💯
@Jarods_Journey
@Jarods_Journey Жыл бұрын
Thought I had left a comment on this one, but going back through comments again, really appreciate it Gerard 🙏!
@hanynagy8969
@hanynagy8969 Жыл бұрын
@@Jarods_Journey I need your help please.I just want to Update my trained model,I mean if I want to add more date(Audio files) to the model,Is it possible? Because every time I have new data I make a new model from 0 so I m tiered from that.Thanks in advance!
@mikecameron2327
@mikecameron2327 Жыл бұрын
Thanks for your videos on RVC, they were very helpful to me to get started with this. One important detail, in the video you put up a graphic telling viewers to click the train button, not one click training. This was good because I think one click training is slightly bugged, but you forgot to mention that if you don't use one click training you MUST also run feature training (2nd big button) or you won't have an .index file.
@Jarods_Journey
@Jarods_Journey Жыл бұрын
Appreciate it! Man, I had to do a double take real quick, but I do say it to click it at 8:16 😅. I realized after editing that one-click training does all of the previous steps before, so it was redundant to click one-click training. If you run all the previous buttons before and click one click training, it just redos all of the previous steps. Edit: misread the comment, looks like an oversight and a missed step on my end!
@producer8587
@producer8587 11 ай бұрын
It’s now banned tho 😢
@WIDOMU
@WIDOMU Жыл бұрын
I just want to say thank you so much Jarod for making this video. I can feel the passion and kindness and you were doing it to help people not taking it for money. It really worked for me the collab I was dancing crazily when It worked. I was so happy. Thank you, I subscribed!
@Jarods_Journey
@Jarods_Journey Жыл бұрын
Haha appreciate it, glad I was able to help you get it all up and running :D!
@WIDOMU
@WIDOMU Жыл бұрын
@@Jarods_Journey Thank you. All the best. I will be supporting your future videos! Thank you for replying too, It feel great when the author of this video replies to their fans! :D
@klaurcschwackerberg1880
@klaurcschwackerberg1880 Жыл бұрын
You did humanity a huge favor by making this tutorial , many thanks ! So much detail and well explained ! Liked and subscribed !
@Jarods_Journey
@Jarods_Journey Жыл бұрын
Appreciate it!
@GratencyaMC
@GratencyaMC Жыл бұрын
I've watched a different tutorial which didn't help me at all. Sound was robotic, and generally no much explanations whatsoever. But this one helped me, and the trained voice sounds perfect! I am extremely thankful for this. You are the best! :D
@Jarods_Journey
@Jarods_Journey Жыл бұрын
Appreciate it 🙏!
@idk7440
@idk7440 Жыл бұрын
thanks for the video bro, i've just spent 5 hours on a model of my friend and it works relatively well without much training, totally worth it
@zazyczech
@zazyczech Жыл бұрын
My last programming was in 1997 with basic (i was 12). This is whole new universe. Thank you!
@user-kw4xj2yk1g
@user-kw4xj2yk1g Жыл бұрын
Man, thank you so much for such detailed guidance! I've watched about 10 videos on the same topic but did not understand the process. You really helped me, thanks!
@SantoValentino
@SantoValentino Жыл бұрын
If anyone is looking to use RVC locally, it’s worth it. Saved me hours of training and it sounds better. Haven’t touched sovits since I installed Mangio-RVC. I had troubles with installation but after 2 days it was worth it. The CHAT channel in the ai discord helped
@Jarods_Journey
@Jarods_Journey Жыл бұрын
Interesting fork! RVC is definitely worth it and in my experience, it just trains faster and produces better results. I gotta take a look at the architecture to see why lol.
@Cameron787
@Cameron787 Жыл бұрын
Thanks! Trying to decide if I should go local or colab. Will the local speed depend on the graphics card? I only have an RTX 2070 on my Razor Pro from 3 years back. OK its not that old but might be slower than colab? What are you running it on?
@SantoValentino
@SantoValentino Жыл бұрын
@@Cameron787 shouldn’t be bad at all. My 3060 ran fine. Even if it takes a few minutes this is crazy technology either way lol
@BeatsAudios1988
@BeatsAudios1988 Жыл бұрын
Sir, this is the error i am getting from content data set Could not parse variable and value from ""/content/drive/MyDrive/dataset/AKALEYO_NEE_vocals.zip"". Expected the line to start with a variable assignment. please help me
@Skylar-333
@Skylar-333 Жыл бұрын
I am having this same issue, I've tried everything, including just naming my zip file what the program seems to be looking for. No idea what went wrong. I don't even have the same opportunity under the "Dataset location" to edit the path as seen in the video. It is just all red saying "Could not parse variable and value from ""/content/drive/MyDrive/dataset/lulu20230327_32k.zip"". Expected the line to start with a variable assignment" and the edit symbols are greyed out and inaccessible. Not sure what has gone wrong, I followed everything precisely. Would love some help! Glad to see I'm not the only one!
@65536thRoundTable
@65536thRoundTable Жыл бұрын
weird. my comment got deleted. Anyways , put this right at the start of Install dependencies if you keep failing to build pyworld and/or not be able to find faiss !pip install pyworld==0.3.2 !pip install numpy==1.23.5
@nonnegative7063
@nonnegative7063 Жыл бұрын
My comment was deleted too, I sent whole installation line which should've fixed that
@Jarods_Journey
@Jarods_Journey Жыл бұрын
Yt doesn't like full code comments or links, unfortunately, so it seems like it filters those out. Appreciate the fix, but we'll have to wait until it's pull-requested over on the repo or updated 🤟
@marjanamaan2109
@marjanamaan2109 Жыл бұрын
thank you so much for full detailed tutorial , after watched so many videos finally i found your video with full detail step by step. thank you so much 🙏
@Jarods_Journey
@Jarods_Journey Жыл бұрын
Appreciate it, glad I could help :D!
@DuskyRick
@DuskyRick Жыл бұрын
Thanks for this tutorial! I tried doing the RVC on my laptop locally, but it seems like my 1650 ti gpu is not as strong as I thought. Good thing I found this tutorial!
@Jarods_Journey
@Jarods_Journey Жыл бұрын
Appreciate it! All these "AI" training tools are VRAM hogs, so colab is a great alternative.
@ohmfnx2
@ohmfnx2 Жыл бұрын
tip: i use google colab for "Train" but "Model inference" & "Accompaniment and vocal separation" i use on gtx 1650
@Vateir
@Vateir Жыл бұрын
Constant connection errors on every step in the public web RVC, for a second they seem to work and then give error messages. I managed to get to feature extraction but each time it just halts the process, says there is a connection error and sometime after colab disconnects
@johanfrancis2001
@johanfrancis2001 Жыл бұрын
Hello Gradio file is not found in temp. I got error so i went to the temp folder. But gradio folder not found inside it. What should i do?
@dylonkejhu
@dylonkejhu Жыл бұрын
Hey, at 15:28 i can't find the file in the model folder. Can u help me why
@naturalbest617
@naturalbest617 Жыл бұрын
I have the same problem... any solution?
@mikecameron2327
@mikecameron2327 Жыл бұрын
If you clicked the "Train" button and not the "One click training" button you have to also click the middle button "train feature index", that's what makes the .index file you need. You can do it before clicking training or after, it doesn't matter.
@dylonkejhu
@dylonkejhu Жыл бұрын
​@@mikecameron2327 thanks :D
@RobertJene
@RobertJene Жыл бұрын
I don't care much for google collab, but I'm here for the like, comment, and the view.
@Jarods_Journey
@Jarods_Journey Жыл бұрын
Appreciate it 🤟
@8gntt
@8gntt Жыл бұрын
I had an interesting error come up when I started feature extraction. This is what I was met with at the end File "/usr/local/lib/python3.10/dist-packages/torch/nn/modules/conv.py", line 309, in _conv_forward return F.conv1d(input, weight, bias, self.stride, RuntimeError: Calculated padded input size per channel: (1). Kernel size: (2). Kernel size can't be greater than actual input size all-feature-done
@Jarods_Journey
@Jarods_Journey Жыл бұрын
I believe I saw this error somewhere in the GitHub page, you might wanna check there on the RVC issues tab
@Somebodythatoverthinks
@Somebodythatoverthinks Жыл бұрын
This dude is a legend for these videos.
@RobertJene
@RobertJene Жыл бұрын
18:00 - that would be a good glitch voice for different Sci-Fi effects!
@Jarods_Journey
@Jarods_Journey Жыл бұрын
Most definitely 😂, I'm sure its highly possible to find some use cases for it
@user-rb1bo2mh1l
@user-rb1bo2mh1l Жыл бұрын
hi the video is so straight but i have an issue i dont have the "trained_IVF201_Flat....." but i have train.log what should i do?
@bluebrun0287
@bluebrun0287 Жыл бұрын
Hey! I've been following your tutorials for quite a while, and I must say - they are helping me A LOT. But in this one, I need a little help! When I run the "start web" code that you show on 6:37 I get the error message saying: /content/Retrieval-based-Voice-Conversion-WebUI Traceback (most recent call last): File "/content/Retrieval-based-Voice-Conversion-WebUI/infer-web.py", line 7, in import faiss ModuleNotFoundError: No module named 'faiss' Have you seen something like this and how we can fix it?
@Jarods_Journey
@Jarods_Journey Жыл бұрын
Try re-running the cells in the Collab, faiss may not have installed fully or correctly
@lakshit._.sharma._
@lakshit._.sharma._ Жыл бұрын
You earned a new subscriber ❤
@Jarods_Journey
@Jarods_Journey Жыл бұрын
And hopefully I bestowed some new knowledge :)
@kingpinoftherails926
@kingpinoftherails926 Жыл бұрын
Great video, thanks for that. I saw in the description of the Voice Conversion WebUI that there is a link to hugging face space. Will you also make a video on how to get this to work there? Many thanks in advance.
@Jarods_Journey
@Jarods_Journey Жыл бұрын
Appreciate it! Unfortunately, it's not hosted on hugging face and when you click the button, it just brings you to the page that you would need in order to manually install the models needed to run it locally
@user-pn6dy8ov1u
@user-pn6dy8ov1u 4 ай бұрын
Hey, everyone. I've trained a model with 400 epochs with voice samples of 10 min audio. When i'm doing the voice conversion, the words are not even pronounced correctly. The sound looks more like humming instead of speaking. What am i doing wrong? Appreciate your help.
@suga_candy_g7338
@suga_candy_g7338 Жыл бұрын
Thank you so much for this helpful tutorial! It's very detailed and easy to follow along with. I trained my model with the default total training epochs(7), save frequency(5), and batch_size for every GPU(7). The result has some static. I plan to add more samples to train with. Can you recommend the number of epochs I should do for the best results? Or is it like the higher number the better type of thing?
@Jarods_Journey
@Jarods_Journey Жыл бұрын
Appreciate it! It's all data centered and a case by case basis, so really hard to generalize A good rule of thumb though is quality over quantity. If you can get 1 hour of high quality audio samples vs 10 hours of mixed quality, I would just save the time and do the 1 hour of samples (faster and higher quality). Then you can run it for more epochs if you wanna try and smoothen out the noise, or add more high quality samples (though a lot will be experimentation and seeing what works best for your voice!)
@suga_candy_g7338
@suga_candy_g7338 Жыл бұрын
@@Jarods_Journey Thank you for your help. I will give it a shot :)
@theAIsearch
@theAIsearch Жыл бұрын
Very helpful - thanks! Whats the difference bw pm, harvest, and dio?
@Jarods_Journey
@Jarods_Journey Жыл бұрын
Not too sure, but harvest produces the best results in my testing. There is a difference, I just haven't looked it up extensively lol.
@theAIsearch
@theAIsearch Жыл бұрын
@@Jarods_Journey No worries, thanks! Do you happen to know what "Search feature ratio" does? I tried setting it all the way left & right, without much difference
@Jarods_Journey
@Jarods_Journey Жыл бұрын
@@theAIsearch I think the affects accent of the voice so if at 1, it may retain more of the accent
@beatzoid
@beatzoid 11 ай бұрын
Trained IVF file didn't appear in log -> me folder. How could I solve this?
@PrivatePaul
@PrivatePaul Жыл бұрын
10:38 you say (and do) "click one-click training" but you display "do train model". so what is it now? which one is the right one?
@Jarods_Journey
@Jarods_Journey Жыл бұрын
Do train models then after training, click extract feature index
@PrivatePaul
@PrivatePaul Жыл бұрын
@@Jarods_Journey isn't that what one-click training does? (training + feature extraction with one click)
@kazamify
@kazamify Жыл бұрын
I don't see anything under the inferencing voice tab. I have refreshed the voice list. I do see the index logs file under the "path to the .index" file tab, however. Process data step ended with "end preprocess" and the feature extraction step ended with "all-feature-done". Can anyone help? Thanks for your content, Jarod.
@Jarods_Journey
@Jarods_Journey Жыл бұрын
I might need a bit of clarification, but I believe you have everything finished that you need to train. Once the training is done, in the inference tab, you gotta click the refresh timbre button. One additional thing is check to see if the training outputted a .pth file in the weights folder, if there's nothing there, the training didn't finish correctly!
@kazamify
@kazamify Жыл бұрын
@@Jarods_Journey Yeah I when I check the weights file there isn't any .pth file. It is weird. Anyway, I watched your Google Colab tutorial for SVC and I managed to make it work! Thank you so much, mate. Keep it up.
@lanhoyc4435
@lanhoyc4435 Ай бұрын
Hi, i see error " can't open file '/content/Retrieval-based-Voice-Conversion-WebUI/infer-web.py': [Errno 2] No such file or directory" at start the web step. How can I fix it?
@randylanphear
@randylanphear 5 күн бұрын
i get the same error... have you found any fix?
@EdoVro
@EdoVro Жыл бұрын
This is really cool. Tho one thing, trying to download the multiple audio files at once completely crashed my Chrome. Luckily the Colab session was still going so I didn't lose anything.
@Jarods_Journey
@Jarods_Journey Жыл бұрын
Yup, it'll stay active for 12 hours as that's the runtime length unless you deactivate it!
@Sahgee
@Sahgee Жыл бұрын
[Edit: I searched the comments and found your response to someone else! Youre the best. Patiently waiting for my epochs to finish :) first time ever doing anything like this and im honored to have found/used your help) Original: This was such a wonderful video. My issue with the training is that it gives me the error message "filenotfound error: [erno2] no such file or directory: pretrained/f0G40k.pth" any help with this step will be greatly appreciated. It is the default load pre trained model option in step 3.
@zacharyreid7557
@zacharyreid7557 Жыл бұрын
i was getting lots of bugs on my windows laptop with amd hardware, so this is a good alternative
@krishnavamshithumma7377
@krishnavamshithumma7377 5 ай бұрын
@Jardos_Journey While I'm in Accompaniment and vocal seperation. I am not able to get any model to select like (H5 model). Can anybody please help me find my error.
@KeizerSinbad
@KeizerSinbad Жыл бұрын
I understand what everything here is except for what the MP4 file is, and is for. Can you elaborate on that?
@KeizerSinbad
@KeizerSinbad Жыл бұрын
Ah nevermind. I understand. I was wanting to use this to do the realtime with my microphone. You wouldn't need the MP4 file for that I guess.
@CosmotheDane
@CosmotheDane Жыл бұрын
Always appreciate your videos. Question: is it possible to implement a additional rounds of training on previously trained models? I know you run another round of training on the model you create, but how would you get it to recognize a model that's already be created after having completed and closed out the process? would you just re-insert the .pth and index files back into the weights/other folder and run a new training process on a new dataset?
@Jarods_Journey
@Jarods_Journey Жыл бұрын
Hmm, are you talking about adding additional training to another model, say you downloaded it off the web and you want to train it with other data? Unfortunately, I think you would run into some issues with this, but it's 100% possible as the pth files are just parameters or weights for that voice. However, even if you re-instert the .pth files in their respective folders, I believe you also need all of the other parts of the training process as well which includes the previous processed logs and all those other things. Never hurts to try though! I've never done it myself for rvc xd.
@helmutroll4773
@helmutroll4773 Жыл бұрын
@@Jarods_Journey I guess he means if you have already started a training session with let's say 20 epochs. And then you realize you want to have actually 100 epochs. Can you somehow continue training on the previous model and then just add the missing epochs from 20-100 or do you have to start all over from 0 again? Thanks for this incredibly helpful video!
@CosmotheDane
@CosmotheDane Жыл бұрын
​@@Jarods_Journey yeah, so to simplify, can I have multiple rounds of distinct trainings (i.e., dataset A, dataset B, dataset C) and have the same model retain what it learns from one to the other, or can you only train a model on one dataset, even though there may be multiple rounds (i.e, dataset A1, A2, etc...).im just trying to make the most banging model I can, if ya feel me
@JosGandos685
@JosGandos685 11 ай бұрын
Thank you so much mate. But I've got a situation here. after accompaniment and vocal separation and press CONVERT, I always got Connection errored Out. and can't convert. what is that? what i gonna do?
@HR-zg9ci
@HR-zg9ci Жыл бұрын
Can you explain what the "dataset" at the beginning is for? Do I need this to define in the google colab section or is it just the same like defining the folder in the training tab under step2a > "input training folder path"? Thanks for the great content, it's a big help!
@Jarods_Journey
@Jarods_Journey Жыл бұрын
dataset contains all of the data to be copied over to the colab repo, to then get trained on. You'll just set all of the things as shown in the vid for paths.
@nightknight8651
@nightknight8651 7 ай бұрын
I have to say your tutorial is very useful so thank you for it but I have a question the colab notebook seems to run an older version of RVC where there was no rvmpe so how to run the latest version on colab?
@Sahgee
@Sahgee Жыл бұрын
Your video was so helpful!!! I wanted to ask if we come back another day to use our model, after loading her up on colab, do we have to train her again or can we just jump into the model interface step?
@Jarods_Journey
@Jarods_Journey Жыл бұрын
As long as it's trained, just inference :)
@tomtornados6236
@tomtornados6236 Жыл бұрын
The python console in the Collab is throwing me errors about non-existent modules whenever I click on the "start web" cell. Why is this?
@rzxv3
@rzxv3 Жыл бұрын
You should cover the new mangio-crepe fork, it’s way better at making models and has RVC v2
@The_Quipt
@The_Quipt Жыл бұрын
Does anyone know how to stop the voice changer from saying what I say twice?
@eventfakt
@eventfakt Жыл бұрын
Hello, when I use collab, the most time-consuming connection is suddenly disconnected, and when I try to connect again, it does not work at all. This issue has been happening to me for several days, please give me a solution so that I can use it again.
@divaki_writing
@divaki_writing Жыл бұрын
kzbin.info/www/bejne/b6jYZ3-JqsqSmrc The gradio folder did not pop up for me, what's the issue?
@davidportilla4377
@davidportilla4377 Жыл бұрын
x2
@Jarods_Journey
@Jarods_Journey Жыл бұрын
The may have been an error or issue when trying to do the audio file, on colab, I'm not too sure what else
@gsharks3333
@gsharks3333 Жыл бұрын
Im receiving this error ERROR: Failed building wheel for pyworld Failed to build pyworld ERROR: Could not build wheels for pyworld, which is required to install pyproject.toml-based projects any idea why?
@dusandss
@dusandss 5 ай бұрын
Hi Jarods, I am using Colab Pro, and didnt have issue at first, but now everytime I want to train model, it always shows me error, because two log folders are missing: 2a_f0 and 2b-f0nsf. If I add them manually, training will proceed to the end, but they will remain empty and my model won't change voice on cover song successfully. I'm trying to solve the issue past three days, but without success. Can you help me with that? Why are these folders missing now, and not before, and how this can be solved? Am I doing something wrong? Thanks in advance!
@MrH4nky
@MrH4nky Жыл бұрын
So, theoretically, if it disconnects because of 12 hours period, how am I supposed to finish extraction and training faster? Or there're some checkpoints for data, so I can keep going from the previous one? ('Cause it threw me error at first try and after that collab didn't want to start again)
@Nabuuug
@Nabuuug Жыл бұрын
Wait, just to be sure, is the "vocals" part for singing or something like that ? If I'm only interested in voices that talk, can I skip this ? Because i don't have singing recordings of the voice i want to clone, i only have the voice samples in the zip file where the voice is talking.... I'm confused EDIT : AH OKAY, yeah so the video file is not a voice sample to be cloned, it's a voice that we want to replace with the voice from our trained model!! Ok I get it now, so yeah because I want to do the microphone real time thing of the other video, I don't care about inferencing my trained model on this additional video file like you do. Good.
@nick22552
@nick22552 Жыл бұрын
Traceback (most recent call last): File "/content/-EVC-/extract_feature_print.py", line 13, in version = sys.argv[6] IndexError: list index out of range ['extract_f0_print.py', '/content/-EVC-/logs/Cristi', '2', 'crepe', '115'] how do i fix this
@bluelight6672
@bluelight6672 7 ай бұрын
Is there a reason why the program can't locate my dataset folder in my drive? Does it have to do with it being banned?
@eliasdaviddiazfrancisco5341
@eliasdaviddiazfrancisco5341 Жыл бұрын
Nice, but the index file does not apear in logs, I the only problem I have had
@Jarods_Journey
@Jarods_Journey Жыл бұрын
Try clicking the middle button that trains feature index
@Hibabiii
@Hibabiii Жыл бұрын
how to continue training my model ... I had trained it for 200 epochs ... but it still not that good ... Is there any way to continue training it ... Or should I train a new model ???
@kurotesuta
@kurotesuta Жыл бұрын
Is there TTS for RVC?
@echofloripa
@echofloripa 11 ай бұрын
I'm running on pro version, I'm getting the following error: I added a single .wav file to the /content/dataset-2/ folder inside my collab start preprocess ['trainset_preprocess_pipeline_print.py', '/content/dataset-2', '40000', '12', '/content/Retrieval-based-Voice-Conversion-WebUI/logs/me', 'False'] /content/dataset-2/.ipynb_checkpoints->Traceback (most recent call last): File "/content/Retrieval-based-Voice-Conversion-WebUI/my_utils.py", line 14, in load_audio ffmpeg.input(file, threads=0) File "/usr/local/lib/python3.10/dist-packages/ffmpeg/_run.py", line 325, in run raise Error('ffmpeg', out, err) ffmpeg._run.Error: ffmpeg error (see stderr output for detail) During handling of the above exception, another exception occurred: Traceback (most recent call last): File "/content/Retrieval-based-Voice-Conversion-WebUI/trainset_preprocess_pipeline_print.py", line 75, in pipeline audio = load_audio(path, self.sr) File "/content/Retrieval-based-Voice-Conversion-WebUI/my_utils.py", line 19, in load_audio raise RuntimeError(f"Failed to load audio: {e}") RuntimeError: Failed to load audio: ffmpeg error (see stderr output for detail) /content/dataset-2/record.wav->Suc. end preprocess
@Jarods_Journey
@Jarods_Journey 11 ай бұрын
The preprocess seems to have worked, but make sure there are no other files in there and no spaces in your path. This is usually the fix that I see for people.
@ishitagupta6369
@ishitagupta6369 11 ай бұрын
You may be executing code that is disallowed which may terminate your runtime without warning. Colab prioritizes interactive notebook compute and disallows some types of usage when executing code without compute units as outlined in the FAQ. Your compute unit balance is 0.Purchase more I am continuously getting this message Please help
@SKYGGEMUSIC
@SKYGGEMUSIC Жыл бұрын
@Jarods_Journey I have a question about the dataset that you used to train the main model. Is it big? How big? I train french voices and they sing with an english accent! Cute, but sometimes weird!! I was wondering how to have a big french model and how many data was needed to do so. BTW huge congrats, your work is amazing. Love it.
@Jarods_Journey
@Jarods_Journey Жыл бұрын
Appreciate it! The datasets I use range from 10 minutes to 3 hours, but my average is around an hour of audio. In my experience, the index file controls the "accent" so you can try adjusting it to 1 and seeing if it results in a better accent
@SKYGGEMUSIC
@SKYGGEMUSIC Жыл бұрын
@@Jarods_Journey that's correct, english accent has almost gone. thx
@animeshindia
@animeshindia Жыл бұрын
Please Make a tutorial on mixing two rvc models using ckpt.
@trush1090
@trush1090 7 ай бұрын
I trained it for 200 epochs, how do I reuse that model to train it to 400 without having to start it from 0 epochs again?
@vuchithanh
@vuchithanh 16 күн бұрын
Cám ơn bạn quá phức tạp không làm theo được khi web hiển thị không giống như hướng dẫn
@_nothinghappens1548
@_nothinghappens1548 Жыл бұрын
Hey! At 14:05 when I click on refresh the vocals, mine doesnt get refreshed. More files arent appearing like in the tutorial and im also getting another error besides the me_zip. I also get a "/content/drive/MyDrive/dataset/.ipynb_checkpoints: Is a directory" error. Can someone please help me?
@Jarods_Journey
@Jarods_Journey Жыл бұрын
Do you have any files that are .ipynb_checkpoints in your datasets folder? Not too sure unfortunately if files aren't appearing, could be possible some parts of the process never finished.
@theentirecircus6623
@theentirecircus6623 Жыл бұрын
Great Tutorial. I'm having some problems with the last part, even after using a 2 min. short audio (inference) I'm getting the timeouts and there is also no folder with the name Gradio in the TEMP folder. There are only couple of INFO messages in colab and nothing else.
@Jarods_Journey
@Jarods_Journey Жыл бұрын
Hmm, not sure what's happening here, it might not actually be processing then. Have you tried restarting runtime?
@theentirecircus6623
@theentirecircus6623 Жыл бұрын
​@@Jarods_Journey actually not, I only tried rerunning the cell. I'll try restarting the runtime as well, then post the results here
@obeyoutube
@obeyoutube Жыл бұрын
@@theentirecircus6623 hi ! I have the same issue. I've restarted Runtime a few times but it doesn't help. My TEMP folder is empty. Did you solve the problem?
@theentirecircus6623
@theentirecircus6623 Жыл бұрын
@@obeyoutube I've just tried again with another notebook and it worked (I've waited couple of minutes after getting the timeout error). @Jarods_Journey I can link the notebook if it's okay, but it's not from the original repo
@hedwig7s
@hedwig7s Жыл бұрын
@@theentirecircus6623 Link it please
@ettiennelane9173
@ettiennelane9173 2 ай бұрын
Wil it work with the Google Colab Pay As You Go options?
@TheDenVx
@TheDenVx Жыл бұрын
Is there any way to move GPU work for AI voice changer in to Google Colab? Cuz im getting 50-60% usage on RTX 3070 with Voice Changer 😂
@Prodigy-Chaos
@Prodigy-Chaos 11 ай бұрын
So do you train like for example 1 1/2 hour of your own voice with the harvard sentences(or maybe something better I'm unaware of) and then use 1 1/2 hour of a mp4 vocals or whatever voice you're trying to clone to go into the realtime AI voice changer?
@Jarods_Journey
@Jarods_Journey 11 ай бұрын
You can do that, but the voice samples don't matter as long as they're complete words and there's no background sound. Once an RVC model is trained, that is what goes into the voice changer/
@pujabanchu8239
@pujabanchu8239 Жыл бұрын
All process of training complete but in Weights i can not see anything (.pth file)....why?
@Jarods_Journey
@Jarods_Journey Жыл бұрын
That means there was some type of error in the training process and it never got to outputting a final trained model. In this case, you'll have to run training again for the model
@megagamer2874
@megagamer2874 11 ай бұрын
Is it possible to do this on the windows terminal because whenever I try to do it, I keep loosing connection so is there a more permanent solution?
@helmutroll4773
@helmutroll4773 Жыл бұрын
Question when executing "restore pth from google drive": After re-importing from GDrive to goolgecolab the *.index and the *.npy file are uploaded into the "content" folder of googlecolab. But is this the correct folder where those file should be in the end? Because afterwards when I am in the "model inference" tab, I have to choose then the "Feature search database file". I should then link to the *.index file which is now lying under "content", is that correct? and: how many epochs do you think are perfect? is it possible to overtrain and get bad results when putting the epoch too high? let's say eg. 200? Thank youuuu!
@Jarods_Journey
@Jarods_Journey Жыл бұрын
Yes, when you reupload back to colab, you have to copy all of the correct file paths to the spots where they belong at. There is no perfect epoch count as it is dependent on data, I haven't seen any instance of overtraining and it's not that severe when training on these small datasets.
@VinixTKOC
@VinixTKOC Жыл бұрын
Until yesterday it was working, now it isn't working anymore. The main errors that I noticed: "error: subprocess-exited-with-error" "Building wheel for pyworld (pyproject.toml) did not run successfully." "Building wheel for pyworld (pyproject.toml) ... error" "ERROR: Failed building wheel for pyworld" "Failed to build pyworld" "ERROR: Could not build wheels for pyworld, which is required to install pyproject.toml-based projects" "ModuleNotFoundError: No module named 'faiss'"
@TheBestKindOfFailure
@TheBestKindOfFailure Жыл бұрын
That issue arose for me around the exact same time. Cannot get it to build that wheel all of the sudden, but it's not a ""ModuleNotFoundError" with me.
@deepsacheti742
@deepsacheti742 10 ай бұрын
Hi man! Just a query! I have trained my voice model for narration yesterday. Now, I would like to convert a TTS voice to my voice. I followed your instructions restored the path before going to web interface. Now, as you mentioned we have to put the path address of IVF file in database file path but now, I only see G and D file of my model when I am going to RVC - logs - my project. Can you please help on where to find the ivf file when we are coming on the next day.
@Jarods_Journey
@Jarods_Journey 10 ай бұрын
You want to grab the weights file, inside of assets/weights. That is your voice model. Inside of logs/ is you're index file
@next3108
@next3108 Жыл бұрын
hey i got error on the latest step model inference, first fail in console say RuntimeError: Failed to load audio: ffmpeg error (see stderr output for detail), and second this AttributeError: 'NoneType' object has no attribute 'dtype
@Jarods_Journey
@Jarods_Journey Жыл бұрын
Make sure there are no spaces in your folders or anywhere in your folder path
@next3108
@next3108 Жыл бұрын
@@Jarods_Journey there are no spaces, again error
@Jinx_806
@Jinx_806 Жыл бұрын
Hey man help i was stuck on start web... can you help for the error /content/Retrieval-based-Voice-Conversion-WebUI python3: can't open file '/content/Retrieval-based-Voice-Conversion-WebUI/infer-web.py': [Errno 2] No such file or directory
@Jinx_806
@Jinx_806 Жыл бұрын
Got it
@supportteam8263
@supportteam8263 11 ай бұрын
How
@Jinx_806
@Jinx_806 11 ай бұрын
@@supportteam8263 instead of using it in local machine use google collab
@ZaiMatoro
@ZaiMatoro Жыл бұрын
Hi, your tutorial helped me a lot (i just need to find a way to get the perfect model training to avoid all those "glitch" stuff, i tried to set the epochs on 200 actually). The thing is, can i download the voice model (or is the files ado-japanese_D_935.pth, ado_japanese_G_935.pth and ado-japanese935.pth + the added_IVF file and total_fea.npy) to then use the model on the RVC-web (the .bat method) and to also be able to use the voice model on other websites / software?
@Jarods_Journey
@Jarods_Journey Жыл бұрын
Appreciate it! So I'm slightly confused with your question, but if you want to be able to use these models anywhere, you need the one that's in the weights section, and the trained index file in the logs section. You don't actually need the G/D ones in RVC except for training and doing training at a later time
@BeatsAudios1988
@BeatsAudios1988 Жыл бұрын
in final stage it showing connection error out popup message bro. morethan half hour waiting
@diegobert8142
@diegobert8142 4 ай бұрын
How install it on Linux? make a video lesson, please. Thank's.
@AnubhavGamerX
@AnubhavGamerX 8 ай бұрын
Hlo its says file not found error at pretained / f04D0k.pth pls solve my problem 😢
@jaripeltola
@jaripeltola Жыл бұрын
The step 2a returns an error in loading audio in the webUI. The setup and folder paths are correct.
@jaripeltola
@jaripeltola Жыл бұрын
The same audio files load correctly in the offline version, but there I cannot train a model without GPU.
@Jarods_Journey
@Jarods_Journey Жыл бұрын
Make sure there are no space in the path. There may be other stuff but this is usually the fix
@cryptidpet4325
@cryptidpet4325 Жыл бұрын
what happens if the load package dataset doesn't work? it didn't work for me and im unsure why
@cryptidpet4325
@cryptidpet4325 Жыл бұрын
NO I FIGURED IT OUT, I DID NOT NAME MY DRIVE FOLDER DATASET
@naminhtien862
@naminhtien862 Жыл бұрын
What is the difference between 2 audio.wav file in TEMP folder and why we need to wait both of them?
@Jarods_Journey
@Jarods_Journey Жыл бұрын
Those are the outputted files from the webpage. If you don't download them, they get deleted.
@user-wy7qe9zh2z
@user-wy7qe9zh2z Жыл бұрын
why did I just set the epoch to 100, but it appears G and D files have up to a few thousand? which one is better?
@Jarods_Journey
@Jarods_Journey Жыл бұрын
The RVC epoch is actually weird in the way they named it, 100 actually means 10k epochs. The G and D files have a few thousand due to this and are named for the step count it takes to go through the data. If you're familiar with sovitssvc, this might make a little more sense. You use the G for any type of inference.
@RakeshKumar-hg1ln
@RakeshKumar-hg1ln 11 ай бұрын
I am training for 300 epochs but after 20 epochs it shows connection error out. What to do
@Jarods_Journey
@Jarods_Journey 11 ай бұрын
You would have to restart everything, however, I think Google is starting to cut down on free usage of Collab...
@singlewave805
@singlewave805 Жыл бұрын
I dont have the window for dataset location...
@toasteroven6761
@toasteroven6761 Жыл бұрын
What's the option at the bottom left of train tab that says "Load pre-trained base model G path" and "Load pre-trained base model D path"? Is it possible to change this to expand or update an existing model with even more epochs? You know, since free Colab has a daily limit, you sometimes can't finish 200 epochs in time... If this is possible, how could I go about doing this? And would it be possible to continue said training with an expanded dataset (i.e. same audio file but updated with additional data) or would it negatively effect the results of the training?
@Jarods_Journey
@Jarods_Journey Жыл бұрын
These are the pretrained models that the authors used to start a "foundation". From what I know, you pretty much need them unless modify some things in the code etc. Since colab has a daily limit, I believe you have to export all of the trained data to your Google drive. I'm not sure if it does this using the export option, but you might have to manually do this (I might have to look into it). Yes, you can add more data, you just need to re process and re feature extract. It has to be the same voice though
@ShortStories-el5sw
@ShortStories-el5sw 9 ай бұрын
how do we train more than 1000 epochs for the model?
@obeyoutube
@obeyoutube Жыл бұрын
Hi! Thanks for the video. It's very useful. I faced an issue during the process. I trained a model and it appeared in the model inference. However, when I wanted to copy the path to my index file, it wasn't there. On which step does this file have to appear in the folder logs/model_name? I didn't extract vocals from a video as I already had an audio file. What should I do, if I have the trained model and there is no index file? Please help me with this issue. Thank you.
@Jarods_Journey
@Jarods_Journey Жыл бұрын
Check out the pinned comment!
@obeyoutube
@obeyoutube Жыл бұрын
@@Jarods_Journey thank you for peeling my eyes =)
@forest1605
@forest1605 Жыл бұрын
hey, after you mmake one voice clone ai how do you make another one? Like, i wanna make another one do I just make another data folder but name it data set instead of dataset or do I put the files with the other data set, or do I delete the files from the last voice clone from that data set and replace it with my new audio files
@Jarods_Journey
@Jarods_Journey Жыл бұрын
You can just use another name or delete the old one (rename) and start over
@senerio2124
@senerio2124 Жыл бұрын
Is there somewhere we can download voices that others have already trained?
@AIAsiaSinger
@AIAsiaSinger Жыл бұрын
I successfully trained a model with 20 epoch. When i try to train it another day with 200 epoch, just right before it generating the index file. The whole colab page crashed and the gradio page shows ​“Error: Connection errored out” . why did that happen :(? thanks in advance!
@Jarods_Journey
@Jarods_Journey Жыл бұрын
Not sure what might be occuring here, but some error must be happening causing the console to stop. Might have to rerun or show what error occured on the cell
@AIAsiaSinger
@AIAsiaSinger Жыл бұрын
@@Jarods_Journey thanks for the reply, i think it's because some data sample size too big. solved.
@prantoroy7054
@prantoroy7054 Жыл бұрын
help... there is no gradio folder in temp folder
@imi761
@imi761 Жыл бұрын
same
@good-gd9ui
@good-gd9ui Жыл бұрын
Thanks for your vedio! I face this error when I run "Start Web" - [Errno 2] No such file or directory: '/content/Retrieval-based-Voice-Conversion-WebUI' /content python3: can't open file '/content/infer-web.py': [Errno 2] No such file or directory Do you know how to fix that? I do have my python install in my laptop! Thanks!
@thearabicmusicland
@thearabicmusicland 6 ай бұрын
same error you found a solution ?
@forest1605
@forest1605 Жыл бұрын
TYS
@pervizkaral9550
@pervizkaral9550 11 ай бұрын
When i want to make a model crepe is Best or harvest?
@jensin3711
@jensin3711 Жыл бұрын
Hey! Need some advice; my Google Colab keeps constantly disconnecting at the end during the training process. I don't know exactly why, but I have a feeling that the GPU is becoming overloaded with the request and simply just crashes. I don't know anything about Python and pretty much follow your tutorial step by step, so trying to resolve how to fix the issue is something I need a little help with. Maybe a way to dedicate more of the GPU to the task to avoid these crashes?
@mushroomcrepes4780
@mushroomcrepes4780 Жыл бұрын
I think google colab disconnect you if you use any webgui applications, they did it with stable diffusion. So you either have to pay or run it on your own machine.
@ridzverse
@ridzverse Жыл бұрын
i've trained a model and didn't give me the index file, how to fix it
@Jarods_Journey
@Jarods_Journey Жыл бұрын
Click the train feature index button
@riniraw
@riniraw Жыл бұрын
First of all, thank you for the video. some questions: 1. I can't download files from google colab, and I have no such icon in my chrome browser. 2. when I try to restore the paths of my models, it says that the directory couldn't be found... I'm not sure why. everything seems to be correct. 3. how do I take the model that I've trained in google colab and run it localy? where do the model files need to go? Thanks again :)
@Jarods_Journey
@Jarods_Journey Жыл бұрын
1. Not sure what's happening, right clicking on the file and downloading can work in this case 2. Make sure the path is being entered in correctly and there are files in the google. If they did not export, this may be the cause. 3. To run local models, you might wanna see the local version of this tutorial to get insight on that process
@aoidesu4213
@aoidesu4213 Жыл бұрын
I have a 10 minute voice dataset but it's 1 file. Do I need to split it like 1 sentence per file or 10 sec like that?
@Jarods_Journey
@Jarods_Journey Жыл бұрын
Yes, you need to split it into 10 second audio samples or you will run out of memory during training.
@emmamorley1836
@emmamorley1836 11 ай бұрын
Why does it always do runtime disconnect for me
@moinmoin3623
@moinmoin3623 9 ай бұрын
I can't see database section after giving permission
How to Make the PERFECT Dataset for RVC AI Voice Training
18:17
Jarods Journey
Рет қаралды 131 М.
GTA 5 vs GTA San Andreas Doctors🥼🚑
00:57
Xzit Thamer
Рет қаралды 24 МЛН
Or is Harriet Quinn good? #cosplay#joker #Harriet Quinn
00:20
佐助与鸣人
Рет қаралды 46 МЛН
АЗАРТНИК 4 |СЕЗОН 1 Серия
40:47
Inter Production
Рет қаралды 1,1 МЛН
Calyx OS - The next big Android Competitor!?
12:06
Mrwhosetheboss
Рет қаралды 5 МЛН
How to win a argument
9:28
ajaxkmr
Рет қаралды 504 М.
Get the BEST AI Voice Models by Analyzing Tensorboard for RVC
11:41
Jarods Journey
Рет қаралды 30 М.
The ULTIMATE Browser Tier List (Based Tier to Spyware Tier)
39:19
Eric Murphy
Рет қаралды 2,1 МЛН
AI Voice Cloning for Singing with RVC - Guide and Set-up
16:29
Jarods Journey
Рет қаралды 291 М.
How To Clone ANY Voice In Under 5 MIN w/ Eleven Labs AI
14:09
The Joe Rogan AI Experience
Рет қаралды 24 М.
New Realtime AI Voice Changing Advancements - NO GPU
6:16
Jarods Journey
Рет қаралды 23 М.
GTA 5 vs GTA San Andreas Doctors🥼🚑
00:57
Xzit Thamer
Рет қаралды 24 МЛН