CURRENT Issue 9-15-2023: I was made aware that Google has started banning RVC usage on free accounts, similar to what it did to stable diffusion. There is no fix for this ATM other than to get the PRO version of Collab. IMPORTANT: You MUST click "Train feature index" at 12:07 in order to get the IVF index file you'll need later. As noted by another comment, this can be done before or after training. Sorry about that guys!
Check again at around 5:00 to make sure you mounted your drive and all of the file paths are editted correctly. If you're not seeing any data, then for some reason either the cell didn't finish correctly or there's some type of file path error.
Sir, this is the error i am getting from content data set Could not parse variable and value from ""/content/drive/MyDrive/dataset/AKALEYO_NEE_vocals.zip"". Expected the line to start with a variable assignment. please help me
I am having this same issue, I've tried everything, including just naming my zip file what the program seems to be looking for. No idea what went wrong. I don't even have the same opportunity under the "Dataset location" to edit the path as seen in the video. It is just all red saying "Could not parse variable and value from ""/content/drive/MyDrive/dataset/lulu20230327_32k.zip"". Expected the line to start with a variable assignment" and the edit symbols are greyed out and inaccessible. Not sure what has gone wrong, I followed everything precisely. Would love some help! Glad to see I'm not the only one!
If you clicked the "Train" button and not the "One click training" button you have to also click the middle button "train feature index", that's what makes the .index file you need. You can do it before clicking training or after, it doesn't matter.
Hi, i see error " can't open file '/content/Retrieval-based-Voice-Conversion-WebUI/infer-web.py': [Errno 2] No such file or directory" at start the web step. How can I fix it?
Hey! I've been following your tutorials for quite a while, and I must say - they are helping me A LOT. But in this one, I need a little help! When I run the "start web" code that you show on 6:37 I get the error message saying: /content/Retrieval-based-Voice-Conversion-WebUI Traceback (most recent call last): File "/content/Retrieval-based-Voice-Conversion-WebUI/infer-web.py", line 7, in import faiss ModuleNotFoundError: No module named 'faiss' Have you seen something like this and how we can fix it?
Hey man help i was stuck on start web... can you help for the error /content/Retrieval-based-Voice-Conversion-WebUI python3: can't open file '/content/Retrieval-based-Voice-Conversion-WebUI/infer-web.py': [Errno 2] No such file or directory
Hey! At 14:05 when I click on refresh the vocals, mine doesnt get refreshed. More files arent appearing like in the tutorial and im also getting another error besides the me_zip. I also get a "/content/drive/MyDrive/dataset/.ipynb_checkpoints: Is a directory" error. Can someone please help me?
Do you have any files that are .ipynb_checkpoints in your datasets folder? Not too sure unfortunately if files aren't appearing, could be possible some parts of the process never finished.
@@Jarods_Journey No worries, thanks! Do you happen to know what "Search feature ratio" does? I tried setting it all the way left & right, without much difference
Constant connection errors on every step in the public web RVC, for a second they seem to work and then give error messages. I managed to get to feature extraction but each time it just halts the process, says there is a connection error and sometime after colab disconnects
weird. my comment got deleted. Anyways , put this right at the start of Install dependencies if you keep failing to build pyworld and/or not be able to find faiss !pip install pyworld==0.3.2 !pip install numpy==1.23.5
Yt doesn't like full code comments or links, unfortunately, so it seems like it filters those out. Appreciate the fix, but we'll have to wait until it's pull-requested over on the repo or updated 🤟
Thanks for your videos on RVC, they were very helpful to me to get started with this. One important detail, in the video you put up a graphic telling viewers to click the train button, not one click training. This was good because I think one click training is slightly bugged, but you forgot to mention that if you don't use one click training you MUST also run feature training (2nd big button) or you won't have an .index file.
Appreciate it! Man, I had to do a double take real quick, but I do say it to click it at 8:16 😅. I realized after editing that one-click training does all of the previous steps before, so it was redundant to click one-click training. If you run all the previous buttons before and click one click training, it just redos all of the previous steps. Edit: misread the comment, looks like an oversight and a missed step on my end!
I'm running on pro version, I'm getting the following error: I added a single .wav file to the /content/dataset-2/ folder inside my collab start preprocess ['trainset_preprocess_pipeline_print.py', '/content/dataset-2', '40000', '12', '/content/Retrieval-based-Voice-Conversion-WebUI/logs/me', 'False'] /content/dataset-2/.ipynb_checkpoints->Traceback (most recent call last): File "/content/Retrieval-based-Voice-Conversion-WebUI/my_utils.py", line 14, in load_audio ffmpeg.input(file, threads=0) File "/usr/local/lib/python3.10/dist-packages/ffmpeg/_run.py", line 325, in run raise Error('ffmpeg', out, err) ffmpeg._run.Error: ffmpeg error (see stderr output for detail) During handling of the above exception, another exception occurred: Traceback (most recent call last): File "/content/Retrieval-based-Voice-Conversion-WebUI/trainset_preprocess_pipeline_print.py", line 75, in pipeline audio = load_audio(path, self.sr) File "/content/Retrieval-based-Voice-Conversion-WebUI/my_utils.py", line 19, in load_audio raise RuntimeError(f"Failed to load audio: {e}") RuntimeError: Failed to load audio: ffmpeg error (see stderr output for detail) /content/dataset-2/record.wav->Suc. end preprocess
The preprocess seems to have worked, but make sure there are no other files in there and no spaces in your path. This is usually the fix that I see for people.
@Jarods_Journey I have a question about the dataset that you used to train the main model. Is it big? How big? I train french voices and they sing with an english accent! Cute, but sometimes weird!! I was wondering how to have a big french model and how many data was needed to do so. BTW huge congrats, your work is amazing. Love it.
Appreciate it! The datasets I use range from 10 minutes to 3 hours, but my average is around an hour of audio. In my experience, the index file controls the "accent" so you can try adjusting it to 1 and seeing if it results in a better accent
Hey, everyone. I've trained a model with 400 epochs with voice samples of 10 min audio. When i'm doing the voice conversion, the words are not even pronounced correctly. The sound looks more like humming instead of speaking. What am i doing wrong? Appreciate your help.
I have to say your tutorial is very useful so thank you for it but I have a question the colab notebook seems to run an older version of RVC where there was no rvmpe so how to run the latest version on colab?
@Jardos_Journey While I'm in Accompaniment and vocal seperation. I am not able to get any model to select like (H5 model). Can anybody please help me find my error.
hey i got error on the latest step model inference, first fail in console say RuntimeError: Failed to load audio: ffmpeg error (see stderr output for detail), and second this AttributeError: 'NoneType' object has no attribute 'dtype
train feature index isnt working like its not saying ann error or anything its just not starting, it said it added soething and then nothing else happened 82058, 256),2104 training adding 成功构建索引,added_IVF2104_Flat_nprobe_1.index says that then doesnt do anything
Traceback (most recent call last): File "/content/-EVC-/extract_feature_print.py", line 13, in version = sys.argv[6] IndexError: list index out of range ['extract_f0_print.py', '/content/-EVC-/logs/Cristi', '2', 'crepe', '115'] how do i fix this
@@cryptidpet4325 that might be why. Im not entirely sure but when I did it with getting the vocals seperated using another site, it didnt work. But when I just used teh vocals that werent seperated and let google collab do it then it worked
Hi Jarods, I am using Colab Pro, and didnt have issue at first, but now everytime I want to train model, it always shows me error, because two log folders are missing: 2a_f0 and 2b-f0nsf. If I add them manually, training will proceed to the end, but they will remain empty and my model won't change voice on cover song successfully. I'm trying to solve the issue past three days, but without success. Can you help me with that? Why are these folders missing now, and not before, and how this can be solved? Am I doing something wrong? Thanks in advance!
I had an interesting error come up when I started feature extraction. This is what I was met with at the end File "/usr/local/lib/python3.10/dist-packages/torch/nn/modules/conv.py", line 309, in _conv_forward return F.conv1d(input, weight, bias, self.stride, RuntimeError: Calculated padded input size per channel: (1). Kernel size: (2). Kernel size can't be greater than actual input size all-feature-done
Until yesterday it was working, now it isn't working anymore. The main errors that I noticed: "error: subprocess-exited-with-error" "Building wheel for pyworld (pyproject.toml) did not run successfully." "Building wheel for pyworld (pyproject.toml) ... error" "ERROR: Failed building wheel for pyworld" "Failed to build pyworld" "ERROR: Could not build wheels for pyworld, which is required to install pyproject.toml-based projects" "ModuleNotFoundError: No module named 'faiss'"
That issue arose for me around the exact same time. Cannot get it to build that wheel all of the sudden, but it's not a ""ModuleNotFoundError" with me.
Im receiving this error ERROR: Failed building wheel for pyworld Failed to build pyworld ERROR: Could not build wheels for pyworld, which is required to install pyproject.toml-based projects any idea why?
That means there was some type of error in the training process and it never got to outputting a final trained model. In this case, you'll have to run training again for the model
Hello, when I use collab, the most time-consuming connection is suddenly disconnected, and when I try to connect again, it does not work at all. This issue has been happening to me for several days, please give me a solution so that I can use it again.
Thank you so much mate. But I've got a situation here. after accompaniment and vocal separation and press CONVERT, I always got Connection errored Out. and can't convert. what is that? what i gonna do?
Hi I am on mac, I don’t run it locally , I trained 2 models , now I can’t find them, I didn’t download them, do I have to start all over again? Or they are somewhere on my mac? Thank you
Im gonna need some clarification.. everything is on the Google drive. If you didn't download anything, nothing will be on your Mac. If you didn't save them after training, they will have been lost and you'll need to retrain them
I'm running rvc in google Collab ,, after one hour of training model the google Collab disconnect automatically and i reconnected again but every thing restart and lose everything ,, and show me this message in gradio no interface it there any solution ?
Unfortunately if it disconnects during training, it doesn't retain any of the info so all of that data is lost. Just a limitation of Collab notebooks and they only allow you to use it for around 12 hours.
So, theoretically, if it disconnects because of 12 hours period, how am I supposed to finish extraction and training faster? Or there're some checkpoints for data, so I can keep going from the previous one? ('Cause it threw me error at first try and after that collab didn't want to start again)
@@Jarods_Journey Thank you, I've checked it again and again, and then I found my flaws. Also, I want to ask, how can I train the bot more if I come back on another day? How can I use the last trained version instead of starting training all over again?
I don't see anything under the inferencing voice tab. I have refreshed the voice list. I do see the index logs file under the "path to the .index" file tab, however. Process data step ended with "end preprocess" and the feature extraction step ended with "all-feature-done". Can anyone help? Thanks for your content, Jarod.
I might need a bit of clarification, but I believe you have everything finished that you need to train. Once the training is done, in the inference tab, you gotta click the refresh timbre button. One additional thing is check to see if the training outputted a .pth file in the weights folder, if there's nothing there, the training didn't finish correctly!
@@Jarods_Journey Yeah I when I check the weights file there isn't any .pth file. It is weird. Anyway, I watched your Google Colab tutorial for SVC and I managed to make it work! Thank you so much, mate. Keep it up.
So do you train like for example 1 1/2 hour of your own voice with the harvard sentences(or maybe something better I'm unaware of) and then use 1 1/2 hour of a mp4 vocals or whatever voice you're trying to clone to go into the realtime AI voice changer?
You can do that, but the voice samples don't matter as long as they're complete words and there's no background sound. Once an RVC model is trained, that is what goes into the voice changer/
Hi man! Just a query! I have trained my voice model for narration yesterday. Now, I would like to convert a TTS voice to my voice. I followed your instructions restored the path before going to web interface. Now, as you mentioned we have to put the path address of IVF file in database file path but now, I only see G and D file of my model when I am going to RVC - logs - my project. Can you please help on where to find the ivf file when we are coming on the next day.
how to continue training my model ... I had trained it for 200 epochs ... but it still not that good ... Is there any way to continue training it ... Or should I train a new model ???
I successfully trained a model with 20 epoch. When i try to train it another day with 200 epoch, just right before it generating the index file. The whole colab page crashed and the gradio page shows “Error: Connection errored out” . why did that happen :(? thanks in advance!
Not sure what might be occuring here, but some error must be happening causing the console to stop. Might have to rerun or show what error occured on the cell
Thanks for your vedio! I face this error when I run "Start Web" - [Errno 2] No such file or directory: '/content/Retrieval-based-Voice-Conversion-WebUI' /content python3: can't open file '/content/infer-web.py': [Errno 2] No such file or directory Do you know how to fix that? I do have my python install in my laptop! Thanks!
Question when executing "restore pth from google drive": After re-importing from GDrive to goolgecolab the *.index and the *.npy file are uploaded into the "content" folder of googlecolab. But is this the correct folder where those file should be in the end? Because afterwards when I am in the "model inference" tab, I have to choose then the "Feature search database file". I should then link to the *.index file which is now lying under "content", is that correct? and: how many epochs do you think are perfect? is it possible to overtrain and get bad results when putting the epoch too high? let's say eg. 200? Thank youuuu!
Yes, when you reupload back to colab, you have to copy all of the correct file paths to the spots where they belong at. There is no perfect epoch count as it is dependent on data, I haven't seen any instance of overtraining and it's not that severe when training on these small datasets.
thank you so much for full detailed tutorial , after watched so many videos finally i found your video with full detail step by step. thank you so much 🙏
Great Tutorial. I'm having some problems with the last part, even after using a 2 min. short audio (inference) I'm getting the timeouts and there is also no folder with the name Gradio in the TEMP folder. There are only couple of INFO messages in colab and nothing else.
@@theentirecircus6623 hi ! I have the same issue. I've restarted Runtime a few times but it doesn't help. My TEMP folder is empty. Did you solve the problem?
@@obeyoutube I've just tried again with another notebook and it worked (I've waited couple of minutes after getting the timeout error). @Jarods_Journey I can link the notebook if it's okay, but it's not from the original repo
The RVC epoch is actually weird in the way they named it, 100 actually means 10k epochs. The G and D files have a few thousand due to this and are named for the step count it takes to go through the data. If you're familiar with sovitssvc, this might make a little more sense. You use the G for any type of inference.
hey, after you mmake one voice clone ai how do you make another one? Like, i wanna make another one do I just make another data folder but name it data set instead of dataset or do I put the files with the other data set, or do I delete the files from the last voice clone from that data set and replace it with my new audio files
Thank you so much!!! I finally got it to work!! I had to adjust the batch size down to 8 because the CUPA kept running out of memory (despite getting COLAB Pro 🥲) and it worked, however there is still some "robotic feedback" at higher points in the song that make it sound like it is glitching, but im thinking that maybe i can just add a glitch effect to my videos at those points or have a producer mix those bits out? If you have any suggestions on removing unnatural sounds from the final trained vocals, that will help alot. It could be possibly from mic feedback when recording singing vocals, but I am not sure. Your videos are *amazing* 🎉
@@Sahgee some voices have pitch limitations and can't hit pitches that the model hasn't seen, that or the input audio is a little too hard for it to match 🤟
@@Jarods_Journey thank you for responding! Do you have recommendations for combatting this issue? I was thinking of retraining my model with even more vocal (singing) samples of my voice in better quality including a similar rendition of the source song that I am trying to clone to my voice. Do you think that will help?
@@Sahgee adding more high quality data always helps, but if you're trying to clone a specific song(input song), listen to the input song to see if there are any weird artifacts with the vocal separated file. I've noticed that can cause models issues
06:45 Instead of the link I get an error Traceback (most recent call last): File "/content/Retrieval-based-Voice-Conversion-WebUI/infer-web.py", line 87, in class ToolButton(gr.Button, gr.components.FormComponent): AttributeError: module 'gradio.components' has no attribute 'FormComponent'. Did you mean: 'IOComponent'? So I guess that's where my attempt ends
There's an issue with the latest GitHub repository, seems like it broke the colab version. You might wanna check the fix that people said about here: github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI/issues/549
I've watched a different tutorial which didn't help me at all. Sound was robotic, and generally no much explanations whatsoever. But this one helped me, and the trained voice sounds perfect! I am extremely thankful for this. You are the best! :D
You may be executing code that is disallowed which may terminate your runtime without warning. Colab prioritizes interactive notebook compute and disallows some types of usage when executing code without compute units as outlined in the FAQ. Your compute unit balance is 0.Purchase more I am continuously getting this message Please help
CURRENT Issue 9-15-2023: I was made aware that Google has started banning RVC usage on free accounts, similar to what it did to stable diffusion. There is no fix for this ATM other than to get the PRO version of Collab.
IMPORTANT: You MUST click "Train feature index" at 12:07 in order to get the IVF index file you'll need later. As noted by another comment, this can be done before or after training.
Sorry about that guys!
Thanks !
what is the difference between the two? is there anything different about it?
good lookin out
what is the consequence? Is it train model -> train feature index -> one-click training?
@@NamDinh-b3u whats the diff between them
10:38 you say (and do) "click one-click training" but you display "do train model". so what is it now? which one is the right one?
Do train models then after training, click extract feature index
@@Jarods_Journey isn't that what one-click training does? (training + feature extraction with one click)
7:21 when i get to this step after following the rest correctly there are no files here for me to train
Check again at around 5:00 to make sure you mounted your drive and all of the file paths are editted correctly. If you're not seeing any data, then for some reason either the cell didn't finish correctly or there's some type of file path error.
@@Jarods_Journey yeah i did... the zip is ther and the guy whos helping me is confused about this too
My last programming was in 1997 with basic (i was 12). This is whole new universe. Thank you!
Trained IVF file didn't appear in log -> me folder. How could I solve this?
Sir, this is the error i am getting from content data set
Could not parse variable and value from ""/content/drive/MyDrive/dataset/AKALEYO_NEE_vocals.zip"". Expected the line to start with a variable assignment. please help me
I am having this same issue, I've tried everything, including just naming my zip file what the program seems to be looking for. No idea what went wrong. I don't even have the same opportunity under the "Dataset location" to edit the path as seen in the video. It is just all red saying "Could not parse variable and value from ""/content/drive/MyDrive/dataset/lulu20230327_32k.zip"". Expected the line to start with a variable assignment" and the edit symbols are greyed out and inaccessible. Not sure what has gone wrong, I followed everything precisely. Would love some help! Glad to see I'm not the only one!
Hey, at 15:28 i can't find the file in the model folder. Can u help me why
I have the same problem... any solution?
If you clicked the "Train" button and not the "One click training" button you have to also click the middle button "train feature index", that's what makes the .index file you need. You can do it before clicking training or after, it doesn't matter.
@@mikecameron2327 thanks :D
You earned a new subscriber ❤
And hopefully I bestowed some new knowledge :)
Cám ơn bạn quá phức tạp không làm theo được khi web hiển thị không giống như hướng dẫn
I dont have the window for dataset location...
14:44 - inference
in final stage it showing connection error out popup message bro. morethan half hour waiting
Hi, i see error " can't open file '/content/Retrieval-based-Voice-Conversion-WebUI/infer-web.py': [Errno 2] No such file or directory" at start the web step. How can I fix it?
i get the same error... have you found any fix?
hi the video is so straight but i have an issue i dont have the "trained_IVF201_Flat....." but i have train.log what should i do?
help... there is no gradio folder in temp folder
same
Wil it work with the Google Colab Pay As You Go options?
Hey! I've been following your tutorials for quite a while, and I must say - they are helping me A LOT. But in this one, I need a little help!
When I run the "start web" code that you show on 6:37 I get the error message saying:
/content/Retrieval-based-Voice-Conversion-WebUI
Traceback (most recent call last):
File "/content/Retrieval-based-Voice-Conversion-WebUI/infer-web.py", line 7, in
import faiss
ModuleNotFoundError: No module named 'faiss'
Have you seen something like this and how we can fix it?
Try re-running the cells in the Collab, faiss may not have installed fully or correctly
Hey man help i was stuck on start web...
can you help for the error
/content/Retrieval-based-Voice-Conversion-WebUI
python3: can't open file '/content/Retrieval-based-Voice-Conversion-WebUI/infer-web.py': [Errno 2] No such file or directory
Got it
How
@@supportteam8263 instead of using it in local machine use google collab
Hey! At 14:05 when I click on refresh the vocals, mine doesnt get refreshed. More files arent appearing like in the tutorial and im also getting another error besides the me_zip. I also get a "/content/drive/MyDrive/dataset/.ipynb_checkpoints: Is a directory" error. Can someone please help me?
Do you have any files that are .ipynb_checkpoints in your datasets folder? Not too sure unfortunately if files aren't appearing, could be possible some parts of the process never finished.
Nice, but the index file does not apear in logs, I the only problem I have had
Try clicking the middle button that trains feature index
How install it on Linux? make a video lesson, please. Thank's.
Why i can’t find traind vfi 201 11:24
This is missed in the video, but check the pinned comment of that video!
@@Jarods_Journey okay but what if i already separate the vocals from website called vocal remover when i hit refresh it gives me error 14:50
Very helpful - thanks! Whats the difference bw pm, harvest, and dio?
Not too sure, but harvest produces the best results in my testing.
There is a difference, I just haven't looked it up extensively lol.
@@Jarods_Journey No worries, thanks! Do you happen to know what "Search feature ratio" does? I tried setting it all the way left & right, without much difference
@@theAIsearch I think the affects accent of the voice so if at 1, it may retain more of the accent
when I tried opening the 'temp' folder nothing was showing
May mean that it never finished, could be a recently introduced bug or something changed if this is the case
@@Jarods_Journey alright ill try it again later thanks
Constant connection errors on every step in the public web RVC, for a second they seem to work and then give error messages. I managed to get to feature extraction but each time it just halts the process, says there is a connection error and sometime after colab disconnects
weird. my comment got deleted. Anyways , put this right at the start of Install dependencies if you keep failing to build pyworld and/or not be able to find faiss
!pip install pyworld==0.3.2
!pip install numpy==1.23.5
My comment was deleted too, I sent whole installation line which should've fixed that
Yt doesn't like full code comments or links, unfortunately, so it seems like it filters those out.
Appreciate the fix, but we'll have to wait until it's pull-requested over on the repo or updated 🤟
Thanks for your videos on RVC, they were very helpful to me to get started with this. One important detail, in the video you put up a graphic telling viewers to click the train button, not one click training. This was good because I think one click training is slightly bugged, but you forgot to mention that if you don't use one click training you MUST also run feature training (2nd big button) or you won't have an .index file.
Appreciate it! Man, I had to do a double take real quick, but I do say it to click it at 8:16 😅. I realized after editing that one-click training does all of the previous steps before, so it was redundant to click one-click training. If you run all the previous buttons before and click one click training, it just redos all of the previous steps.
Edit: misread the comment, looks like an oversight and a missed step on my end!
It’s now banned tho 😢
I'm running on pro version, I'm getting the following error:
I added a single .wav file to the /content/dataset-2/ folder inside my collab
start preprocess
['trainset_preprocess_pipeline_print.py', '/content/dataset-2', '40000', '12', '/content/Retrieval-based-Voice-Conversion-WebUI/logs/me', 'False']
/content/dataset-2/.ipynb_checkpoints->Traceback (most recent call last):
File "/content/Retrieval-based-Voice-Conversion-WebUI/my_utils.py", line 14, in load_audio
ffmpeg.input(file, threads=0)
File "/usr/local/lib/python3.10/dist-packages/ffmpeg/_run.py", line 325, in run
raise Error('ffmpeg', out, err)
ffmpeg._run.Error: ffmpeg error (see stderr output for detail)
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "/content/Retrieval-based-Voice-Conversion-WebUI/trainset_preprocess_pipeline_print.py", line 75, in pipeline
audio = load_audio(path, self.sr)
File "/content/Retrieval-based-Voice-Conversion-WebUI/my_utils.py", line 19, in load_audio
raise RuntimeError(f"Failed to load audio: {e}")
RuntimeError: Failed to load audio: ffmpeg error (see stderr output for detail)
/content/dataset-2/record.wav->Suc.
end preprocess
The preprocess seems to have worked, but make sure there are no other files in there and no spaces in your path. This is usually the fix that I see for people.
how much colab pro credit might be required
@Jarods_Journey I have a question about the dataset that you used to train the main model. Is it big? How big? I train french voices and they sing with an english accent! Cute, but sometimes weird!! I was wondering how to have a big french model and how many data was needed to do so. BTW huge congrats, your work is amazing. Love it.
Appreciate it! The datasets I use range from 10 minutes to 3 hours, but my average is around an hour of audio. In my experience, the index file controls the "accent" so you can try adjusting it to 1 and seeing if it results in a better accent
@@Jarods_Journey that's correct, english accent has almost gone. thx
Hey, everyone. I've trained a model with 400 epochs with voice samples of 10 min audio. When i'm doing the voice conversion, the words are not even pronounced correctly. The sound looks more like humming instead of speaking. What am i doing wrong? Appreciate your help.
i've trained a model and didn't give me the index file, how to fix it
Click the train feature index button
I have to say your tutorial is very useful so thank you for it but I have a question
the colab notebook seems to run an older version of RVC where there was no rvmpe
so how to run the latest version on colab?
@Jardos_Journey While I'm in Accompaniment and vocal seperation. I am not able to get any model to select like (H5 model). Can anybody please help me find my error.
Hlo its says file not found error at pretained / f04D0k.pth pls solve my problem 😢
hey i got error on the latest step model inference, first fail in console say RuntimeError: Failed to load audio: ffmpeg error (see stderr output for detail), and second this AttributeError: 'NoneType' object has no attribute 'dtype
Make sure there are no spaces in your folders or anywhere in your folder path
@@Jarods_Journey there are no spaces, again error
train feature index isnt working like its not saying ann error or anything its just not starting, it said it added soething and then nothing else happened
82058, 256),2104
training
adding
成功构建索引,added_IVF2104_Flat_nprobe_1.index
says that then doesnt do anything
That should mean it completed, it would then be in the logs folder.
@@Jarods_Journey oh right! I got it tysm it works
I understand what everything here is except for what the MP4 file is, and is for. Can you elaborate on that?
Ah nevermind. I understand. I was wanting to use this to do the realtime with my microphone. You wouldn't need the MP4 file for that I guess.
Traceback (most recent call last):
File "/content/-EVC-/extract_feature_print.py", line 13, in
version = sys.argv[6]
IndexError: list index out of range
['extract_f0_print.py', '/content/-EVC-/logs/Cristi', '2', 'crepe', '115']
how do i fix this
18:00 - that would be a good glitch voice for different Sci-Fi effects!
Most definitely 😂, I'm sure its highly possible to find some use cases for it
The python console in the Collab is throwing me errors about non-existent modules whenever I click on the "start web" cell. Why is this?
OKAY- would if there's NOTHING in your TEMP folder when getting an error on gradio????
Not sure, some error occured on the web interface, might wanna check the github issues to see if anyone else got this
ive got the same issue, did you let google collab seperate the vocals from the audio, or did you get another site to seperate the vocals
@@forest1605 no I seperated my vocals from an audio from a differ site
@@cryptidpet4325 that might be why. Im not entirely sure but when I did it with getting the vocals seperated using another site, it didnt work. But when I just used teh vocals that werent seperated and let google collab do it then it worked
Is there any way to move GPU work for AI voice changer in to Google Colab? Cuz im getting 50-60% usage on RTX 3070 with Voice Changer 😂
Hi Jarods, I am using Colab Pro, and didnt have issue at first, but now everytime I want to train model, it always shows me error, because two log folders are missing: 2a_f0 and 2b-f0nsf. If I add them manually, training will proceed to the end, but they will remain empty and my model won't change voice on cover song successfully.
I'm trying to solve the issue past three days, but without success.
Can you help me with that? Why are these folders missing now, and not before, and how this can be solved? Am I doing something wrong?
Thanks in advance!
what happens if the load package dataset doesn't work? it didn't work for me and im unsure why
NO I FIGURED IT OUT, I DID NOT NAME MY DRIVE FOLDER DATASET
The step 2a returns an error in loading audio in the webUI. The setup and folder paths are correct.
The same audio files load correctly in the offline version, but there I cannot train a model without GPU.
Make sure there are no space in the path. There may be other stuff but this is usually the fix
I had an interesting error come up when I started feature extraction.
This is what I was met with at the end
File "/usr/local/lib/python3.10/dist-packages/torch/nn/modules/conv.py", line 309, in _conv_forward
return F.conv1d(input, weight, bias, self.stride,
RuntimeError: Calculated padded input size per channel: (1). Kernel size: (2). Kernel size can't be greater than actual input size
all-feature-done
I believe I saw this error somewhere in the GitHub page, you might wanna check there on the RVC issues tab
is this how you train voice model to use voice ?
Yes, for the voice changer, yup.
Until yesterday it was working, now it isn't working anymore. The main errors that I noticed:
"error: subprocess-exited-with-error"
"Building wheel for pyworld (pyproject.toml) did not run successfully."
"Building wheel for pyworld (pyproject.toml) ... error"
"ERROR: Failed building wheel for pyworld"
"Failed to build pyworld"
"ERROR: Could not build wheels for pyworld, which is required to install pyproject.toml-based projects"
"ModuleNotFoundError: No module named 'faiss'"
That issue arose for me around the exact same time. Cannot get it to build that wheel all of the sudden, but it's not a ""ModuleNotFoundError" with me.
Im receiving this error
ERROR: Failed building wheel for pyworld
Failed to build pyworld
ERROR: Could not build wheels for pyworld, which is required to install pyproject.toml-based projects
any idea why?
I install all of the cells and when i click on the web Portion it says Python 3 cant open file or directory, What am i doing wrong?
I am training for 300 epochs but after 20 epochs it shows connection error out.
What to do
You would have to restart everything, however, I think Google is starting to cut down on free usage of Collab...
Why does it always do runtime disconnect for me
All process of training complete but in Weights i can not see anything (.pth file)....why?
That means there was some type of error in the training process and it never got to outputting a final trained model. In this case, you'll have to run training again for the model
Hello, when I use collab, the most time-consuming connection is suddenly disconnected, and when I try to connect again, it does not work at all. This issue has been happening to me for several days, please give me a solution so that I can use it again.
how do we train more than 1000 epochs for the model?
Thank you so much mate. But I've got a situation here. after accompaniment and vocal separation and press CONVERT, I always got Connection errored Out. and can't convert. what is that? what i gonna do?
i dont have gradio in my temp folder
Hi I am on mac, I don’t run it locally , I trained 2 models , now I can’t find them, I didn’t download them, do I have to start all over again? Or they are somewhere on my mac? Thank you
Im gonna need some clarification.. everything is on the Google drive. If you didn't download anything, nothing will be on your Mac. If you didn't save them after training, they will have been lost and you'll need to retrain them
@@Jarods_Journey thank you so much for everything u do, yes I had to retrain, my fault.
I'm running rvc in google Collab ,, after one hour of training model the google Collab disconnect automatically and i reconnected again but every thing restart and lose everything ,, and show me this message in gradio no interface
it there any solution ?
Unfortunately if it disconnects during training, it doesn't retain any of the info so all of that data is lost. Just a limitation of Collab notebooks and they only allow you to use it for around 12 hours.
is there any way like Collab to run it ? because it disconnected after one hour and I couldn't make anything@@Jarods_Journey
Is it possible to do this on the windows terminal because whenever I try to do it, I keep loosing connection so is there a more permanent solution?
So, theoretically, if it disconnects because of 12 hours period, how am I supposed to finish extraction and training faster? Or there're some checkpoints for data, so I can keep going from the previous one?
('Cause it threw me error at first try and after that collab didn't want to start again)
I can't see database section after giving permission
how to add back instrument?
Need audio editing software, you could use ffmpeg or audacity
I 've done exactly as your guide, but when i hear the output, it's still the same vocal as inputted. Can you help me out here?
I haven't seen this issue before, does it happen on all voices? If so, could be an RVC bug that you might want to repot to their github page
@@Jarods_Journey Thank you, I've checked it again and again, and then I found my flaws. Also, I want to ask, how can I train the bot more if I come back on another day? How can I use the last trained version instead of starting training all over again?
I trained it for 200 epochs, how do I reuse that model to train it to 400 without having to start it from 0 epochs again?
I don't see anything under the inferencing voice tab. I have refreshed the voice list. I do see the index logs file under the "path to the .index" file tab, however. Process data step ended with "end preprocess" and the feature extraction step ended with "all-feature-done". Can anyone help?
Thanks for your content, Jarod.
I might need a bit of clarification, but I believe you have everything finished that you need to train. Once the training is done, in the inference tab, you gotta click the refresh timbre button.
One additional thing is check to see if the training outputted a .pth file in the weights folder, if there's nothing there, the training didn't finish correctly!
@@Jarods_Journey Yeah I when I check the weights file there isn't any .pth file. It is weird.
Anyway, I watched your Google Colab tutorial for SVC and I managed to make it work! Thank you so much, mate. Keep it up.
Does anyone know how to stop the voice changer from saying what I say twice?
So do you train like for example 1 1/2 hour of your own voice with the harvard sentences(or maybe something better I'm unaware of) and then use 1 1/2 hour of a mp4 vocals or whatever voice you're trying to clone to go into the realtime AI voice changer?
You can do that, but the voice samples don't matter as long as they're complete words and there's no background sound. Once an RVC model is trained, that is what goes into the voice changer/
I have a 10 minute voice dataset but it's 1 file.
Do I need to split it like 1 sentence per file or 10 sec like that?
Yes, you need to split it into 10 second audio samples or you will run out of memory during training.
Hi man! Just a query! I have trained my voice model for narration yesterday. Now, I would like to convert a TTS voice to my voice. I followed your instructions restored the path before going to web interface. Now, as you mentioned we have to put the path address of IVF file in database file path but now, I only see G and D file of my model when I am going to RVC - logs - my project. Can you please help on where to find the ivf file when we are coming on the next day.
You want to grab the weights file, inside of assets/weights. That is your voice model. Inside of logs/ is you're index file
how to continue training my model ... I had trained it for 200 epochs ... but it still not that good ... Is there any way to continue training it ... Or should I train a new model ???
Can we do this on low end pc ?
Colab runs on google servers, so you can run it on any device :D
@@Jarods_Journey Thanks
When i want to make a model crepe is Best or harvest?
Which one do you think is better for singing voice, so-vits-svc (and fork version) or RVC?
RVC
@@Jarods_Journey Nice! have you tried the mangio-RVC-fork?
I successfully trained a model with 20 epoch. When i try to train it another day with 200 epoch, just right before it generating the index file. The whole colab page crashed and the gradio page shows “Error: Connection errored out” .
why did that happen :(? thanks in advance!
Not sure what might be occuring here, but some error must be happening causing the console to stop. Might have to rerun or show what error occured on the cell
@@Jarods_Journey thanks for the reply, i think it's because some data sample size too big. solved.
Thanks for your vedio! I face this error when I run "Start Web" -
[Errno 2] No such file or directory: '/content/Retrieval-based-Voice-Conversion-WebUI'
/content
python3: can't open file '/content/infer-web.py': [Errno 2] No such file or directory
Do you know how to fix that? I do have my python install in my laptop!
Thanks!
same error you found a solution ?
Is there TTS for RVC?
Do you know what is different between added and trained IVF file?
I'm assuming just a smaller, quantized file containing latents for the voice. For RVC inferencing, you'll just use the trained one.
@@Jarods_Journey but trained file is not backed up, only added one
th-cam.com/video/9wu6LSue_dU/w-d-xo.html
The gradio folder did not pop up for me, what's the issue?
x2
The may have been an error or issue when trying to do the audio file, on colab, I'm not too sure what else
I dont get train model file
Question when executing "restore pth from google drive": After re-importing from GDrive to goolgecolab the *.index and the *.npy file are uploaded into the "content" folder of googlecolab. But is this the correct folder where those file should be in the end? Because afterwards when I am in the "model inference" tab, I have to choose then the "Feature search database file". I should then link to the *.index file which is now lying under "content", is that correct?
and: how many epochs do you think are perfect? is it possible to overtrain and get bad results when putting the epoch too high? let's say eg. 200?
Thank youuuu!
Yes, when you reupload back to colab, you have to copy all of the correct file paths to the spots where they belong at. There is no perfect epoch count as it is dependent on data, I haven't seen any instance of overtraining and it's not that severe when training on these small datasets.
Can I use on My Mac running, Os Mojave?
I m using Imac
do i delete old *.pth files if my latest file is D_98000.pth and G_98000.pth??
You can to save space. Some people like to go back and listen to them to make sure they didn't "overtrain", but most cases it should be fine
thank you so much for full detailed tutorial , after watched so many videos finally i found your video with full detail step by step. thank you so much 🙏
Appreciate it, glad I could help :D!
Great Tutorial. I'm having some problems with the last part, even after using a 2 min. short audio (inference) I'm getting the timeouts and there is also no folder with the name Gradio in the TEMP folder. There are only couple of INFO messages in colab and nothing else.
Hmm, not sure what's happening here, it might not actually be processing then. Have you tried restarting runtime?
@@Jarods_Journey actually not, I only tried rerunning the cell. I'll try restarting the runtime as well, then post the results here
@@theentirecircus6623 hi ! I have the same issue. I've restarted Runtime a few times but it doesn't help. My TEMP folder is empty. Did you solve the problem?
@@obeyoutube I've just tried again with another notebook and it worked (I've waited couple of minutes after getting the timeout error). @Jarods_Journey I can link the notebook if it's okay, but it's not from the original repo
@@theentirecircus6623 Link it please
why did I just set the epoch to 100, but it appears G and D files have up to a few thousand? which one is better?
The RVC epoch is actually weird in the way they named it, 100 actually means 10k epochs. The G and D files have a few thousand due to this and are named for the step count it takes to go through the data. If you're familiar with sovitssvc, this might make a little more sense. You use the G for any type of inference.
hey, after you mmake one voice clone ai how do you make another one? Like, i wanna make another one do I just make another data folder but name it data set instead of dataset or do I put the files with the other data set, or do I delete the files from the last voice clone from that data set and replace it with my new audio files
You can just use another name or delete the old one (rename) and start over
for stable diffusion I was using civitai for some pretrained models and examples. Anybody knows if Is there any website that I can use for RVC?
AI hub discord
@@Jarods_Journey thank you
If we want to retrain with additional vocal samples, do we have to start the entire process over?
I recommend you do restart. I believe you can add more samples, but you do have to rerun all the preprocessing steps again.
Thank you so much!!! I finally got it to work!! I had to adjust the batch size down to 8 because the CUPA kept running out of memory (despite getting COLAB Pro 🥲) and it worked, however there is still some "robotic feedback" at higher points in the song that make it sound like it is glitching, but im thinking that maybe i can just add a glitch effect to my videos at those points or have a producer mix those bits out? If you have any suggestions on removing unnatural sounds from the final trained vocals, that will help alot. It could be possibly from mic feedback when recording singing vocals, but I am not sure. Your videos are *amazing* 🎉
@@Sahgee some voices have pitch limitations and can't hit pitches that the model hasn't seen, that or the input audio is a little too hard for it to match 🤟
@@Jarods_Journey thank you for responding! Do you have recommendations for combatting this issue? I was thinking of retraining my model with even more vocal (singing) samples of my voice in better quality including a similar rendition of the source song that I am trying to clone to my voice. Do you think that will help?
@@Sahgee adding more high quality data always helps, but if you're trying to clone a specific song(input song), listen to the input song to see if there are any weird artifacts with the vocal separated file. I've noticed that can cause models issues
What is the difference between 2 audio.wav file in TEMP folder and why we need to wait both of them?
Those are the outputted files from the webpage. If you don't download them, they get deleted.
06:45
Instead of the link I get an error
Traceback (most recent call last):
File "/content/Retrieval-based-Voice-Conversion-WebUI/infer-web.py", line 87, in
class ToolButton(gr.Button, gr.components.FormComponent):
AttributeError: module 'gradio.components' has no attribute 'FormComponent'. Did you mean: 'IOComponent'?
So I guess that's where my attempt ends
There's an issue with the latest GitHub repository, seems like it broke the colab version.
You might wanna check the fix that people said about here: github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI/issues/549
@@Jarods_Journey Thanks, it worked!
How do I get the pro version? Will there be a free version?
I have a fully completed model with 270 epoch but couldn't build on that to have more epochs ?
If you could help me how can I do that ?
Sorry looks like my original shorts response was for local installation. For colab, I'll have to get back to you on it.
@@Jarods_Journey thanks in advance ..
i was getting lots of bugs on my windows laptop with amd hardware, so this is a good alternative
Is there a way to use the model you train here in a normal text to speech engine? If yes, can you point me in the right direction for that?
Sorry, unfortunately not. You'll wanna check out my videos on tortoise TTS to RVC pipeline video
I've watched a different tutorial which didn't help me at all. Sound was robotic, and generally no much explanations whatsoever.
But this one helped me, and the trained voice sounds perfect! I am extremely thankful for this.
You are the best! :D
Appreciate it 🙏!
FileNotFoundError: [Errno 2] No such file or directory: 'pretrained/f0G40k.pth' which was already in that column
Go through the installation cells again and make sure no errors there. That's saying it failed to download some files from hugging face
Is it possible to do Text-To-Speech using RVC?
You may be executing code that is disallowed which may terminate your runtime without warning. Colab prioritizes interactive notebook compute and disallows some types of usage when executing code without compute units as outlined in the FAQ.
Your compute unit balance is 0.Purchase more
I am continuously getting this message
Please help