Realtime AI Voice Changer Using RVC (Retrieval-based Voice Conversion w./ w-okada)

แชร์
ฝัง
  • เผยแพร่เมื่อ 31 ธ.ค. 2024

ความคิดเห็น • 2K

  • @Jarods_Journey
    @Jarods_Journey  ปีที่แล้ว +353

    README!! Not downloading? 👇The VC is continually being updated so the version showed off in the video is no longer available. If you run into errors, you may have to try out the other versions to see if that resolves issues.
    Latest version as of this update: 1.5.3.8a
    I expect that I'll have to make a follow-up video.
    If the google drive link is down, use hugging face website. Look at the names there and determine what to download based on:
    cuda - Nvidia
    directml - AMD
    mac - MAC

    • @nokinirus
      @nokinirus ปีที่แล้ว +10

      Ah yes. The Mac uses mac, with a side of mac.

    • @giggy_rook
      @giggy_rook ปีที่แล้ว +10

      i downloaded it but it wont show my gpu it only shows my cpu and yes i use amd

    • @nokinirus
      @nokinirus ปีที่แล้ว

      @@giggy_rook bro check if you dl-ed the cpu version. Because the first one says it'll work with your cpu, the mid one is cuda, and the last one's amd.
      Edit: if you're downloading from the archive...

    • @kaoruofficialtv
      @kaoruofficialtv ปีที่แล้ว +3

      @@nokinirus i also have amd and it doesnt show the gpu only cpu. and yes i downloaded the directml one. I even tried downloading other versions and different types

    • @giggy_rook
      @giggy_rook ปีที่แล้ว

      @@nokinirus still doesnt work

  • @NoztrozeR
    @NoztrozeR ปีที่แล้ว +4372

    I have a creeping suspicion that the vtuber market is going to get a whole lot weirder with this tech improving.

    • @Everfalling
      @Everfalling ปีที่แล้ว +443

      those voice changer jokes are gonna be legit now

    • @Reydriel
      @Reydriel ปีที่แล้ว +141

      TBF there's a popular one that literally all AI atm, though her creator has arguably become even more popular lmao, it's kinda nuts what he's been able to make

    • @harrytsang1501
      @harrytsang1501 ปีที่แล้ว +17

      Have always been

    • @CyberMonkey03
      @CyberMonkey03 ปีที่แล้ว +42

      @@Reydriel Neuro

    • @VallenChaosValiant
      @VallenChaosValiant ปีที่แล้ว +37

      In reality there are no shortage of women willing to do the job. Although many of them are self concious and still use a voice changer just to make themselves sound cuter/younger/whatever they felt inadequate about. Don't forget that 50% of the world are women and plenty find vtubing appealing.
      In real life the highest Youtubing earners are MEN, so if anything you lose money by voice changing into a female.

  • @amruzaky4939
    @amruzaky4939 ปีที่แล้ว +723

    I'm not ready for fluently English speaking Marine-senchou.

  • @Phoon1G
    @Phoon1G ปีที่แล้ว +1347

    Under Audio options i found that choosing Server instead of Client, makes you sound a lot more realistic and takes away most of the robotic features

    • @zak_facts2676
      @zak_facts2676 ปีที่แล้ว +5

      where do i find that
      ?

    • @noobio5510
      @noobio5510 ปีที่แล้ว +27

      @@zak_facts2676 pretty sure its right under the S.Thresh, there a option next to AUDIO: for client or server

    • @Mrstan45
      @Mrstan45 ปีที่แล้ว +3

      yes but it messed with my audio setup though

    • @aspen-
      @aspen- ปีที่แล้ว +4

      @sorasong6780 under the download there is a "huggingface" button if you click that it works :)

    • @elpis8784
      @elpis8784 ปีที่แล้ว

      What should I put the audio output to? There's MME but idk what that does

  • @ElemXCR
    @ElemXCR ปีที่แล้ว +481

    Some japanese VAs agencies are hammering down on their VAs' AI voices.
    It's gonna be interesting to see where this will go. While I hope I'd be able to use some of my favorite JP VAs voices and have it for english content, I'm betting there will be much more worse abusers and signing it off as their own.

    • @Jarods_Journey
      @Jarods_Journey  ปีที่แล้ว +64

      Rules and regulation are gonna be needed for sure, but there's no legal precedent for this yet so it's all up in the air for how it's gonna be dealt with in the courts. As with all things, there will be bad actors and despite me looking, I am yet to find any good resources or detection tools that can keep up with these advances.

    • @SamiTheAnxiousBean
      @SamiTheAnxiousBean ปีที่แล้ว +21

      and I mean...somewhat rightfully
      If someone isn't comfortable with their voice being replicated, you shouldn't do so/leave a replication up

    • @bendover9620
      @bendover9620 ปีที่แล้ว +12

      Always remember when dealjng with strict japanese laws:
      They can't touch you if you're outside their country or if your country has no reciprocation laws to back them up.
      If you get copystriked, just make another account ad infinitum, assuming you're anonymous.
      When all hope is lost, the worst-case scenario is to upload on BiliBili. Let's just say China and Japan aren't really on speaking terms.

    • @ridervtb
      @ridervtb ปีที่แล้ว +4

      @@Jarods_Journey how do you get more ai models? i cant seem to find other voices to download

    • @TheMastertbc
      @TheMastertbc ปีที่แล้ว +2

      imagine buying license for gura voice

  • @sitearm
    @sitearm ปีที่แล้ว +38

    I very much like hearing the actual before and after effects and the detailed walkthrough. Thank you for posting!

  • @Fahad-21
    @Fahad-21 ปีที่แล้ว +242

    It works but latency is pretty high. Lowering chunks improves that but you lose a lot of the content of what you are speaking. One thing to note is to make it most natural sounding, always tune it to a number that is closest to the voices natural sound. Like around 22 for that first model. Also any idea how to turn off real time playback? It's easier to use the record and then playback for any projects.

    • @Jarods_Journey
      @Jarods_Journey  ปีที่แล้ว +42

      If you don't need the realtime functionality of it, you might be better off recording audio and then converting them in the RVC interface. You could always increase chunk size and there is a record fucntion on the client.

    • @_Chessa_
      @_Chessa_ ปีที่แล้ว +3

      @@Jarods_Journey this is very helpful knowledge thanks for this.
      And thanks for asking this question also.

  • @VIPPyroTM
    @VIPPyroTM ปีที่แล้ว +790

    Holy shit, the amount of power of turning into a Hololive Girl is getting closer!
    Also, that Marine voice when she’s speaking English fluently is just so damn uncanny to imagine that there’s a timeline where Marine learned English SUPER WELL.
    I hope there’s a program that compiles all the complicated setup into an easier way of setting up since I’m tech savvy 😂

    • @DunceInAwhile
      @DunceInAwhile ปีที่แล้ว

      True. Now people are going to view Hololive creators a little differently... Especially since most of the Hololive girls go to great lengths to hide their true identity. Makes you wonder...

    • @DoffDoffinson
      @DoffDoffinson ปีที่แล้ว

      @user-xp9kq7xb6p You'll do it to me >:)

    • @niahonjou1933
      @niahonjou1933 ปีที่แล้ว

      help,i have too much errors

  • @sabereaseera1384
    @sabereaseera1384 ปีที่แล้ว +15

    Recommended to me randomly. You are super underrated.

  • @chazington2
    @chazington2 ปีที่แล้ว +6

    i did not know a tutorial video can be this pleasent and nice, i know this is a weird compliment but you're really good at making tutorial videos

  • @rolfathan
    @rolfathan ปีที่แล้ว +22

    This is going to be so great for online role playing games. Being able to make a custom voice that you can use that matches your avatar will really increase immersion.

  • @cartoonhyperfixated
    @cartoonhyperfixated ปีที่แล้ว +95

    This is insane 😭 crazy how people can replicate voice’s by using AI in real time

    • @lonelybookworm
      @lonelybookworm ปีที่แล้ว

      ​@@freedomofwordbruh 2 seconds is real-time for most purposes

  • @PaxPolaris-kt1vr
    @PaxPolaris-kt1vr ปีที่แล้ว +36

    This was incredibly helpful! I seen your video on TikTok and came here right away. Thank you so much for making this video; I couldn't of figured out that program without it!

    • @Jarods_Journey
      @Jarods_Journey  ปีที่แล้ว +4

      Appreciate it. I'm surprised at how much traction it gained haha.

  • @Skiedragon
    @Skiedragon ปีที่แล้ว +112

    If anyone is experiencing very choppy sound, like your voice cutting off after every 'chunk', you can try changing the AUDIO to "server" instead of "client". Eliminated all choppiness for me.

    • @ge2719
      @ge2719 ปีที่แล้ว +4

      Doesn't that mean your using server somewhere, and likely giving them all your audio data you're creating?

    • @Tryharding69
      @Tryharding69 ปีที่แล้ว +1

      @@boombattlefields9123 ☠ bro... let's go

    • @tripleheadedmonkey420
      @tripleheadedmonkey420 ปีที่แล้ว +2

      Just don't worry about it. If you've ever mysteriously had an advert for a product you just mentioned, outloud near an active device, pop up in your feed, you're already having everything you say parsed by some sort of analytical algorithm. This, while an additional outgoing stream of data from you, is at least one that you are aware of and have some control over.
      The only thing I could really do to guarantee my phone isn't listening to me type, even this sentence right now, is to put it in the microwave to block any outgoing signals. At least all you have to do is shut off the program and they aren't able to parse your data anymore.

    • @denjiaisaka2186
      @denjiaisaka2186 ปีที่แล้ว

      can you help i cant hear my voice in program

    • @Kopie0830
      @Kopie0830 ปีที่แล้ว

      Tried this and there seems to be no change in the voice even after changing the tunes hmm...

  • @DjTonioRoffo
    @DjTonioRoffo ปีที่แล้ว +3

    Your Chopping is because of threshold set all the way up. It does a cut off of the input under a certain volume. Make it a lot lower (almost completely at the other side actually)

  • @0AThijs
    @0AThijs ปีที่แล้ว +61

    2:25 for anyone wondering why it's stuck at
    Booting PHASE :__main__
    Voice Changerを起動しています。
    please wait, it may take a few minutes.

    • @shat01j
      @shat01j ปีที่แล้ว

      Great help Thanks!

    • @shinigamiwolfen
      @shinigamiwolfen ปีที่แล้ว +1

      日本語上手

    • @Ryyza7
      @Ryyza7 ปีที่แล้ว

      @@shinigamiwolfen hahah nani kore got jozued

    • @ミカ-m9p
      @ミカ-m9p ปีที่แล้ว

      thanks literally after reading this it finally did it haha

    • @frits4061
      @frits4061 ปีที่แล้ว

      Thnx for the tip I was searching for!

  • @hellfrozen3678
    @hellfrozen3678 ปีที่แล้ว +126

    I swear github is like the holly grail,I just learned about it recently but now I realise that every kind of software can be obtained from there and for free

    • @Jarods_Journey
      @Jarods_Journey  ปีที่แล้ว +24

      Yuppppp, hometown of lots of open source and many, many awesome things on there.

    • @tripleheadedmonkey420
      @tripleheadedmonkey420 ปีที่แล้ว +3

      Is it weird I've seen this exact comment, word for word, pop up on almost every video related to AI in the last few weeks? xD

  • @piplupsuper0
    @piplupsuper0 ปีที่แล้ว +31

    Jarod thanks for these videos!
    you've really helped me out a lot appreciate your content man it's fun keeping up with the new stuff you showcase!

    • @Jarods_Journey
      @Jarods_Journey  ปีที่แล้ว +2

      Appreciate it man! It's all wild and crazy tech and it's an adventure everyday checking these things out!

  • @infalogger9697
    @infalogger9697 ปีที่แล้ว +8

    the reason it says smartscreen protected you is because the dev hasent signed the app with microsoft, but thats because doing that costs 300 a year

  • @unedited12
    @unedited12 ปีที่แล้ว +4

    Your stuff sounds SO much cleaner than mine, and I even try to use a very clear voice

  • @ResmondSam
    @ResmondSam ปีที่แล้ว +51

    Just wondering, are there any resources online where people can post their own trained voice weights? It'd be convenient as you won't have to keep training your own voice for the voice changer in case somebody else already happened to do so.

    • @Jarods_Journey
      @Jarods_Journey  ปีที่แล้ว +13

      Someone let me know of one called AIhub discord group

    • @SynFuZe
      @SynFuZe ปีที่แล้ว +1

      @@Jarods_Journey is there a quick invite link anywhere? I can't seem to find the group anywhere

    • @literailly
      @literailly ปีที่แล้ว

      Anything on huggingface?

  • @RuTo94
    @RuTo94 ปีที่แล้ว +38

    It’s crazy to believe that there’s actually people that design voices for these Vtubers to design to there preference on how they want it to sound. More power to them.

    • @tripleheadedmonkey420
      @tripleheadedmonkey420 ปีที่แล้ว +13

      It will be used for this purpose, yes. However, it actually exists so that Horny men can privately moan at themselves in Waifu-speak.

    • @SirGlazer
      @SirGlazer ปีที่แล้ว +9

      @@tripleheadedmonkey420bro why did you put this idea in their head

    • @tripleheadedmonkey420
      @tripleheadedmonkey420 ปีที่แล้ว +8

      @@SirGlazer "Their head" he says while desperately trying to hold back the tears as his Tsundere anime waifu life begins anonymously.

    • @SirGlazer
      @SirGlazer ปีที่แล้ว +4

      @@tripleheadedmonkey420 😭

  • @celestraic
    @celestraic ปีที่แล้ว +8

    Tsukuyomi-chan's project & her creator are so inspiring! She is a free voice project across a whole number of engines, mostly any free Japanese speech & singing synthesis programs. I definitely recommend that people check out some of her other resources & samples because she is really a treasure of a voice!

  • @Chrispyy__
    @Chrispyy__ ปีที่แล้ว +18

    For those of you watching this and you cant see your GPU listed under the GPU tab this is what you do. Where the Audio section is where it says "Client or server" click on server, go back to the GPU tab to make sure your GPU shows in the drop down list, and then you can click back to client or leave it on server. It worked for me.

    • @jamesduke151
      @jamesduke151 ปีที่แล้ว

      what gpu do u have? AMD or Nvidia

    • @Chrispyy__
      @Chrispyy__ ปีที่แล้ว +1

      @@jamesduke151 AMD Ryzen 5 3600

    • @jamesduke151
      @jamesduke151 ปีที่แล้ว

      @@Chrispyy__ does a drop down menu for the GPU appear like in the video for you? On mine there is a 0 1 2 3 instead

    • @Chrispyy__
      @Chrispyy__ ปีที่แล้ว

      @@jamesduke151 mine just shows my GPU name. I don’t have any numbers

    • @jamesduke151
      @jamesduke151 ปีที่แล้ว +1

      @@Chrispyy__ ok thanks. Are you using the latest version?

  • @gallanomarkandrea.1787
    @gallanomarkandrea.1787 ปีที่แล้ว +1

    Sloppy Walrus your a MENACE to society for setting this up for your video XD

  • @realjgerard
    @realjgerard ปีที่แล้ว +186

    As a highly trained vocalist, I have been waiting for this to be a reality so that I can create cover songs that one could only dream to hear, like Freddy Mercury, Curt Cobain, and Steve Perry singing on the same ballad!! If anybody seeing this has the capability to train voices and would like to collab on a project, let me know! I’ve yet to figure out the training but I have an entire professional quality studio set up and I’m ready to get to sangin!! Let’s GOOO!!!! 🚀🚀🚀🚀

    • @Jarods_Journey
      @Jarods_Journey  ปีที่แล้ว +22

      Funny you comment this because a video I'm gonna be releasing is talking about the potential to turn my untrained voice into something that is bearable... simply by using a trained model xD. Another use case is say you throw a bunch of filters onto a voice and don't want to do post-processing ever again. Well, if you just get enough voice samples... you could essentially just "sing" and then BOOM, it's all edited. Still a little bit of issues ofc, but.......... it's super exciting lol.

    • @kylespevak6781
      @kylespevak6781 ปีที่แล้ว +11

      People have already been doing similar with rap. It's definitely cool!

    • @GNR_Fan
      @GNR_Fan ปีที่แล้ว

      @@Jarods_Journeycount me in… I am chasing the real time to help how song like AXL ROSE… how can I help to make real time effect a reality?

    • @neek01
      @neek01 ปีที่แล้ว +1

      Personally for me i’d use it for music production, so so much easier to draft a song when you can hear fitting voice with it for an actual artist to sing later. I usually sing a bit myself but having a fairly low male voice, i can never do a female voice

    • @krakentren7988
      @krakentren7988 ปีที่แล้ว

      Hey, i am a music producer, where can I contact you? This is my first private account

  • @rommix0
    @rommix0 ปีที่แล้ว +8

    I've started using RVC for some of my videos. I was able to change Clint's voice (Clint from LGR) to Duke Nukem's voice for a Duke Nukem review he did some years ago.

    • @Jarods_Journey
      @Jarods_Journey  ปีที่แล้ว +1

      Haha that's awesome. RVC is quite good so I can see it being used in a lot of places.

    • @rommix0
      @rommix0 ปีที่แล้ว

      @@Jarods_Journey Definitely. Compared to SVC, it's the best in regards to replicating consonants with the least amount of smearing.

  • @AlvinTheLAW
    @AlvinTheLAW ปีที่แล้ว +18

    That was so weird hearing Senchou speaking native-like english

  • @MartHommes
    @MartHommes ปีที่แล้ว +17

    I went along with this tutorial and everything went smoothly until i opened the program. Above I only have the "clear settings" button when there should also be "reload" and "select vc". My screen looks like 3:07 without those buttons and without the voices to choose from. The "edit" section for the voices is completely empty for me and I'm now stuck and don't know what to do since I'm not too advanced when it comes to computers. Does anyone know how I could fix this?
    EDIT: Nevermind I fixed it! If anyone else ran into this it's easy to fix. Under "NOISE" you have the "F0 DET." thing. It's on "dio" by default and when you switch it to one of the other modes the different voice models will appear.

  • @RandomGuy0987
    @RandomGuy0987 ปีที่แล้ว +4

    WOAHHH 6:25 that's totally Marine's voice speaking fluent English. Crazy.

  • @espae_
    @espae_ ปีที่แล้ว +6

    can you make a tutorial or is there already one of how to make your own model for real time talking? i know there's ones for singing but if I want a better talking model how much data should I use? podcasts maybe?

    • @Jarods_Journey
      @Jarods_Journey  ปีที่แล้ว +2

      I do have to link it here: th-cam.com/play/PLknlHTKYxuNshtQQQ0uyfulwfWYRA6TGn.html
      The same models used to train in RVC can be either singing or talking models, just depends on what audio data you curate and train it with. I recommend start with 10 minutes of super, high quality data that is clear and then increase it if the model isn't good enough.

    • @Alice_Fumo
      @Alice_Fumo ปีที่แล้ว +1

      It's a bit of a crapshoot. I've somehow had amazing results with something like 2 minutes of ludicrously high quality audio data and not quite as good results with several hours of also very high quality data.
      It seems that there are a few types of voice which just happen to work better.
      Whatever you do, make sure to only use data which is as good as you can get.

  • @akiodemon
    @akiodemon ปีที่แล้ว +43

    The catfishing is gonna be wild..
    Anyways, thank you for uploading this video for others like me to see, It is gonna be cool to try out.

    • @Jarods_Journey
      @Jarods_Journey  ปีที่แล้ว +16

      Immediately one of the first things I thought about lol, it's gonna get wild. But also, the more you're in the know, the less likely you're to fall for any types of these things as well.

    • @SDT493
      @SDT493 ปีที่แล้ว +1

      LOL ME

    • @Rinno-sempai
      @Rinno-sempai ปีที่แล้ว +2

      so more males are gonna be applying to be part of a middle range agency (that cannotnmake too much background check( using these filters lol
      Catfishing and also contract breaking (one female can work at 2 or 3 agencies without her being voice recognized lmao)

    • @MaxKrovenOfficial
      @MaxKrovenOfficial ปีที่แล้ว +2

      This is gold for the Vtubing community, actually.

    • @dra6o0n
      @dra6o0n ปีที่แล้ว +1

      @@MaxKrovenOfficial It's also bad because agencies and companies wants to see you literally in person in order to setup any sort of contracts or deals, but it also opens up to scams and such because you can impersonate other people very easily...
      For instance it might hide the indian scammer's bad accent and fool a lot more people who are usually aware of these people and their bad voices.

  • @YurgenGrimwood
    @YurgenGrimwood ปีที่แล้ว +6

    I give it 5-10 years and we can just prompt a website to generate media to consume. At least that means I can finally get a second season for all those shows that didn't get one...

  • @nkozifraser2331
    @nkozifraser2331 ปีที่แล้ว

    whoa! good to see you finally get the views you deserve brutha!!

  • @gamecreator7214
    @gamecreator7214 3 หลายเดือนก่อน +3

    If you don't have the voice actor icons, you downloaded a past version or server ( I am incompetent, don't ask me). You need to download a client version and it is at the same page in the start and currently will direct you to download it from hugging face. It is 2.+ version. It helps if you choose english on the git page... Took me a day to figure it out, never again.

  • @crusader_gaming8273
    @crusader_gaming8273 8 หลายเดือนก่อน +7

    Discord nitro here I come

    • @Ozzy622
      @Ozzy622 2 หลายเดือนก่อน

      Free money, here i come!

  • @jackyisking
    @jackyisking ปีที่แล้ว +54

    For the song covering maybe, but this seems crossing the line into creepy. 😂

    • @Jarods_Journey
      @Jarods_Journey  ปีที่แล้ว +9

      It was much, much better than I thought 😅, but mind-blowing tech nonetheless.

    • @Tilt_TM
      @Tilt_TM ปีที่แล้ว +4

      This would be hilarious for messing with people in VOIP games like Battlebit Remastered

  • @itsonlyjurko4080
    @itsonlyjurko4080 ปีที่แล้ว +2

    2:24 when i open the bat file, for some reason the download you mentioned isnt starting, is there any way to fix?

    • @mtnocap7114
      @mtnocap7114 ปีที่แล้ว

      This is only a problem with the new file, download the old version and everything will work

  • @greenish16
    @greenish16 ปีที่แล้ว +4

    Cool video!! Try to make more often longer videos, more fun and exciting! ❤️

  • @KebabTM
    @KebabTM ปีที่แล้ว +18

    For the index option, it will improve your quality. It wanted you to choose the file starting with added_IVF5870 rather than the npy file.

  • @Reydriel
    @Reydriel ปีที่แล้ว +3

    Still has a very distinguishable "robotic quality" to it, but that will probably improve

    • @Jarods_Journey
      @Jarods_Journey  ปีที่แล้ว +2

      This is the worse the technology will ever be... so yeah, a bit spooky.

  • @StarAllKungfu
    @StarAllKungfu ปีที่แล้ว +1

    This would be awesome for online TTRPG's. As a deep voiced male, the best I can do is an intimidating Hag. I'd like to get some other female voices.

  • @YonasanErihhi
    @YonasanErihhi ปีที่แล้ว

    Such a great video, Thank You very much bro!

  • @nihilvt
    @nihilvt ปีที่แล้ว +7

    I thought it would be REALLY good, but I didn't realize it takes more resources than Chrome and Photoshop. As soon as anything else needed some GPU, it started stuttering and became unusable. I hope it gets good enough to not need more than 12GB of VRAM.

    • @Jarods_Journey
      @Jarods_Journey  ปีที่แล้ว +1

      You might be able to offload it to run on CPU instead of GPU, but yeah, most of these AI projects are pretty compute hungry.

    • @forest1605
      @forest1605 ปีที่แล้ว

      @@Jarods_Journey how

  • @cambeckett
    @cambeckett ปีที่แล้ว +3

    this is super cool!! is there a way to use this as an input for a discord call or something?

  • @neo7538
    @neo7538 ปีที่แล้ว +1

    hearing Hoshio Marine speaking fluent english is something I did not think my brain could comprehend, holy shyet

  • @harurosech.4848
    @harurosech.4848 4 หลายเดือนก่อน

    I just used the voice of this video on my phone to configure my settings. Thanks

  • @Cheqipeqi
    @Cheqipeqi ปีที่แล้ว +4

    Where do I find the voices like Botan and Marine? Would love to get em! (aswell as ur settings with em)

    • @mecchamina
      @mecchamina ปีที่แล้ว +4

      Exactly what I was about to ask!

    • @Jarods_Journey
      @Jarods_Journey  ปีที่แล้ว +1

      Appreciate it guys, but I can't distribute the models unfortunately! However, I can share the knowledge required to train the models and I have those videos on my channel. I'm working to get it all a bit more organized, but you'll have to gather audio data on your own (thought there are plenty of tutorials on how to get audio data out there).

    • @Boredness90
      @Boredness90 ปีที่แล้ว +1

      @@Jarods_Journey if you cant distribute it why even make the video at all or even showcase it LMFAO

    • @Jarods_Journey
      @Jarods_Journey  ปีที่แล้ว +2

      @@Boredness90 It's educational content and falls under fair use. Distribution does not, falls under more murky waters.

  • @RubySapior
    @RubySapior ปีที่แล้ว +3

    Both Crepe and Harvest seem to be both cpu dependent.
    CPU runs at like 100% Ryzen 7 5700G
    While gpu is at like 38% gtx 1080 ti

    • @Jarods_Journey
      @Jarods_Journey  ปีที่แล้ว +1

      It's an odd thing, but CPU draw seems to go up when using it selected on my 2070 super I've noticed. Dunno why, but it doesn't show GPU usage. Might be something to raise to the author eventually though.

  • @bombadt-yt9818
    @bombadt-yt9818 ปีที่แล้ว +1

    Awesome, I'm sure this will be put to a very very very.. Very good use.

  • @Smith0j
    @Smith0j ปีที่แล้ว +1

    I've done everything but when I get 2:22 here the black menu doesn't show up

  • @qwerty9567
    @qwerty9567 ปีที่แล้ว +3

    For some reason my client doesn't have the "Select VC" button to select RVC. Does anyone know how to fix this? I can see the deafult models downloaded in the files but they don't appear on the client as RVC isn't selected. I've also realised that it doesnm't seem to be detecting my GPU as the CPU selection is the only on in the list

  • @nyanbrox5418
    @nyanbrox5418 ปีที่แล้ว +6

    one interesting thing is this could theoretically be combined with a translator first, though that would take a whole new, probably larger model,
    As hardware and software improves, this is just the beginning!

  • @Adorablybadmemes
    @Adorablybadmemes ปีที่แล้ว +3

    For some reason, whenever I try to use any of the voices there's a lot of background noise/static, that seems to be coming from nowhere.

    • @ItsMeCharkey
      @ItsMeCharkey ปีที่แล้ว +1

      Yeah I either hear nothing or just static for me as well

    • @CertifiedAsher
      @CertifiedAsher ปีที่แล้ว

      Dont use cpu, be sure to have good gpu : (

    • @Adorablybadmemes
      @Adorablybadmemes ปีที่แล้ว

      @@CertifiedAsher I'm using the GPU version, and my GPU should be plenty to process it at lower bitrates at least, but it always ends up sounding staticy.

    • @sxteya
      @sxteya 11 หลายเดือนก่อน

      ​@@CertifiedAshercope

  • @quelidle3772
    @quelidle3772 6 หลายเดือนก่อน +2

    having a problem, after starting start_http for the first time it said it failed because it could not find win.api or something like that. Now when i try to run start_http it opens for 1 second and immediately closes

  • @8teapi
    @8teapi 10 หลายเดือนก่อน

    This was well done.. I was able to install and try it out

  • @JDizon849
    @JDizon849 ปีที่แล้ว +4

    Any ideas on how to get this to output as a virtual microphone? This could be really fun in discord.

    • @Fs3i
      @Fs3i ปีที่แล้ว +1

      VB Cable / Virtual Audio Cable - should be easy to find with google

    • @ociones
      @ociones ปีที่แล้ว

      LOL

    • @Jarods_Journey
      @Jarods_Journey  ปีที่แล้ว +4

      :) th-cam.com/video/IS_SPQVv5iY/w-d-xo.html

    • @marufranco5281
      @marufranco5281 ปีที่แล้ว +1

      maybe set your recording device as Stereo Mixer , i don't know, it might work

  • @stnhndg
    @stnhndg ปีที่แล้ว +6

    Yep, Japanese voice models work better with Japanese. I mean... they were trained on Japanese speakers ))
    This is most noticable with consonants, since those are usually treated differently in those programms (e.g. unvoiced consonants don't have pitch). For example, I couldn't make an English model to pronunce 's' from my language, or any variations of it actually (regarding tongue position being more forward/backward). But with vowels it followed my speach pretty close even if those vowels were not typical for English phonetics... with some exceptions on high vowels (pretty decent though). Tough the last problem might be due to relative lack of palatalized consonants in English.

    • @wargreysama
      @wargreysama ปีที่แล้ว

      I tried speaking Turkish with it and it works just fine, probably due to the fact that Turkish is pretty similar to Japanese when it comes to pronounciations and stuff.

    • @stnhndg
      @stnhndg ปีที่แล้ว

      @@wargreysama To be honest Turkish is close to Japanese even at grammar (up to some degree) ))
      It works pretty decent with many languages. I was just curious about possible limitations and since I'm a bit into languages I tried to -make poor anime girl suffer- to play with different sounds non-typical for Japanese.
      As for now - my favorite thing is a word 'tractor'. Those voices make it more like 'toractor' which is adorable )

  • @kitsune-ame92
    @kitsune-ame92 ปีที่แล้ว +2

    I have a question, how to you get the trained voice material? I mean where you download the Marine's Voice(But I'm not finding Marine)

  • @LovelyNyx7
    @LovelyNyx7 ปีที่แล้ว +2

    Whenever I speak into it. I can hear the voice quite well the only issue is that after I stop speaking a second later it will play a very quiet voice of it back to me. It only does it with one voice tho so I'm assuming it's just something to do with that voice.

  • @onlydistant
    @onlydistant ปีที่แล้ว +3

    Is it possible to use this with audio files, in terms of converting the audio file to the respective voice?

    • @trent-po8qm
      @trent-po8qm ปีที่แล้ว

      yes, in the input, you can select file. for me it errored the 1st time but after reloading, it let me select a file from my computer as the input, and record the converted audio by clicking the record / save button on the bottom to get the converted output

  • @9a8szmf79g9
    @9a8szmf79g9 ปีที่แล้ว +20

    That certainly gives V-Tubers a break. I don't believe there's any way they could do the same thing everyday and not become even a little tired of it. I don't usually watch most of them exclusively, sometimes clips; but for example, I watched from the last 2.5 hours of Mumei's livestream karaoke and she was already tired and bored after the 1st hour when I joined their stream; not that I'd know what she's like but it definitely seemed like it was possible that it was someone else filling in for the night using such a voice changing program.

    • @Jarods_Journey
      @Jarods_Journey  ปีที่แล้ว +9

      Ah, it's not that good to be at that level yet, you can definitely tell when someone is trying to use an AI voice still but to this point, I think V-tubers are youtubers so even if you're an IRL streamer, it's not like you have someone else sub in for you when you're tired xD

  • @graysonnguyennzz2903
    @graysonnguyennzz2903 ปีที่แล้ว +3

    Hi Jarods, I have tried to download this voice changer multiple time. But everytime i click at the drive (normal) it is not working. It say that too many people downloading this file which lead to failure when downloading. Please help me of how to download this if this way is not working. Thank you,

  • @naifbashin3099
    @naifbashin3099 ปีที่แล้ว +1

    FINALLY I can do Jack Sparrow vs Barbosa Standoff in Red Dead Online

  • @Marin_Mewz
    @Marin_Mewz ปีที่แล้ว

    It's cool, I really admire someone like you❤

  • @ivaniousivanious6234
    @ivaniousivanious6234 ปีที่แล้ว +6

    Hey guys, I wonder, can you just use an audio input instead of real time voice so that it still mimics your intonations? Or maybe there is some other software that can help adjust intonations?

    • @Margen67
      @Margen67 ปีที่แล้ว

      Owls need HUGS

    • @IelmaoUfo-lp9bd
      @IelmaoUfo-lp9bd ปีที่แล้ว

      Use the standard rvc, so Vita inference in python or in a ui like rvc GUI.

    • @Glutzz
      @Glutzz 11 หลายเดือนก่อน

      can you explain that more @@IelmaoUfo-lp9bd

  • @patrickdailgarcia2500
    @patrickdailgarcia2500 ปีที่แล้ว +4

    Hey Jarods! A fellow Mechatronics Engineering Grad here, you make a lot of quality content and I wish to message you regarding some technicalities of AI voice cloning. And maybe some career advice for degree holders in Mech? haha
    Where can I reach you?

    • @Jarods_Journey
      @Jarods_Journey  ปีที่แล้ว +1

      Hey Patrick, always great to see a fellow mecha :D! I would say linkedin is going to be the best bet for professional stuff, if not, there, then the next best bet is discord as I'll generally respond on there. I do get a lot of PMs but it shouldn't be a problem if you pmed me from my group.

    • @patrickdailgarcia2500
      @patrickdailgarcia2500 ปีที่แล้ว

      Gotcha! How do I find you on Linked In btw hehehe

  • @nikosurfingYT
    @nikosurfingYT ปีที่แล้ว +4

    Wow thank you for making this tutorial. I'm wondering can I add more models? If so, where to find it?
    Subscribed!!!

    • @Jarods_Journey
      @Jarods_Journey  ปีที่แล้ว +2

      Appreciate it! I recommend you train models, but there's a discord group called AI Hub where you can go find some models

    • @nikosurfingYT
      @nikosurfingYT ปีที่แล้ว +1

      @@Jarods_Journey thanks, super excited about this

  • @bluepearl5312
    @bluepearl5312 ปีที่แล้ว

    Oh, my god, first time get to know this voice change programm, thank you!
    I'm so shock.😂 So.... i think most 95% of vtubers, both boy and girl use this, especial in japan company...

  • @Dennis-qh1sr
    @Dennis-qh1sr ปีที่แล้ว

    For those who have an AMD graphics card, when you set up the whole software and have a model selected, open your Task Manager and test the said model while switching from GPU1, GPU2 etc.. One of those is your graphics card, so it won't have to use your CPU. I have an RX6800XT and still couldn't find my GPU and it was lagging due to it using my CPU. Following the steps above will sort that out, at least it did for me. GPU1 for example had my CPU at 80%. GPU0 on the other hand had my CPU at 20%, which means that GPU0 is actually my RX6800XT.

  • @T4EKO
    @T4EKO ปีที่แล้ว +5

    Figured out the link, but do you have any advice for getting clearer audio? When I speak it chirps and distorts pretty constantly (sort of like when you lowered the chunk down really low, but I get that effect in all chunks)

    • @T4EKO
      @T4EKO ปีที่แล้ว

      this is both with custom pth files and the provides stock ones

    • @kennethnathantagalog5597
      @kennethnathantagalog5597 ปีที่แล้ว

      it has to be your gpu

    • @T4EKO
      @T4EKO ปีที่แล้ว

      @@kennethnathantagalog5597 its set to my GPU (seems to be putting the load on my CPU anyway)

    • @Jarods_Journey
      @Jarods_Journey  ปีที่แล้ว +2

      Might be hardware specs, I'll be going over this a little bit more in a vid

  • @kenstar222
    @kenstar222 ปีที่แล้ว +4

    This is truly an amazing find and piece of software, I would be very interested in messing around with it but unfortunately I have an AMD build and I cant find a way to use my 5700XT gpu to process the sounds, and it doesnt seem to be fairing well with my Ryzen 7 2700X cpu :( Any potential help would be greatly appreciated!

    • @GondoMan21
      @GondoMan21 ปีที่แล้ว +4

      i have the exact same build let me know if you find anything ahaha

    • @kenstar222
      @kenstar222 ปีที่แล้ว

      @@GondoMan21 will do, likewise -so far no luck but I will do some checking each day and come back with anything I learn

    • @Jarods_Journey
      @Jarods_Journey  ปีที่แล้ว

      Might have to adjust the settings in the client to try and help it out, but it doesn't run too well on CPU unfortunately. Did you download the directml version? That would should support AMD.

    • @Diamartin
      @Diamartin ปีที่แล้ว +1

      @@Jarods_Journey well, it doesn't

  • @titomo5854
    @titomo5854 9 หลายเดือนก่อน +4

    DUDE RTX 4090 🙂

  • @soul_anims7196
    @soul_anims7196 ปีที่แล้ว

    its sounds like a voice over love it

  • @aishams3
    @aishams3 ปีที่แล้ว

    Awesome!
    Is there a colab notebook for "realtime" voice changing? I saw repo of so-vits-svc-fork but this is not work for "realtime" voice changing.

  • @oscarreyes4511
    @oscarreyes4511 ปีที่แล้ว +7

    This is the main reason why I rejected my banks offer to secure my bank account using my voice over the phone! AI is freaking scary in the wrong hands!

    • @Jarods_Journey
      @Jarods_Journey  ปีที่แล้ว

      I have yet to try it on voice security systems... but that'll be an interesting topic to explore.

    • @oscarreyes4511
      @oscarreyes4511 ปีที่แล้ว

      @@Jarods_Journey You can use it to change your voice live and make a phonecall. That is how a Chinese investor got tricked and lost a ton of money. He thought he was talking to his business partner and sent him money for a business deal. The crook even facetimed the victim using a deepfake of the business partner face!

  • @csolisr
    @csolisr ปีที่แล้ว +14

    Welp, this is going to put old voice actors out of a business, but on the other hand it's also going to allow VAs to be easily replaced in case of illness, death or jail sentence (yes that last one has happened)

    • @billionaeris1183
      @billionaeris1183 ปีที่แล้ว +2

      AI will erase many jobs

    • @forest1605
      @forest1605 ปีที่แล้ว +5

      i mean real life people can still say a vowel for a long time without fail so

    • @muzz4355
      @muzz4355 ปีที่แล้ว +1

      they still need the datasets to train the ai with which will need VAs to make so they will still have jobs just making datasets rather than the exact lines

    • @csolisr
      @csolisr ปีที่แล้ว

      @@muzz4355 Which is why I specified *old* actors are out of a business - they have plenty of recorded voice to train their doppelgangers on. Newer actors are safer in virtue of having less data to train on.

    • @muzz4355
      @muzz4355 ปีที่แล้ว +1

      @@csolisr its less the VAs that are in danger but rather the specific characters they voice. a VA is always changing tone, accents etc between characters . Old actors will still be wanted to come in for new characters but less likely to return to their existing characters.

  • @jaymosupreme
    @jaymosupreme 9 หลายเดือนก่อน +2

    4:27 Sounds like an old lady who just finished giving a toothless deepthroat gum job to a BBC.

    • @1mortar1
      @1mortar1 9 หลายเดือนก่อน +2

      thats a little specific

    • @nnoossiirr
      @nnoossiirr 7 หลายเดือนก่อน +1

      what the fuck

  • @OneroomBeatz
    @OneroomBeatz ปีที่แล้ว +3

    Don't let these Nigerian dating scammers know about this

  • @defialy
    @defialy ปีที่แล้ว

    omg i loved that u used houshou marines voice LMAO

  • @SKYGGEMUSIC
    @SKYGGEMUSIC ปีที่แล้ว +1

    This looks great! I have issues connecting to the server. Is there any update on this?

  • @3eeway
    @3eeway ปีที่แล้ว +5

    RVC is amazing, but the latency is a huge problem

  • @Faze_booger
    @Faze_booger ปีที่แล้ว +1

    I’m having a problem when I try to use the voice changer when I talk I hear a static sound and when I get it to work sometimes it has a very long delay

  • @KyotosEnd
    @KyotosEnd 4 หลายเดือนก่อน +1

    why dont the characters pop up for me

  • @pepadrs
    @pepadrs ปีที่แล้ว

    love how it upgraded and now you can download it from huging face

  • @Leon_S._Kennedy
    @Leon_S._Kennedy ปีที่แล้ว +1

    I cant wait to try this on discord one day

  • @Droid3455
    @Droid3455 ปีที่แล้ว +2

    At this rate we'll never have to touch grass again

  • @NoobsPit
    @NoobsPit ปีที่แล้ว +3

    Everytime I try to launch the voice changer it shows this error and doesn't work Failed to load URL: localhost:18888/ with error: ERR_CONNECTION_REFUSED or when it does load I click on a voice and it says Cannot read properties of null (reading 'enableServerAudio')

  • @sharryboy88
    @sharryboy88 ปีที่แล้ว +1

    if i speak and hear it through my headphones i get an echo and the model says what i said many more times... creepy... how can i fix this???????

  • @whata7570
    @whata7570 ปีที่แล้ว

    This is pretty cool. Does it have means to setup training where you record yourself read a book and then it process that to make a model of you?

  • @Hell_0115
    @Hell_0115 ปีที่แล้ว +1

    Now i know how to prank my friends 😂 thanks man

  • @pamonhachanvr
    @pamonhachanvr ปีที่แล้ว

    Ty for your content, if you activate on NOISE: Echo, Sup1 and Sup2 the voice will be better, clean.

  • @Sal-zn2qu
    @Sal-zn2qu ปีที่แล้ว +2

    It was fun to mess around with this but it takes up LOT of disk space, good thing i made a restore point before trying this out, thanks for the tutorials Jarod, i learned more about coding than my 8th grade Computer teacher teaching me about web design XD

  • @TweetykachuDenzelAbaya
    @TweetykachuDenzelAbaya 3 หลายเดือนก่อน +1

    Banal na snap, ang dami ng kapangyarihan ng pagiging isang Xankfoland Old Man ay papalapit na!
    Isa pa, ang boses ng Marine na iyon kapag matatas siyang nagsasalita ng Filipino ay nakakatuwang isipin na may timeline kung saan natuto si Marine ng Filipino ng HYPER WELL.
    Sana may program na nagsasama-sama ng lahat ng masalimuot na pag-setup sa mas madaling paraan ng pag-set up dahil marunong ako sa xankfolandia ☢☢☢☢

  • @wandychandrawijaya5867
    @wandychandrawijaya5867 2 หลายเดือนก่อน +1

    Hello, i followed all the instructions but mine still doesnt work. Now i want to delete everything, do i just delete the file in file manager? How to uninstall the one downloaded in the cmd program?
    Sorry for my bad english, please let me know @anyone.

  • @Danzhu
    @Danzhu ปีที่แล้ว +1

    Until now can't use AMD GPU, I've followed the method * 2 still can't, it still takes the source from the processor, not from video graphics 😢

  • @Nekoderci
    @Nekoderci ปีที่แล้ว +1

    it works but kind of echos back after it speaks
    like if i say testing it sounds fine but then a second after it repeats it like quieter by itself
    how to fix this?

  • @justanoob6331
    @justanoob6331 ปีที่แล้ว

    Its amaizing that full english Marine sounds just like Amelía

  • @Spookydigy
    @Spookydigy ปีที่แล้ว

    I can't get the audio to sync up. I'm a VTuber and livestreamer and the vocal artifacts (extra weird voice sounds) are constant and there is 4 second delay.

  • @Spookydigy
    @Spookydigy 4 หลายเดือนก่อน +1

    I have a 4070 and can’t get close to 16 so I chose 60. It’s good for a while but then the voice glitches. What causes that