Creating Your Own Singing Voice Synthesizer: Overcoming Data Collection Challenges - Matthew Rice

แชร์
ฝัง
  • เผยแพร่เมื่อ 31 ก.ค. 2023
  • Join Us For ADC23 - London - 13-15 November 2023
    More Info: audio.dev/
    @audiodevcon
    Creating Your Own Singing Voice Synthesizer: Overcoming Data Collection Challenges - Matthew Rice - ADCx SF
    While singing voice synthesizers have existed for decades, recent deep-learning-based products (Sinsy, Vocaloid) have greatly improved the quality of the results. However, these systems provide only a limited number of pre-trained "voices" based on proprietary datasets. Luckily, open-source systems (NNSVS, OpenUtau, VISinger, DiffSinger) exist, allowing users to use custom datasets to create a singing voice synthesizer. Unfortunately, creating the necessary datasets is a time-consuming process that requires collecting phoneme-level timing and other data points. As a result, few public datasets are available, and those that do exist are mostly restricted to Mandarin Chinese and Japanese. In this talk, I will demonstrate several approaches to collecting this data, from manual labeling to fully automated procedures, making it easier for everyone to create their own personalized singing voice synthesizers.
    Slides: data.audio.dev/talks/ADCxSF/2...
    _
    Matthew Rice
    Matthew Rice is a master's student at Queen Mary University of London, studying Sound and Music Computing with a focus on music production applications of deep learning. Previously, Matthew was at startup Mayk as a software engineer, working on both the audio engine and audio research teams. Matthew also has experience in digital hardware and embedded systems, having worked at Qualcomm designing PMICs and audio codec drivers.
    Edited by Digital Medium Ltd - online.digital-medium.co.uk
    _
    Organized and produced by JUCE: juce.com/
    _
    Special thanks to the ADC Team:
    Sophie Carus
    Derek Heimlich
    Andrew Kirk
    Bobby Lombardi
    Tom Poole
    Ralph Richbourg
    Jim Roper
    Jonathan Roper
    #audiodevcon #audiodev #synthesizer
  • วิทยาศาสตร์และเทคโนโลยี

ความคิดเห็น • 3

  • @-H-i-e-r-o-n-y-m-e-
    @-H-i-e-r-o-n-y-m-e- หลายเดือนก่อน

    Awesome, thank you. 14:56 Black Mirror S05E03: Rachel, Jack and Ashley Too

  • @imsupposedto7465
    @imsupposedto7465 4 หลายเดือนก่อน +1

    I just stumbled across this! I do voice bank development for DiffSinger primarily but concat synth as well. It's really cool to see these engines mentioned!

  • @user-sh8kx8kl7c
    @user-sh8kx8kl7c 8 หลายเดือนก่อน +2

    such useful content ,and very little views ,how sad..
    thanks a lot