Demo pages of VC works.

Author: Wen-Chin Huang


(Interspeech 2019) Investigation of F0 conditioning and Fully Convolutional Networks in Variational Autoencoder based Voice Conversion
(SSW10 (2019)) Generalization of Spectrum Differential based Direct Waveform Modification for Voice Conversion
(TETCI (2019)) Unsupervised Representation Disentanglement using Cross Domain Features and Adversarial Learning in Variational Autoencoder based Voice Conversion
(Interspeech 2020) Voice Transformer Network: Sequence-to-Sequence Voice Conversion using Transformer with Text-to-Speech Pretraining
(IEEE/ACM TASLP (2021)) Pretraining Techniques for Sequence-to-Sequence Voice Conversion
(ICASSP 2021) Any-to-One Sequence-to-Sequence Voice Conversion using Self-Supervised Discrete Speech Representations
(Interspeech 2021) A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice Conversion
(ASRU 2021) On Prosody Modeling for ASR+TTS based Voice Conversion
(APSIPA 2021) Time Alignment using Lip Images for Frame-based Electrolaryngeal Voice Conversion
Page design from Google's Tacotron demo page.
Icons made by Freepik from is licensed by CC 3.0 BY