Wen-Chin Huang 黃文勁

I am currently a Ph.D. candidate supervised by Prof. Tomoki Toda in Toda Laboratory at the Graduate School of Informatics, Nagoya University. I received M.S. in the Graduate School of Informatics, Nagoya University at March 2021. Prior to studying at N.U., I received B.S. in Computer Science and Information Engineering from National Taiwan University in June 2018.

I started my internship at Facebook Reality Labs (FRL) Research from August 2021. I work closely with the Institute of Information Science in Academia Sinica, Taipei, Taiwan with advisor Prof. Hsin-Min Wang, since I was a research assistant from July 2017 to March 2019. Form August 2019 to September 2019, I interned at the NTT Communication Science Laboratories, NTT Corporation under the supervision of Prof. Hirokazu Kameoka.

I received the Research Fellowship for Young Scientists (DC1) from Japan Society for the Promotion of Science (JSPS), which will last from April 2021 to March 2024. I co-organized the Voice Conversion Challenge 2020. I was honored the Best Student Paper Award in the 11th International Symposium on Chinese Spoken Language Processing (ISCSLP), 2018. I am also a reviewer of several journals including IEEE SPL, IEEE/ACM TASLP, etc.

My research interests include speech processing and machine learning. In particular, I am currently working on voice conversion using deep neural network based models.

I also love street dancing (locking). My team participated in a national dance contest. Check out this video, this video, this video, this video, and this video.

My CV can be downloaded here.

Photo taken at Toyota-Shi, Japan, Nov. 2019.

news

Sep, 2021 One first-author paper [Prosody for ASR+TTS VC] was accepted to ASRU 2021. Also, one paper I co-authored [ELVC w/ Seq2seq] was accepted.
Sep, 2021 Three co-author papers were accepted to APSIPA ASC 2021. [ELVC w/ lip] [Noisy-to-noisy VC] [Investigation of non-parallel seq2seq VC w/ synthetic data]
Aug, 2021 I started my internship at Facebook Reality Labs Research.
Jul, 2021 You can read some posts I wrote in the blog page, as long as you understand Mandarin Chinese.
Jun, 2021 One first-author paper [Dysarthric VC w/ VTN+VAE] was accepted to Interspeech 2021. Also, one paper I co-authored [Relational data selection] was accepted.
Feb, 2021 I successfully defensed my master’s thesis. Also, I successfully passed the Ph.D. entrance exam, and will become a Ph.D. candidate at the Graduate School of Informatics, Nagoya University.
Jan, 2021 One paper [EMA2S] was accepted to IEEE International Symposium on Circuits and Systems (ISCAS) 2021.
Jan, 2021 Two first-author papers [VQVAE-VC] [BERT-ASR] were accepted to ICASSP 2021. Also, two papers I co-authored [crank] [NonAR seq2seq VC] were also accepted.
Jan, 2021 One journal was accepted to the IEEE/ACM Transactions on Audio, Speech, and Language Processing. The early access version is available now on IEEE Xplore. There is also an arXiv version.
Oct, 2020 Four papers are accepted to the Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020! [Challenge Summary] [Objective Assesement] [Baseline ASR+TTS] [NU entry]
Oct, 2020 The proceeding of the Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020 is online now!
Jul, 2020 The implementation of VTN is open-sourced on ESPnet.
Jul, 2020 One paper [VTN] was accepted to Interspeech 2020.
May, 2020 One journal paper [ASVspoof 2019 database] was accepted to the Computer Speech & Language.
Mar, 2020 I am co-organizing the Voice Conversion Challenge 2020. I developed a seq-to-seq baseline w/ ESPnet.
Jan, 2020 One journal paper [CDVAE-CLS-GAN] was accepted to the IEEE Transactions on Emerging Topics in Computational Intelligence.