I am currently a Ph.D. candidate supervised by Prof. Tomoki Toda in Toda Laboratory at the Graduate School of Informatics, Nagoya University. I received my M.S. from the Graduate School of Informatics, Nagoya University, in March 2021. Prior to studying at N.U., I received my B.S. in Computer Science and Information Engineering from National Taiwan University in June 2018.
I started my internship at Facebook Reality Labs (FRL) Research in August 2021. I have worked closely with Prof. Hsin-Min Wang at the Institute of Information Science, Academia Sinica, Taipei, Taiwan, since my time as a research assistant from July 2017 to March 2019. From August 2019 to September 2019, I interned at the NTT Communication Science Laboratories, NTT Corporation, under the supervision of Prof. Hirokazu Kameoka.
I received the Research Fellowship for Young Scientists (DC1) from the Japan Society for the Promotion of Science (JSPS), which will last from April 2021 to March 2024. I co-organized the Voice Conversion Challenge 2020. I received the Best Student Paper Award at the 11th International Symposium on Chinese Spoken Language Processing (ISCSLP), 2018. I also review for several journals, including IEEE SPL and IEEE/ACM TASLP.
My research interests include speech processing and machine learning. In particular, I am currently working on voice conversion using models based on deep neural networks.
My CV can be downloaded here.
|Sep, 2021||One first-author paper [Prosody for ASR+TTS VC] was accepted to ASRU 2021. Also, one paper I co-authored [ELVC w/ Seq2seq] was accepted.|
|Sep, 2021||Three papers I co-authored were accepted to APSIPA ASC 2021. [ELVC w/ lip] [Noisy-to-noisy VC] [Investigation of non-parallel seq2seq VC w/ synthetic data]|
|Aug, 2021||I started my internship at Facebook Reality Labs Research.|
|Jul, 2021||You can read some posts I wrote on the blog page, provided you can read Mandarin Chinese.|
|Jun, 2021||One first-author paper [Dysarthric VC w/ VTN+VAE] was accepted to Interspeech 2021. Also, one paper I co-authored [Relational data selection] was accepted.|
|Feb, 2021||I successfully defended my master’s thesis. I also passed the Ph.D. entrance exam and will become a Ph.D. candidate at the Graduate School of Informatics, Nagoya University.|
|Jan, 2021||One paper [EMA2S] was accepted to the IEEE International Symposium on Circuits and Systems (ISCAS) 2021.|
|Jan, 2021||Two first-author papers [VQVAE-VC] [BERT-ASR] were accepted to ICASSP 2021. Two papers I co-authored [crank] [NonAR seq2seq VC] were also accepted.|
|Jan, 2021||One journal paper was accepted to the IEEE/ACM Transactions on Audio, Speech, and Language Processing. The early access version is now available on IEEE Xplore. There is also an arXiv version.|
|Oct, 2020||Four papers were accepted to the Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020! [Challenge Summary] [Objective Assessment] [Baseline ASR+TTS] [NU entry]|
|Oct, 2020||The proceedings of the Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020 are online now!|
|Jul, 2020||The implementation of VTN is open-sourced on ESPnet.|
|Jul, 2020||One paper [VTN] was accepted to Interspeech 2020.|
|May, 2020||One journal paper [ASVspoof 2019 database] was accepted to Computer Speech & Language.|
|Mar, 2020||I am co-organizing the Voice Conversion Challenge 2020. I developed a seq-to-seq baseline w/ ESPnet.|
|Jan, 2020||One journal paper [CDVAE-CLS-GAN] was accepted to the IEEE Transactions on Emerging Topics in Computational Intelligence.|