Issue Date | Title | Author(s) | Relation | Scopus | WOS | Fulltext/Archive link |
---|---|---|---|---|---|---|
2021 | A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice Conversion | Wen-Chin Huang; Kazuhiro Kobayashi; Yu-Huai Peng; Ching-Feng Liu; Yu Tsao; Hsin-Min Wang; Tomoki Toda | ||||
2020 | ASVspoof 2019: a large-scale public database of synthesized, converted and replayed speech | Xin Wang; Junichi Yamagishi; Massimiliano Todisco; Hector Delgado; Andreas Nautsch; Nicholas Evans; Md Sahidullah; Ville Vestman; Tomi Kinnunen; Kong Aik Lee; Lauri Juvela; Paavo Alku; Yu-Huai Peng; Hsin-Te Hwang; Yu Tsao; Hsin-Min Wang; Sebastien Le Maguer; Markus Becker; Fergus Henderson; Rob Clark; Yu Zhang; Quan Wang; Ye Jia; Kai Onuma; Koji Mushika; Takashi Kaneda; Yuan Jiang; Li-Juan Liu; Yi-Chiao Wu; Wen-Chin Huang; Tomoki Toda; Kou Tanaka; Hirokazu Kameoka; Ingmar Steiner; Driss Matrouf; Jean-Francois Bonastre; Avashna Govender; Srikanth Ronanki; Jing-Xuan Zhang; Zhen-Hua Ling | Computer Speech and Language 64, 101114 | |||
2019 | Generalization of Spectrum Differential based Direct Waveform Modification for Voice Conversion | Wen-Chin Huang; Yi-Chiao Wu; Kazuhiro Kobayashi; Yu-Huai Peng; Hsin-Te Hwang; Patrick Lumban Tobing; Yu Tsao; Hsin-Min Wang; Tomoki Toda | ||||
2019 | Investigation of F0 conditioning and Fully Convolutional Networks in Variational Autoencoder based Voice Conversion | Wen-Chin Huang; Yi-Chiao Wu; Chen-Chou Lo; Patrick Lumban Tobing; Tomoki Hayashi; Kazuhiro Kobayashi; Tomoki Toda; Yu Tsao; Hsin-Min Wang | ||||
2021 | Mandarin Electrolaryngeal Speech Voice Conversion with Sequence-to-Sequence Modeling | Ming-Chi Yen; Wen-Chin Huang; Kazuhiro Kobayashi; Yu-Huai Peng; Shu-Wei Tsai; Yu Tsao; Tomoki Toda; Jyh-Shing Jang; Hsin-Min Wang | ||||
2019 | MOSNet: Deep Learning based Objective Assessment for Voice Conversion | Chen-Chou Lo; Szu-Wei Fu; Wen-Chin Huang; Xin Wang; Junichi Yamagishi; Yu Tsao; Hsin-Min Wang | ||||
2021 | Relational Data Selection for Data Augmentation of Speaker-dependent Multi-band MelGAN Vocoder | Yi-Chiao Wu; Cheng-Hung Hu; Hung-Shin Lee; Yu-Huai Peng; Wen-Chin Huang; Yu Tsao; Hsin-Min Wang; Tomoki Toda | ||||
2021 | Speech Recognition by Simply Fine-Tuning BERT | Wen-Chin Huang; Chia-Hua Wu; Shang-Bao Luo; Kuan-Yu Chen; Hsin-Min Wang; Tomoki Toda | ||||
2021 | Time Alignment Using Lip Images for Frame-Based Electrolaryngeal Voice Conversion | Yi-Syuan Liou; Wen-Chin Huang; Ming-Chi Yen; Shu-Wei Tsai; Yu-Huai Peng; Tomoki Toda; Yu Tsao; Hsin-Min Wang | ||||
2018 | Voice Conversion Based on Cross-Domain Features Using Variational Auto Encoders | Wen-Chin Huang; Hsin-Te Hwang; Yu-Huai Peng; Yu Tsao; Hsin-Min Wang | ||||
2018 | WaveNet Vocoder and its Applications in Voice Conversion | Wen-Chin Huang; Chen-Chou Lo; Hsin-Te Hwang; Yu Tsao; Hsin-Min Wang | ||||