Issue Date | Title | Author(s) | Relation | Scopus | WOS | Fulltext/Archive link |
---|---|---|---|---|---|---|
2021 | A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice Conversion | Wen-Chin Huang; Kazuhiro Kobayashi; Yu-Huai Peng; Ching-Feng Liu; Yu Tsao; Hsin-Min Wang; Tomoki Toda | ||||
2020 | ASVspoof 2019: a large-scale public database of synthetized, converted and replayed speech | Xin Wang; Junichi Yamagishi; Massimiliano Todisco; Hector Delgado; Andreas Nautsch; Nicholas Evans; Md Sahidullah; Ville Vestman; Tomi Kinnunen; Kong Aik Lee; Lauri Juvela; Paavo Alku; Yu-Huai Peng; Hsin-Te Hwang; Yu Tsao; Hsin-Min Wang; Sebastien Le Maguer; Markus Becker; Fergus Henderson; Rob Clark; Yu Zhang; Quan Wang; Ye Jia; Kai Onuma; Koji Mushika; Takashi Kaneda; Yuan Jiang; Li-Juan Liu; Yi-Chiao Wu; Wen-Chin Huang; Tomoki Toda; Kou Tanaka; Hirokazu Kameoka; Ingmar Steiner; Driss Matrouf; Jean-Francois Bonastre; Avashna Govender; Srikanth Ronanki; Jing-Xuan Zhang; Zhen-Hua Ling | Computer Speech and Language 64, 101114 | |||
2019 | Generalization of Spectrum Differential based Direct Waveform Modification for Voice Conversion | Wen-Chin Huang; Yi-Chiao Wu; Kazuhiro Kobayashi; Yu-Huai Peng; Hsin-Te Hwang; Patrick Lumban Tobing; Yu Tsao; Hsin-Min Wang; Tomoki Toda | ||||
2021 | HASA-NET: A Non-Intrusive Hearing-Aid Speech Assessment Network | Hsin-Tien Chiang; Yi-Chiao Wu; Cheng Yu; Tomoki Toda; Hsin-Min Wang; Yih-Chun Hu; Yu Tsao | ||||
2019 | Investigation of F0 conditioning and Fully Convolutional Networks in Variational Autoencoder based Voice Conversion | Wen-Chin Huang; Yi-Chiao Wu; Chen-Chou Lo; Patrick Lumban Tobing; Tomoki Hayashi; Kazuhiro Kobayashi; Tomoki Toda; Yu Tsao; Hsin-Min Wang | ||||
2021 | Mandarin Electrolaryngeal Speech Voice Conversion with Sequence-to-Sequence Modeling | Ming-Chi Yen; Wen-Chin Huang; Kazuhiro Kobayashi; Yu-Huai Peng; Shu-Wei Tsai; Yu Tsao; Tomoki Toda; Jyh-Shing Jang; Hsin-Min Wang | ||||
2021 | Relational Data Selection for Data Augmentation of Speaker-dependent Multi-band MelGAN Vocoder | Yi-Chiao Wu; Cheng-Hung Hu; Hung-Shin Lee; Yu-Huai Peng; Wen-Chin Huang; Yu Tsao; Hsin-Min Wang; Tomoki Toda | ||||
2021 | Speech Recognition by Simply Fine-Tuning BERT | Wen-Chin Huang; Chia-Hua Wu; Shang-Bao Luo; Kuan-Yu Chen; Hsin-Min Wang; Tomoki Toda | ||||
2022 | The VoiceMOS Challenge 2022 | Wen-Chin Huang; Erica Cooper; Yu Tsao; Hsin-Min Wang; Tomoki Toda; Junichi Yamagishi | ||||
2023 | The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains | Erica Cooper; Wen-Chin Huang; Yu Tsao; Hsin-Min Wang; Tomoki Toda; Junichi Yamagishi | ||||
2021 | Time Alignment Using Lip Images for Frame-Based Electrolaryngeal Voice Conversion | Yi-Syuan Liou; Wen-Chin Huang; Ming-Chi Yen; Shu-Wei Tsai; Yu-Huai Peng; Tomoki Toda; Yu Tsao; Hsin-Min Wang | ||||