Issue Date | Title | Author(s) | Relation | scopus | WOS | Fulltext/Archive link |
---|---|---|---|---|---|---|
2021 | MoEVC: A Mixture of Experts Voice Conversion System With Sparse Gating Mechanism for Online Computation Acceleration | Y.-T. Chang; Y.-H. Yang; Y.-H. Peng; S.-S. Wang; T.-S. Chi; Y. Tsao ; H.-M. Wang | ||||
2019 | MOSNet: Deep Learning based Objective Assessment for Voice Conversion | Chen-Chou Lo; Szu-Wei Fu; Wen-Chin Huang; Xin Wang; Junichi Yamagishi; Yu Tsao; Hsin-Min Wang | ||||
2022 | MTI-Net: A Multi-Target Speech Intelligibility Prediction Model | Ryandhimas Edo Zezario; Szu-wei Fu; Fei Chen; Chiou-Shann Fuh; Hsin-Min Wang ; Yu Tsao | ||||
2019 | Multi-task Learning for Mandarin Acoustic Modeling Using Articulatory Attributes | Yueh-Ting Lee; Xuan-Bo Chen; Hung-Shin Lee; Jyh-Shing Roger Jang; Hsin-Min Wang | ||||
2020 | Multichannel Speech Enhancement by Raw Waveform-mapping using Fully Convolutional Networks | Chang-Le Liu; Sze-Wei Fu; You-Jin Li; Jen-Wei Huang; Hsin-Min Wang ; Yu Tsao | IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 1888-1900 | |||
2022 | Multimodal Forgery Detection Using Ensemble Learning | Ammarah Hashmi; Sahibzada Adil Shahzad; Chia-Wen Lin; Yu Tsao; Hsin-Min Wang | ||||
2019 | Noise Adaptive Speech Enhancement using Domain Adversarial Training | Chien-Feng Liao; Yu Tsao; Hung-Yi Lee; Hsin-Min Wang | ||||
2022 | Partially Fake Audio Detection by Self-attention-based Fake Span Discovery | Haibin Wu; Heng-Cheng Kuo; Naijun Zheng; Kuo-Hsuan Hung; Hung-yi Lee; Yu Tsao ; Hsin-Min Wang ; Helen Meng | ||||
2019 | Reinforcement Learning based Speech Enhancement for Robust Speech Recognition | Yih-Liang Shen; Chao-Yuan Huang; Syu-Siang Wang; Yu Tsao; Hsin-Min Wang ; Tai-Shih Chi | ||||
2021 | Relational Data Selection for Data Augmentation of Speaker-dependent Multi-band MelGAN Vocoder | Yi-Chiao Wu; Cheng-Hung Hu; Hung-Shin Lee; Yu-Huai Peng; Wen-Chin Huang; Yu Tsao ; Hsin-Min Wang ; Tomoki Toda | ||||
2018 | SeeTheVoice: Learning from Music to Visual Storytelling of Shots | Wen-Li Wei; Jen-Chun Lin; Tyng-Luh Liu ; Yi-Hsuan Yang ; Hsin-Min Wang ; Hsiao-Rong Tyan; Hong-Yuan Mark Liao | ||||
2020 | Self-Supervised Denoising Autoencoder with Linear Regression Decoder for Speech Enhancement | Ryandhimas Edo Zezario; Tassadaq Hussain; Xugang Lu; Hsin-Min Wang ; Yu Tsao | ||||
2021 | Sequence to General Tree: Knowledge-Guided Geometry Word Problem Solving | Shih-hung Tsai; Chao-Chun Liang; Hsin-Min Wang ; Keh-Yih Su | ||||
2019 | Sequential Speaker Embedding and Transfer Learning for Text-Independent Speaker Identification | Qian-Bei Hong; Chung-Hsien Wu; Ming-Hsiang Su; Hsin-Min Wang | ||||
2020 | SERIL: Noise Adaptive Speech Enhancement using Regularization-based Incremental Learning | Chi-Chang Lee; Yu-Chen Lin; Hsuan-Tien Lin; Hsin-Min Wang ; Yu Tsao | ||||
2019 | Specialized Speech Enhancement Model Selection Based on Learned Non-Intrusive Quality Assessment Metric | Ryandhimas Edo Zezario; Szu-wei Fu; Xugang Lu; Hsin-Min Wang ; Yu Tsao | ||||
2020 | Speech Enhancement based on Denoising Autoencoder with Multi-branched Encoders | Cheng Yu; Ryandhimas E. Zezario; Syu-Siang Wang; Jonathan Sherman; Yi-Yen Hsieh; Xugang Lu; Hsin-Min Wang ; Yu Tsao | IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 2756-2769 | |||
2021 | Speech Enhancement with Zero-Shot Model Selection | Ryandhimas E. Zezario; Chiou-Shann Fuh; Hsin-Min Wang ; Yu Tsao | ||||
2022 | Speech Enhancement-Assisted Voice Conversion in Noisy Environments | Yun-Ju Chan; Chiang-Jen Peng; Syu-Siang Wang; Hsin-Min Wang ; Yu Tsao; Tai-Shih Chi | ||||
2021 | Speech Recognition by Simply Fine-Tuning BERT | Wen-Chin Huang; Chia-Hua Wu; Shang-Bao Luo; Kuan-Yu Chen; Hsin-Min Wang ; Tomoki Toda |