Issue Date | Title | Author(s) | Relation | scopus | WOS | Fulltext/Archive link |
2019 | Reinforcement Learning based Speech Enhancement for Robust Speech Recognition | Yih-Liang Shen; Chao-Yuan Huang; Syu-Siang Wang; Yu Tsao; Hsin-Min Wang ; Tai-Shih Chi | | | | |
2021 | Relational Data Selection for Data Augmentation of Speaker-dependent Multi-band MelGAN Vocoder | Yi-Chiao Wu; Cheng-Hung Hu; Hung-Shin Lee; Yu-Huai Peng; Wen-Chin Huang; Yu Tsao ; Hsin-Min Wang ; Tomoki Toda | | | | |
2018 | SeeTheVoice: Learning from Music to Visual Storytelling of Shots | Wen-Li Wei; Jen-Chun Lin; Tyng-Luh Liu ; Yi-Hsuan Yang ; Hsin-Min Wang ; Hsiao-Rong Tyan; Hong-Yuan Mark Liao | | | | |
2020 | Self-Supervised Denoising Autoencoder with Linear Regression Decoder for Speech Enhancement | Ryandhimas Edo Zezario; Tassadaq Hussain; Xugang Lu; Hsin-Min Wang ; Yu Tsao | | | | |
2021 | Sequence to General Tree: Knowledge-Guided Geometry Word Problem Solving | Shih-hung Tsai; Chao-Chun Liang; Hsin-Min Wang ; Keh-Yih Su | | | | |
2019 | Sequential Speaker Embedding and Transfer Learning for Text-Independent Speaker Identification | Qian-Bei Hong; Chung-Hsien Wu; Ming-Hsiang Su; Hsin-Min Wang | | | | |
2020 | SERIL: Noise Adaptive Speech Enhancement using Regularization-based Incremental Learning | Chi-Chang Lee; Yu-Chen Lin; Hsuan-Tien Lin; Hsin-Min Wang ; Yu Tsao | | | | |
2019 | Specialized Speech Enhancement Model Selection Based on Learned Non-Intrusive Quality Assessment Metric | Ryandhimas Edo Zezario; Szu-wei Fu; Xugang Lu; Hsin-Min Wang ; Yu Tsao | | | | |
2020 | Speech Enhancement based on Denoising Autoencoder with Multi-branched Encoders | Cheng Yu; Ryandhimas E. Zezario; Syu-Siang Wang; Jonathan Sherman; Yi-Yen Hsieh; Xugang Lu; Hsin-Min Wang ; Yu Tsao | IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 2756-2769 | | | |
2021 | Speech Enhancement with Zero-Shot Model Selection | Ryandhimas E. Zezario; Chiou-Shann Fuh; Hsin-Min Wang ; Yu Tsao | | | | |
2022 | Speech Enhancement-Assisted Voice Conversion in Noisy Environments | Yun-Ju Chan; Chiang-Jen Peng; Syu-Siang Wang; Hsin-Min Wang ; Yu Tsao; Tai-Shih Chi | | | | |
2021 | Speech Recognition by Simply Fine-Tuning BERT | Wen-Chin Huang; Chia-Hua Wu; Shang-Bao Luo; Kuan-Yu Chen; Hsin-Min Wang ; Tomoki Toda | | | | |
2022 | Speech-enhanced and Noise-aware Networks for Robust Speech Recognition | Hung-Shin Lee; Pin-Yuan Chen; Yao-Fei Cheng; Yu Tsao ; Hsin-Min Wang | | | | |
2019 | Spoken Multiple-Choice Question Answering Using Multimodal Convolutional Neural Networks | Shang-Bao Luo; Hung-Shin Lee; Kuan-Yu Chen; Hsin-Min Wang | | | | |
2020 | Statistics Pooling Time Delay Neural Network Based on X-Vector for Speaker Verification | Qian-Bei Hong; Chung-Hsien Wu; Hsin-Min Wang ; Chien-Lin Huang | | | | |
2020 | STOI-Net: A Deep Learning based Non-Intrusive Speech Intelligibility Assessment Model | Ryandhimas E. Zezario; Szu-Wei Fu; Chiou-Shann Fuh; Yu Tsao; Hsin-Min Wang | | | | |
2020 | Subspace-based Representation and Learning for Phonotactic Spoken Language Recognition | Lee, Hung-Shin; Tsao, Yu ; Jeng, Shyh-Kang; Wang, Hsin-Min | IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 3065-3079 | | | |
2021 | SurpriseNet: Melody Harmonization Conditioning on User-controlled Surprise Contours | Yi-Wei Chen; Hung-Shin Lee; Yen-Hsing Chen; Hsin-Min Wang | | | | |
2022 | SVSNet: An End-to-end Speaker Voice Similarity Assessment Model | Cheng-Hung Hu; Yu-Huai Peng; Junichi Yamagishi; Yu Tsao ; Hsin-Min Wang | IEEE Signal Processing Letters 29, 767-771 | | | |
2020 | The Academia Sinica Systems of Speech Recognition and Speaker Diarization for the CHiME-6 Challenge | Hung-Shin Lee; Yu-Huai Peng; Pin-Tuan Huang; Ying-Chun Tseng; Chia-Hua Wu; Yu Tsao; Hsin-Min Wang | | | | |