Issue Date | Title | Author(s) | Relation | scopus | WOS | Fulltext/Archive link |
2022 | Filter-based Discriminative Autoencoders for Children Speech Recognition | Chiang-Lin Tai; Hung-Shin Lee; Yu Tsao ; Hsin-Min Wang | | | | |
2023 | Generalization Ability Improvement of Speaker Representation and Anti-Interference for Speaker Verification | Qian-Bei Hong; Chung-Hsien Wu; Hsin-Min Wang | IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 486-499 | | | |
2019 | Generalization of Spectrum Differential based Direct Waveform Modification for Voice Conversion | Wen-Chin Huang; Yi-Chiao Wu; Kazuhiro Kobayashi; Yu-Huai Peng; Hsin-Te Hwang; Patrick Lumban Tobing; Yu Tsao; Hsin-Min Wang ; Tomoki Toda | | | | |
2021 | Generation of Speaker Representations Using Heterogeneous Training Batch Assembly | Yu-Huai Peng; Hung-Shin Lee; Pin-Tuan Huang; Hsin-Min Wang | | | | |
2021 | HASA-NET: A Non-Intrusive Hearing-Aid Speech Assessment Network | Hsin-Tien Chiang; Yi-Chiao Wu; Cheng Yu; Tomoki Toda; Hsin-Min Wang ; Yih-Chun Hu; Yu Tsao | | | | |
2022 | Improved Lite Audio-Visual Speech Enhancement | Shang-Yi Chuang; Hsin-Min Wang ; Yu Tsao | IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 1345-1359 | | | |
2021 | Improvement of Spatial Ambiguity in Multi-Channel Speech Separation Using Channel Attention | Qian-Bei Hong; Chung-Hsien Wu; Thanh Binh Nguyen; Hsin-Min Wang | | | | |
2019 | Improving Automatic Jazz Melody Generation by Transfer Learning Techniques | Hsiao-Tzu Hung; Chung-Yang Wang; Yi-Hsuan Yang ; Hsin-Min Wang | | | | |
2020 | Improving the Intelligibility of Speech for Simulated Electric and Acoustic Stimulation Using Fully Convolutional Neural Networks | Wang, Natalie Yu-Hsien; Wang, Hsiao-Lan Sharon; Wang, Tao-Wei; Fu, Szu-Wei; Lu, Xugan; Wang, Hsin-Min ; Tsao, Yu | IEEE Transactions on Neural Systems and Rehabilitation Engineering 29, 184-195 | | | |
2019 | Influences of Prosodic Feature Replacement on the Perceived Singing Voice Identity | Kuan-Yi Kang; Yi-Wen Liu; Hsin-Min Wang | | | | |
2021 | Investigation of a Single-Channel Frequency-Domain Speech Enhancement Network to Improve End-to-End Bengali Automatic Speech Recognition Under Unseen Noisy Conditions | Md Mahbub E Noor|; Yen-Ju Lu; Syu-Siang Wang; Supratip Ghose; Chia-Yu Chang; Ryandhimas E. Zezario; Shafique Ahmed; Wei-Ho Chung; Yu Tsao ; Hsin-Min Wang | | | | |
2019 | Investigation of F0 conditioning and Fully Convolutional Networks in Variational Autoencoder based Voice Conversion | Wen-Chin Huang; Yi-Chiao Wu; Chen-Chou Lo; Patrick Lumban Tobing; Tomoki Hayashi; Kazuhiro Kobayashi; Tomoki Toda; Yu Tsao; Hsin-Min Wang | | | | |
2019 | Investigation of Neural Network Approaches for Unified Spectral and Prosodic Feature Enhancement | Wei-Cheng Lin; Yu Tsao; Hsin-Min Wang ; Fei Chen | | | | |
2020 | Joint Training of Guided Learning and Mean Teacher Models for Sound Event Detection | Hao Yen; Pin-Jui Ku; Ming-Chi Yen; Hung-Shin Lee; Hsin-Min Wang | | | | |
2021 | Learning To Visualize Music Through Shot Sequence For Automatic Concert Video Mashup | Wen-Li Wei; Jen-Chun Lin; Tyng-Luh Liu ; Hsiao-Rong Tyan; Hsin-Min Wang ; Hong-Yuan Mark Liao | IEEE Transactions on Multimedia 23, 1731-1743 | | | |
2022 | Lip Sync Matters: A Novel Multimodal Forgery Detector | Sahibzada Adil Shahzad; Ammarah Hashmi; Sarwar Khan; Yan-Tsung Peng; Yu Tsao; Hsin-Min Wang | | | | |
2020 | Lite Audio-Visual Speech Enhancement | Shang-Yi Chuang; Yu Tsao; Chen-Chou Lo; Hsin-Min Wang | | | | |
2016 | Locally Linear Embedding for Exemplar-Based Spectral Conversion | Yi-Chiao Wu; Hsin-Te Hwang; Chin-Cheng Hsu; Yu Tsao ; Hsin-Min Wang | | | | |
2021 | Mandarin Electrolaryngeal Speech Voice Conversion with Sequence-to-Sequence Modeling | Ming-Chi Yen; Wen-Chin Huang; Kazuhiro Kobayashi; Yu-Huai Peng; Shu-Wei Tsai; Yu Tsao ; Tomoki Toda; Jyh-Shing Jang; Hsin-Min Wang | | | | |
2019 | Mandarin Mispronunciation Detection and Diagnosis Feedback Using Articulatory Attributes Based Multi-task Learning | Xuan-Bo Chen; Yueh-Ting Lee; Hung-Shin Lee; Jyh-Shing Roger Jang; Hsin-Min Wang | | | | |