Issue Date | Title | Author(s) | Relation | Scopus | WOS | Fulltext/Archive link |
---|---|---|---|---|---|---|
2021 | Learning To Visualize Music Through Shot Sequence For Automatic Concert Video Mashup | Wen-Li Wei; Jen-Chun Lin; Tyng-Luh Liu; Hsiao-Rong Tyan; Hsin-Min Wang; Hong-Yuan Mark Liao | IEEE Transactions on Multimedia 23, 1731-1743 | |||
2022 | Lip Sync Matters: A Novel Multimodal Forgery Detector | Sahibzada Adil Shahzad; Ammarah Hashmi; Sarwar Khan; Yan-Tsung Peng; Yu Tsao; Hsin-Min Wang | ||||
2020 | Lite Audio-Visual Speech Enhancement | Shang-Yi Chuang; Yu Tsao; Chen-Chou Lo; Hsin-Min Wang | ||||
2016 | Locally Linear Embedding for Exemplar-Based Spectral Conversion | Yi-Chiao Wu; Hsin-Te Hwang; Chin-Cheng Hsu; Yu Tsao; Hsin-Min Wang | ||||
2021 | Mandarin Electrolaryngeal Speech Voice Conversion with Sequence-to-Sequence Modeling | Ming-Chi Yen; Wen-Chin Huang; Kazuhiro Kobayashi; Yu-Huai Peng; Shu-Wei Tsai; Yu Tsao; Tomoki Toda; Jyh-Shing Jang; Hsin-Min Wang | ||||
2019 | Mandarin Mispronunciation Detection and Diagnosis Feedback Using Articulatory Attributes Based Multi-task Learning | Xuan-Bo Chen; Yueh-Ting Lee; Hung-Shin Lee; Jyh-Shing Roger Jang; Hsin-Min Wang | ||||
2022 | Mandarin Singing Voice Synthesis with Denoising Diffusion Probabilistic Wasserstein GAN | Yin-Ping Cho; Yu Tsao; Hsin-Min Wang; Yi-Wen Liu | ||||
2022 | MBI-Net: A Non-Intrusive Multi-Branched Speech Intelligibility Prediction Model for Hearing Aids | Ryandhimas Edo Zezario; Fei Chen; Chiou-Shann Fuh; Hsin-Min Wang; Yu Tsao | ||||
2021 | Melody Harmonization Using Orderless NADE, Chord Balancing, and Blocked Gibbs Sampling | Chung-En Sun; Yi-Wei Chen; Hung-Shin Lee; Yen-Hsing Chen; Hsin-Min Wang | ||||
2016 | Minimization of Regression and Ranking Losses with Shallow Neural Networks on Automatic Sincerity Evaluation | Hung-Shin Lee; Yu Tsao; Chi-Chun Lee; Hsin-Min Wang; Wei-Cheng Lin; Wei-Chen Chen; Shan-Wen Hsiao; Shyh-Kang Jeng | ||||
2021 | Mining Commonsense and Domain Knowledge from Math Word Problems | Shih-Hung Tsai; Chao-Chun Liang; Hsin-Min Wang; Keh-Yih Su | ||||
2021 | MoEVC: A Mixture of Experts Voice Conversion System With Sparse Gating Mechanism for Online Computation Acceleration | Y.-T. Chang; Y.-H. Yang; Y.-H. Peng; S.-S. Wang; T.-S. Chi; Y. Tsao; H.-M. Wang | ||||
2019 | MOSNet: Deep Learning based Objective Assessment for Voice Conversion | Chen-Chou Lo; Szu-Wei Fu; Wen-Chin Huang; Xin Wang; Junichi Yamagishi; Yu Tsao; Hsin-Min Wang | ||||
2022 | MTI-Net: A Multi-Target Speech Intelligibility Prediction Model | Ryandhimas Edo Zezario; Szu-wei Fu; Fei Chen; Chiou-Shann Fuh; Hsin-Min Wang; Yu Tsao | ||||
2019 | Multi-task Learning for Mandarin Acoustic Modeling Using Articulatory Attributes | Yueh-Ting Lee; Xuan-Bo Chen; Hung-Shin Lee; Jyh-Shing Roger Jang; Hsin-Min Wang | ||||
2020 | Multichannel Speech Enhancement by Raw Waveform-mapping using Fully Convolutional Networks | Chang-Le Liu; Sze-Wei Fu; You-Jin Li; Jen-Wei Huang; Hsin-Min Wang; Yu Tsao | IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 1888-1900 | |||
2022 | Multimodal Forgery Detection Using Ensemble Learning | Ammarah Hashmi; Sahibzada Adil Shahzad; Chia-Wen Lin; Yu Tsao; Hsin-Min Wang | ||||
2019 | Noise Adaptive Speech Enhancement using Domain Adversarial Training | Chien-Feng Liao; Yu Tsao; Hung-Yi Lee; Hsin-Min Wang | ||||
2022 | Partially Fake Audio Detection by Self-attention-based Fake Span Discovery | Haibin Wu; Heng-Cheng Kuo; Naijun Zheng; Kuo-Hsuan Hung; Hung-yi Lee; Yu Tsao; Hsin-Min Wang; Helen Meng | ||||
2019 | Reinforcement Learning based Speech Enhancement for Robust Speech Recognition | Yih-Liang Shen; Chao-Yuan Huang; Syu-Siang Wang; Yu Tsao; Hsin-Min Wang; Tai-Shih Chi |