Issue Date | Title | Author(s) | Relation | Scopus | WOS | Fulltext/Archive link |
---|---|---|---|---|---|---|
2019 | Generalization of Spectrum Differential based Direct Waveform Modification for Voice Conversion | Wen-Chin Huang; Yi-Chiao Wu; Kazuhiro Kobayashi; Yu-Huai Peng; Hsin-Te Hwang; Patrick Lumban Tobing; Yu Tsao; Hsin-Min Wang; Tomoki Toda | ||||
2019 | Investigation of F0 conditioning and Fully Convolutional Networks in Variational Autoencoder based Voice Conversion | Wen-Chin Huang; Yi-Chiao Wu; Chen-Chou Lo; Patrick Lumban Tobing; Tomoki Hayashi; Kazuhiro Kobayashi; Tomoki Toda; Yu Tsao; Hsin-Min Wang | ||||
2019 | Investigation of Neural Network Approaches for Unified Spectral and Prosodic Feature Enhancement | Wei-Cheng Lin; Yu Tsao; Hsin-Min Wang; Fei Chen | ||||
2022 | Lip Sync Matters: A Novel Multimodal Forgery Detector | Sahibzada Adil Shahzad; Ammarah Hashmi; Sarwar Khan; Yan-Tsung Peng; Yu Tsao; Hsin-Min Wang | ||||
2020 | Lite Audio-Visual Speech Enhancement | Shang-Yi Chuang; Yu Tsao; Chen-Chou Lo; Hsin-Min Wang | ||||
2018 | Locally Linear Embedding Based Post-filtering for Speech Enhancement | Hsin-Te Hwang; Yi-Chiao Wu; Syu-Siang Wang; Chin-Cheng Hsu; Yu Tsao; Hsin-Min Wang; Yih-Ru Wang; Sin-Horng Chen | Journal of Information Science and Engineering 34(6), 1469-1491 | |||
2022 | Mandarin Singing Voice Synthesis with Denoising Diffusion Probabilistic Wasserstein GAN | Yin-Ping Cho; Yu Tsao; Hsin-Min Wang; Yi-Wen Liu | ||||
2022 | MBI-Net: A Non-Intrusive Multi-Branched Speech Intelligibility Prediction Model for Hearing Aids | Ryandhimas Edo Zezario; Fei Chen; Chiou-Shann Fuh; Hsin-Min Wang; Yu Tsao | ||||
2021 | MoEVC: A Mixture of Experts Voice Conversion System With Sparse Gating Mechanism for Online Computation Acceleration | Yu-Tao Chang; Yuan-Hong Yang; Yu-Huai Peng; Syu-Siang Wang; Tai-Shih Chi; Yu Tsao; Hsin-Min Wang | ||||
2019 | MOSNet: Deep Learning based Objective Assessment for Voice Conversion | Chen-Chou Lo; Szu-Wei Fu; Wen-Chin Huang; Xin Wang; Junichi Yamagishi; Yu Tsao; Hsin-Min Wang | ||||
2022 | MTI-Net: A Multi-Target Speech Intelligibility Prediction Model | Ryandhimas Edo Zezario; Szu-wei Fu; Fei Chen; Chiou-Shann Fuh; Hsin-Min Wang; Yu Tsao | ||||
2022 | Multimodal Forgery Detection Using Ensemble Learning | Ammarah Hashmi; Sahibzada Adil Shahzad; Chia-Wen Lin; Yu Tsao; Hsin-Min Wang | ||||
2022 | NASTAR: Noise Adaptive Speech Enhancement with Target-Conditional Resampling | Chi-Chang Lee; Cheng-Hung Hu; Yu-Chen Lin; Chu-Song Chen; Hsin-Min Wang; Yu Tsao | ||||
2019 | Noise Adaptive Speech Enhancement using Domain Adversarial Training | Chien-Feng Liao; Yu Tsao; Hung-Yi Lee; Hsin-Min Wang | ||||
2022 | Partially Fake Audio Detection by Self-attention-based Fake Span Discovery | Haibin Wu; Heng-Cheng Kuo; Naijun Zheng; Kuo-Hsuan Hung; Hung-yi Lee; Yu Tsao; Hsin-Min Wang; Helen Meng | ||||
2018 | Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model based on BLSTM | Szu-wei Fu; Yu Tsao; Hsin-Te Hwang; Hsin-Min Wang | ||||
2019 | Reinforcement Learning based Speech Enhancement for Robust Speech Recognition | Yih-Liang Shen; Chao-Yuan Huang; Syu-Siang Wang; Yu Tsao; Hsin-Min Wang; Tai-Shih Chi | ||||
2020 | Self-Supervised Denoising Autoencoder with Linear Regression Decoder for Speech Enhancement | Ryandhimas Edo Zezario; Tassadaq Hussain; Xugang Lu; Hsin-Min Wang; Yu Tsao | ||||
2021 | Sensing ecosystem dynamics via audio source separation: A case study of marine soundscapes off northeastern Taiwan | Tzu-Hao Lin; Tomonari Akamatsu; Yu Tsao | PLOS Computational Biology 17(2), e1008698 | |||
2020 | SERIL: Noise Adaptive Speech Enhancement using Regularization-based Incremental Learning | Chi-Chang Lee; Yu-Chen Lin; Hsuan-Tien Lin; Hsin-Min Wang; Yu Tsao |