Issue Date | Title | Author(s) | Relation | scopus | WOS | Fulltext/Archive link |
2016 | Pair-Wise Distance Metric Learning of Neural Network Model for Spoken Language Identification | Lu, Xugang; Shen, Peng; Tsao, Yu; Kawai, Hisashi | | | | |
2022 | Partially Fake Audio Detection by Self-attention-based Fake Span Discovery | Haibin Wu; Heng-Cheng Kuo; Naijun Zheng; Kuo-Hsuan Hung; Hung-yi Lee; Yu Tsao; Hsin-Min Wang; Helen Meng | | | | |
2021 | QISTA-Net-Audio: Audio Super-Resolution via Non-Convex ℓ_q-Norm Minimization | Gang-Xuan Lin; Shih-Wei Hu; Yen-Ju Lu; Yu Tsao; Chun-Shien Lu | | | | |
2021 | Relational Data Selection for Data Augmentation of Speaker-dependent Multi-band MelGAN Vocoder | Yi-Chiao Wu; Cheng-Hung Hu; Hung-Shin Lee; Yu-Huai Peng; Wen-Chin Huang; Yu Tsao; Hsin-Min Wang; Tomoki Toda | | | | |
2021 | Siamese Neural Network with Joint Bayesian Model Structure for Speaker Verification | X. Lu; P. Shen; Y. Tsao; H. Kawai | | | | |
2016 | SNR-Aware Convolutional Neural Network Modeling for Speech Enhancement | Fu, Szu-Wei; Tsao, Yu; Lu, Xugang | | | | |
2019 | Source separation in ecoacoustics: A roadmap towards versatile soundscape information retrieval | Lin, Tzu-Hao; Tsao, Yu | Remote Sensing in Ecology and Conservation 6(3), 236-247 | | | |
2020 | Speech Enhancement based on Denoising Autoencoder with Multi-branched Encoders | Cheng Yu; Ryandhimas E. Zezario; Syu-Siang Wang; Jonathan Sherman; Yi-Yen Hsieh; Xugang Lu; Hsin-Min Wang; Yu Tsao | IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 2756-2769 | | | |
2021 | Speech Enhancement with Zero-Shot Model Selection | Ryandhimas E. Zezario; Chiou-Shann Fuh; Hsin-Min Wang; Yu Tsao | | | | |
2022 | Speech-enhanced and Noise-aware Networks for Robust Speech Recognition | Hung-Shin Lee; Pin-Yuan Chen; Yao-Fei Cheng; Yu Tsao; Hsin-Min Wang | | | | |
2020 | Subspace-based Representation and Learning for Phonotactic Spoken Language Recognition | Lee, Hung-Shin; Tsao, Yu; Jeng, Shyh-Kang; Wang, Hsin-Min | IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 3065-3079 | | | |
2022 | SVSNet: An End-to-end Speaker Voice Similarity Assessment Model | Cheng-Hung Hu; Yu-Huai Peng; Junichi Yamagishi; Yu Tsao; Hsin-Min Wang | IEEE Signal Processing Letters 29, 767-771 | | | |
2021 | Time Alignment Using Lip Images for Frame-Based Electrolaryngeal Voice Conversion | Yi-Syuan Liou; Wen-Chin Huang; Ming-Chi Yen; Shu-Wei Tsai; Yu-Huai Peng; Tomoki Toda; Yu Tsao; Hsin-Min Wang | | | | |
2020 | Time-Domain Multi-Modal Bone/Air Conducted Speech Enhancement | Yu, Cheng; Hung, Kuo-Hsuan; Wang, Syu-Siang; Tsao, Yu; Hung, Jeih-Weih | IEEE Signal Processing Letters 27, 1035-1039 | | | |
2021 | Unsupervised Noise Adaptive Speech Enhancement by Discriminator-Constrained Optimal Transport | H.-Y. Lin; H.-H. Tseng; X. Lu; Yu Tsao | | | | |
2020 | Unsupervised Representation Disentanglement using Cross Domain Features and Adversarial Learning in Variational Autoencoder based Voice Conversion | Huang, Wen-Chin; Luo, Hao; Hwang, Hsin-Te; Lo, Chen-Chou; Peng, Yu-Huai; Tsao, Yu; Wang, Hsin-Min | IEEE Transactions on Emerging Topics in Computational Intelligence 4(4), 468-479 | | | |
2016 | Voice Conversion from Non-parallel Corpora Using Variational Auto-encoder | Chin-Cheng Hsu; Hsin-Te Hwang; Yi-Chiao Wu; Yu Tsao; Hsin-Min Wang | | | | |
2017 | Voice Conversion from Unaligned Corpora Using Variational Autoencoding Wasserstein Generative Adversarial Networks | Chin-Cheng Hsu; Hsin-Te Hwang; Yi-Chiao Wu; Yu Tsao; Hsin-Min Wang | | | | |
2020 | WaveCRN: An Efficient Convolutional Recurrent Neural Network for End-to-end Speech Enhancement | Tsun-An Hsieh; Hsin-Min Wang; Xugang Lu; Yu Tsao | IEEE Signal Processing Letters 27, 2149-2153 | | | |
2017 | Wavelet Speech Enhancement Based on Robust Principal Component Analysis | Chia-Lung Wu; Hsiang-Ping Hsu; Syu-Siang Wang; Jeih-Weih Hung; Ying-Hui Lai; Hsin-Min Wang; Yu Tsao | | | | |