Publication Date | Title | Authors | Source | Scopus | WOS | Full Text |
2020 | Multichannel Speech Enhancement by Raw Waveform-mapping using Fully Convolutional Networks | Chang-Le Liu; Sze-Wei Fu; You-Jin Li; Jen-Wei Huang; Hsin-Min Wang ; Yu Tsao | IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 1888-1900 | | | |
2020 | Multimodal Deep Learning Framework for Image Popularity Prediction on Social Media | Abousaleh, Fatma S.; Cheng, Wen-Huang; Yu, Neng-Hao; Tsao, Yu | IEEE Transactions on Cognitive and Developmental Systems 13(3), 679-692 | | | |
2016 | Nonnegative matrix factorization-based frequency lowering technology for Mandarin-speaking hearing aid users | Liu, Yen-Teh; Tsao, Yu ; Chang, Ronald Y. | | | | |
2016 | Pair-Wise Distance Metric Learning of Neural Network Model for Spoken Language Identification | Lu, Xugang; Shen, Peng; Tsao, Yu ; Kawai, Hisashi | | | | |
2022 | Partially Fake Audio Detection by Self-attention-based Fake Span Discovery | Haibin Wu; Heng-Cheng Kuo; Naijun Zheng; Kuo-Hsuan Hung; Hung-yi Lee; Yu Tsao ; Hsin-Min Wang ; Helen Meng | | | | |
2021 | QISTA-Net-Audio: Audio Super-Resolution via Non-Convex ℓ_q-Norm Minimization | Gang-Xuan Lin; Shih-Wei Hu; Yen-Ju Lu; Yu Tsao ; Chun-Shien Lu | | | | |
2021 | Relational Data Selection for Data Augmentation of Speaker-dependent Multi-band MelGAN Vocoder | Yi-Chiao Wu; Cheng-Hung Hu; Hung-Shin Lee; Yu-Huai Peng; Wen-Chin Huang; Yu Tsao ; Hsin-Min Wang ; Tomoki Toda | | | | |
2021 | Siamese Neural Network with Joint Bayesian Model Structure for Speaker Verification | X. Lu; P. Shen; Y. Tsao ; H. Kawai | | | | |
2016 | SNR-Aware Convolutional Neural Network Modeling for Speech Enhancement | Fu, Szu-Wei; Tsao, Yu ; Lu, Xugang | | | | |
2019 | Source separation in ecoacoustics: A roadmap towards versatile soundscape information retrieval | Lin, Tzu-Hao ; Tsao, Yu | Remote Sensing in Ecology and Conservation 6(3), 236-247 | | | |
2020 | Speech Enhancement based on Denoising Autoencoder with Multi-branched Encoders | Cheng Yu; Ryandhimas E. Zezario; Syu-Siang Wang; Jonathan Sherman; Yi-Yen Hsieh; Xugang Lu; Hsin-Min Wang ; Yu Tsao | IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 2756-2769 | | | |
2021 | Speech Enhancement with Zero-Shot Model Selection | Ryandhimas E. Zezario; Chiou-Shann Fuh; Hsin-Min Wang ; Yu Tsao | | | | |
2022 | Speech-enhanced and Noise-aware Networks for Robust Speech Recognition | Hung-Shin Lee; Pin-Yuan Chen; Yao-Fei Cheng; Yu Tsao ; Hsin-Min Wang | | | | |
2020 | Subspace-based Representation and Learning for Phonotactic Spoken Language Recognition | Lee, Hung-Shin; Tsao, Yu ; Jeng, Shyh-Kang; Wang, Hsin-Min | IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 3065-3079 | | | |
2022 | SVSNet: An End-to-end Speaker Voice Similarity Assessment Model | Cheng-Hung Hu; Yu-Huai Peng; Junichi Yamagishi; Yu Tsao ; Hsin-Min Wang | IEEE Signal Processing Letters 29, 767-771 | | | |
2021 | Time Alignment Using Lip Images for Frame-Based Electrolaryngeal Voice Conversion | Yi-Syuan Liou; Wen-Chin Huang; Ming-Chi Yen; Shu-Wei Tsai; Yu-Huai Peng; Tomoki Toda; Yu Tsao ; Hsin-Min Wang | | | | |
2020 | Time-Domain Multi-Modal Bone/Air Conducted Speech Enhancement | Yu, Cheng; Hung, Kuo-Hsuan; Wang, Syu-Siang; Tsao, Yu ; Hung, Jeih-weih | IEEE Signal Processing Letters 27, 1035-1039 | | | |
2021 | Unsupervised Noise Adaptive Speech Enhancement by Discriminator-Constrained Optimal Transport | H.-Y. Lin; H.-H. Tseng; X. Lu; Yu Tsao | | | | |
2020 | Unsupervised Representation Disentanglement using Cross Domain Features and Adversarial Learning in Variational Autoencoder based Voice Conversion | Huang, Wen-Chin; Luo, Hao; Hwang, Hsin-Te; Lo, Chen-Chou; Peng, Yu-Huai; Tsao, Yu ; Wang, Hsin-Min | IEEE Transactions on Emerging Topics in Computational Intelligence 4(4), 468-479 | | | |
2016 | Voice Conversion from Non-parallel Corpora Using Variational Auto-encoder | Chin-Cheng Hsu; Hsin-Te Hwang; Yi-Chiao Wu; Yu Tsao ; Hsin-Min Wang | | | | |