# | Issue Date | Title | Author(s) | Relation | Scopus | WOS | Fulltext/Archive link |
---|---|---|---|---|---|---|---|
1 | 2023 | Deep Learning-based Non-Intrusive Multi-Objective Speech Assessment Model with Cross-Domain Features | Ryandhimas E. Zezario; Szu-Wei Fu; Fei Chen; Chiou-Shann Fuh; Hsin-Min Wang ; Yu Tsao | IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 54-70 | |||
2 | 2022 | Speech-enhanced and Noise-aware Networks for Robust Speech Recognition | Hung-Shin Lee; Pin-Yuan Chen; Yao-Fei Cheng; Yu Tsao ; Hsin-Min Wang | ||||
3 | 2022 | Chinese Movie Dialogue Question Answering Dataset | Shang-Bao Luo; Hsin-Min Wang ; Kuan-Yu Chen; Keh-Yih Su ; Yu Tsao ; Cheng-Chung Fan | ||||
4 | 2022 | Filter-based Discriminative Autoencoders for Children Speech Recognition | Chiang-Lin Tai; Hung-Shin Lee; Yu Tsao ; Hsin-Min Wang | ||||
5 | 2022 | EMGSE: Acoustic/EMG Fusion for Multimodal Speech Enhancement | Kuan-Chen Wang; Kai-Chun Liu; Hsin-Min Wang ; Yu Tsao | ||||
6 | 2022 | Partially Fake Audio Detection by Self-attention-based Fake Span Discovery | Haibin Wu; Heng-Cheng Kuo; Naijun Zheng; Kuo-Hsuan Hung; Hung-yi Lee; Yu Tsao ; Hsin-Min Wang ; Helen Meng | ||||
7 | 2022 | Boundary-Preserved Deep Denoising of Stochastic Resonance Enhanced Multiphoton Images | Niu, Sheng-Yong; Guo, Lun-Zhang; Li, Yue; Zhang, Zhiming; Wang, Tzung-Dau; Liu, Kai-Chun; Li, You-Jin; Tsao, Yu ; Liu, Tzu-Ming | IEEE Journal of Translational Engineering in Health and Medicine 10, 1800812 | |||
8 | 2022 | Improved Lite Audio-Visual Speech Enhancement | Shang-Yi Chuang; Hsin-Min Wang ; Yu Tsao | IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 1345-1359 | |||
9 | 2022 | SVSNet: An End-to-end Speaker Voice Similarity Assessment Model | Cheng-Hung Hu; Yu-Huai Peng; Junichi Yamagishi; Yu Tsao ; Hsin-Min Wang | IEEE Signal Processing Letters 29, 767-771 | |||
10 | 2021 | A Study on Speech Enhancement Based on Diffusion Probabilistic Model | Y.-J. Lu; Y. Tsao ; S. Watanabe | ||||
11 | 2021 | Estimation and Correction of Relative Transfer Function for Binaural Speech Separation Networks to Preserve Spatial Cues | Z. Feng; Yu Tsao ; F. Chen | ||||
12 | 2021 | MIMO Speech Compression and Enhancement Based on Convolutional Denoising Autoencoder | Y.-J. Li; S.-S. Wang; Y. Tsao ; B. Su | ||||
13 | 2021 | Siamese Neural Network with Joint Bayesian Model Structure for Speaker Verification | X. Lu; P. Shen; Y. Tsao ; H. Kawai | ||||
14 | 2021 | Time Alignment Using Lip Images for Frame-Based Electrolaryngeal Voice Conversion | Yi-Syuan Liou; Wen-Chin Huang; Ming-Chi Yen; Shu-Wei Tsai; Yu-Huai Peng; Tomoki Toda; Yu Tsao ; Hsin-Min Wang | ||||
15 | 2021 | HASA-NET: A Non-Intrusive Hearing-Aid Speech Assessment Network | Hsin-Tien Chiang; Yi-Chiao Wu; Cheng Yu; Tomoki Toda; Hsin-Min Wang ; Yih-Chun Hu; Yu Tsao | ||||
16 | 2021 | An Exploration of Self-Supervised Pretrained Representations for End-to-End Speech Recognition | X. Chang; T. Maekaku; P. Guo; J. Shi; Y.-J. Lu; A. S. Subramanian; T. Wang; S.-w. Yang; Y. Tsao ; H.-y. Lee; S. Watanabe | ||||
17 | 2021 | Unsupervised Noise Adaptive Speech Enhancement by Discriminator-Constrained Optimal Transport | H.-Y. Lin; H.-H. Tseng; X. Lu; Yu Tsao | ||||
18 | 2021 | Mandarin Electrolaryngeal Speech Voice Conversion with Sequence-to-Sequence Modeling | Ming-Chi Yen; Wen-Chin Huang; Kazuhiro Kobayashi; Yu-Huai Peng; Shu-Wei Tsai; Yu Tsao ; Tomoki Toda; Jyh-Shing Jang; Hsin-Min Wang | ||||
19 | 2021 | Investigation of a Single-Channel Frequency-Domain Speech Enhancement Network to Improve End-to-End Bengali Automatic Speech Recognition Under Unseen Noisy Conditions | Md Mahbub E Noor; Yen-Ju Lu; Syu-Siang Wang; Supratip Ghose; Chia-Yu Chang; Ryandhimas E. Zezario; Shafique Ahmed; Wei-Ho Chung; Yu Tsao ; Hsin-Min Wang | ||||
20 | 2021 | A Flexible and Extensible Framework for Multiple Answer Modes Question Answering | Cheng-Chung Fan; Keh-Yih Su ; Kuan-Yu Chen; Yu Tsao ; Jia-Zhi Guo; Shang-Bao Luo; Pei-Jun Liao; Kuang-Yu Chang; Chiao-Wei Hsu; Meng-Tse Wu; Shih-Hong Tsai; Tzu-Man Wu; Aleksandra Smolka; Chao-Chun Liang; Hsin-Min Wang | ||||
21 | 2021 | Relational Data Selection for Data Augmentation of Speaker-dependent Multi-band MelGAN Vocoder | Yi-Chiao Wu; Cheng-Hung Hu; Hung-Shin Lee; Yu-Huai Peng; Wen-Chin Huang; Yu Tsao ; Hsin-Min Wang ; Tomoki Toda | ||||
22 | 2021 | QISTA-Net-Audio: Audio Super-Resolution via Non-Convex ℓ_q-Norm Minimization | Gang-Xuan Lin; Shih-Wei Hu; Yen-Ju Lu; Yu Tsao ; Chun-Shien Lu | ||||
23 | 2021 | MetricGAN+: An Improved Version of MetricGAN for Speech Enhancement | S.-W. Fu; C. Yu; T.-A. Hsieh; P. Plantinga; M. Ravanelli; X. Lu; Y. Tsao | ||||
24 | 2021 | A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice Conversion | Wen-Chin Huang; Kazuhiro Kobayashi; Yu-Huai Peng; Ching-Feng Liu; Yu Tsao ; Hsin-Min Wang ; Tomoki Toda | ||||
25 | 2021 | Improving Perceptual Quality by Phone-Fortified Perceptual Loss using Wasserstein Distance for Speech Enhancement | T.-A. Hsieh; C. Yu; S.-W. Fu; X. Lu; Y. Tsao | ||||
26 | 2021 | Speech Enhancement with Zero-Shot Model Selection | Ryandhimas E. Zezario; Chiou-Shann Fuh; Hsin-Min Wang ; Yu Tsao | ||||
27 | 2021 | A Study of Incorporating Articulatory Movement Information in Speech Enhancement | Y.-W. Chen; K.-H. Hung; S.-Y. Chuang; J. Sherman; X. Lu; Y. Tsao | ||||
28 | 2021 | Instrumented Shoulder Functional Assessment using Inertial Measurement Units for Frozen Shoulder | T.-Y. Lu; K.-C. Liu; C.-Y. Hsieh; C.-Y. Chang; Y. Tsao ; C.-T. Chan | ||||
29 | 2021 | Attention-based multi-task learning for speech-enhancement and speaker-identification in multi-speaker dialogue scenario | C.-J. Peng; Y.-J. Chan; C. Yu; S.-S. Wang; Y. Tsao ; T.-S. Chi | ||||
30 | 2021 | EMA2S: An End-to-End Multimodal Articulatory-to-Speech System | Y.-W. Chen; K.-H. Hung; S.-Y. Chuang; J. Sherman; W.-C. Huang; X. Lu; Y. Tsao | ||||
31 | 2021 | MoEVC: A Mixture of Experts Voice Conversion System With Sparse Gating Mechanism for Online Computation Acceleration | Y.-T. Chang; Y.-H. Yang; Y.-H. Peng; S.-S. Wang; T.-S. Chi; Y. Tsao ; H.-M. Wang | ||||
32 | 2020 | Improving the Intelligibility of Speech for Simulated Electric and Acoustic Stimulation Using Fully Convolutional Neural Networks | Wang, Natalie Yu-Hsien; Wang, Hsiao-Lan Sharon; Wang, Tao-Wei; Fu, Szu-Wei; Lu, Xugang; Wang, Hsin-Min ; Tsao, Yu | IEEE Transactions on Neural Systems and Rehabilitation Engineering 29, 184-195 | |||
33 | 2020 | Subspace-based Representation and Learning for Phonotactic Spoken Language Recognition | Lee, Hung-Shin; Tsao, Yu ; Jeng, Shyh-Kang; Wang, Hsin-Min | IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 3065-3079 | |||
34 | 2020 | WaveCRN: An Efficient Convolutional Recurrent Neural Network for End-to-end Speech Enhancement | Tsun-An Hsieh; Hsin-Min Wang ; Xugang Lu; Yu Tsao | IEEE Signal Processing Letters 27, 2149-2153 | |||
35 | 2020 | Speech Enhancement based on Denoising Autoencoder with Multi-branched Encoders | Cheng Yu; Ryandhimas E. Zezario; Syu-Siang Wang; Jonathan Sherman; Yi-Yen Hsieh; Xugang Lu; Hsin-Min Wang ; Yu Tsao | IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 2756-2769 | |||
36 | 2020 | Blind monaural source separation on heart and lung sounds based on periodic-coded deep autoencoder | Kun-Hsi Tsai; Wei-Chien Wang; Chui-Hsuan Cheng; Chan-Yen Tsai; Jou-Kou Wang; Tzu-Hao Lin ; Shih-Hau Fang; Li-Chin Chen; Yu Tsao | IEEE Journal of Biomedical and Health Informatics 24(11), 3203-3214 | |||
37 | 2020 | Unsupervised Representation Disentanglement using Cross Domain Features and Adversarial Learning in Variational Autoencoder based Voice Conversion | Huang, Wen-Chin; Luo, Hao; Hwang, Hsin-Te; Lo, Chen-Chou; Peng, Yu-Huai; Tsao, Yu ; Wang, Hsin-Min | IEEE Transactions on Emerging Topics in Computational Intelligence 4(4), 468-479 | |||
38 | 2020 | Multichannel Speech Enhancement by Raw Waveform-mapping using Fully Convolutional Networks | Chang-Le Liu; Sze-Wei Fu; You-Jin Li; Jen-Wei Huang; Hsin-Min Wang ; Yu Tsao | IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 1888-1900 | |||
39 | 2020 | Time-Domain Multi-Modal Bone/Air Conducted Speech Enhancement | Yu, Cheng; Hung, Kuo-Hsuan; Wang, Syu-Siang; Tsao, Yu ; Hung, Jeih-weih | IEEE Signal Processing Letters 27, 1035-1039 | |||
40 | 2020 | A Study of Joint Effect on Denoising Techniques and Visual Cues to Improve Speech Intelligibility in Cochlear Implant Simulation | Tseng, Rung-Yu; Wang, Tao-Wei; Fu, Szu-Wei; Lee, Chia-Ying; Tsao, Yu | IEEE Transactions on Cognitive and Developmental Systems (Early Access) | |||
41 | 2020 | Dress With Style: Learning Style From Joint Deep Embedding of Clothing Styles and Body Shapes | Hidayati, Shintami Chusnul; Goh, Ting Wei; Chan, Ji-Sheng Gary; Hsu, Cheng-Chun; See, John; Wong, Lai-Kuan; Hua, Kai-Lung; Tsao, Yu ; Cheng, Wen-Huang | IEEE Transactions on Multimedia 23, 365-377 | |||
42 | 2020 | Multimodal Deep Learning Framework for Image Popularity Prediction on Social Media | Abousaleh, Fatma S.; Cheng, Wen-Huang; Yu, Neng-Hao; Tsao, Yu | IEEE Transactions on Cognitive and Developmental Systems 13(3), 679-692 | |||
43 | 2019 | Ensemble Hierarchical Extreme Learning Machine for Speech Dereverberation | Hussain, Tassadaq; Siniscalchi, Sabato Marco; Wang, Hsiao-Lan Sharon; Tsao, Yu ; Salerno, Valerio Mario; Liao, Wen-Hung | IEEE Transactions on Cognitive and Developmental Systems 12(4), 744-758 | |||
44 | 2019 | Learning With Learned Loss Function: Speech Enhancement With Quality-Net to Improve Perceptual Evaluation of Speech Quality | Fu, Szu-Wei; Liao, Chien-Feng; Tsao, Yu | IEEE Signal Processing Letters 27, 26-30 | |||
45 | 2019 | Source separation in ecoacoustics: A roadmap towards versatile soundscape information retrieval | Lin, Tzu-Hao ; Tsao, Yu | Remote Sensing in Ecology and Conservation 6(3), 236-247 | |||
46 | 2017 | Wavelet Speech Enhancement Based on Robust Principal Component Analysis | Chia-Lung Wu; Hsiang-Ping Hsu; Syu-Siang Wang; Jeih-Weih Hung; Ying-Hui Lai; Hsin-Min Wang ; Yu Tsao | ||||
47 | 2017 | Discriminative Autoencoders for Acoustic Modeling | Ming-Han Yang; Hung-Shin Lee; Yu-Ding Lu; Kuan-Yu Chen; Yu Tsao ; Berlin Chen; Hsin-Min Wang | ||||
48 | 2017 | A Post-filtering Approach Based on Locally Linear Embedding Difference Compensation for Speech Enhancement | Yi-Chiao Wu; Hsin-Te Hwang; Syu-Siang Wang; Chin-Cheng Hsu; Yu Tsao ; Hsin-Min Wang | ||||
49 | 2017 | Voice Conversion from Unaligned Corpora Using Variational Autoencoding Wasserstein Generative Adversarial Networks | Chin-Cheng Hsu; Hsin-Te Hwang; Yi-Chiao Wu; Yu Tsao ; Hsin-Min Wang | ||||
50 | 2017 | Adaptive Dynamic Range Compression for Improving Envelope-Based Speech Perception: Implications for Cochlear Implants | Ying-Hui Lai; Fei Chen; Yu Tsao | Emerging Technology and Architecture for Big-data Analytics (Switzerland : Springer) | |||
51 | 2017 | Discriminative Autoencoders for Speaker Verification | Hung-Shin Lee; Yu-Ding Lu; Chin-Cheng Hsu; Yu Tsao ; Hsin-Min Wang ; Shyh-Kang Jeng | ||||
52 | 2017 | A Locally Linear Embedding Based Postfiltering Approach for Speech Enhancement | Yi-Chiao Wu; Hsin-Te Hwang; Syu-Siang Wang; Chin-Cheng Hsu; Ying-Hui Lai; Yu Tsao ; Hsin-Min Wang | ||||
53 | 2016 | Voice Conversion from Non-parallel Corpora Using Variational Auto-encoder | Chin-Cheng Hsu; Hsin-Te Hwang; Yi-Chiao Wu; Yu Tsao ; Hsin-Min Wang | ||||
54 | 2016 | Audio-Visual Speech Enhancement using Deep Neural Networks | Jen-Cheng Hou; Syu-Siang Wang; Ying-Hui Lai; Jen-Chun Lin; Yu Tsao ; Hsiu-Wen Chang; Hsin-Min Wang | ||||
55 | 2016 | Incorporating local environment information with ensemble neural networks to robust automatic speech recognition | Hsu, Chia-Yung; Zezario, Ryandhimas E.; Wang, Jia-Ching; Ho, Chin-Wen; Lu, Xugang; Tsao, Yu | ||||
56 | 2016 | A linear regression model with dynamic pulse transit time features for noninvasive blood pressure prediction | Hsieh, Yi-Yen; Wu, Ching-Da; Lu, Shey-Shi; Tsao, Yu | ||||
57 | 2016 | Improving the performance of speech perception in noisy environment based on an FAME strategy | Lai, Ying-Hui; Wang, Syu-Siang; Su, Yu-Ting; Han-Che, Cheng; Fu, Fan Kang; Tsao, Yu | ||||
58 | 2016 | SNR-Aware Convolutional Neural Network Modeling for Speech Enhancement | Fu, Szu-Wei; Tsao, Yu ; Lu, Xugang | ||||
59 | 2016 | Pair-Wise Distance Metric Learning of Neural Network Model for Spoken Language Identification | Lu, Xugang; Shen, Peng; Tsao, Yu ; Kawai, Hisashi | ||||
60 | 2016 | Minimization of Regression and Ranking Losses with Shallow Neural Networks on Automatic Sincerity Evaluation | Hung-Shin Lee; Yu Tsao ; Chi-Chun Lee; Hsin-Min Wang ; Wei-Cheng Lin; Wei-Chen Chen; Shan-Wen Hsiao; Shyh-Kang Jeng | ||||
61 | 2016 | Locally Linear Embedding for Exemplar-Based Spectral Conversion | Yi-Chiao Wu; Hsin-Te Hwang; Chin-Cheng Hsu; Yu Tsao ; Hsin-Min Wang | ||||
62 | 2016 | Nonnegative matrix factorization-based frequency lowering technology for Mandarin-speaking hearing aid users | Liu, Yen-Teh; Tsao, Yu ; Chang, Ronald Y. |