Results 1-42 of 42 (Search time: 0.005 seconds).
Issue Date | Title | Author(s) | Relation | scopus | WOS | Fulltext/Archive link | |
---|---|---|---|---|---|---|---|
1 | 2023 | Deep Learning-based Non-Intrusive Multi-Objective Speech Assessment Model with Cross-Domain Features | Ryandhimas E. Zezario; Szu-Wei Fu; Fei Chen; Chiou-Shann Fuh; Hsin-Min Wang ; Yu Tsao | IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 54-70 | |||
2 | 2022 | Speech-enhanced and Noise-aware Networks for Robust Speech Recognition | Hung-Shin Lee; Pin-Yuan Chen; Yao-Fei Cheng; Yu Tsao ; Hsin-Min Wang | ||||
3 | 2022 | Chinese Movie Dialogue Question Answering Dataset | Shang-Bao Luo; Hsin-Min Wang ; Kuan-Yu Chen; Keh-Yih Su ; Yu Tsao ; Cheng-Chung Fan | ||||
4 | 2022 | Filter-based Discriminative Autoencoders for Children Speech Recognition | Chiang-Lin Tai; Hung-Shin Lee; Yu Tsao ; Hsin-Min Wang | ||||
5 | 2022 | Partially Fake Audio Detection by Self-attention-based Fake Span Discovery | Haibin Wu; Heng-Cheng Kuo; Naijun Zheng; Kuo-Hsuan Hung; Hung-yi Lee; Yu Tsao ; Hsin-Min Wang ; Helen Meng | ||||
6 | 2022 | EMGSE: Acoustic/EMG Fusion for Multimodal Speech Enhancement | Kuan-Chen Wang; Kai-Chun Liu; Hsin-Min Wang ; Yu Tsao | ||||
7 | 2022 | SVSNet: An End-to-end Speaker Voice Similarity Assessment Model | Cheng-Hung Hu; Yu-Huai Peng; Junichi Yamagishi; Yu Tsao ; Hsin-Min Wang | IEEE Signal Processing Letters 29, 767-771 | |||
8 | 2022 | Boundary-Preserved Deep Denoising of Stochastic Resonance Enhanced Multiphoton Images | Niu, Sheng-Yong; Guo, Lun-Zhang; Li, Yue; Zhang, Zhiming; Wang, Tzung-Dau; Liu, Kai-Chun; Li, You-Jin; Tsao, Yu ; Liu, Tzu-Ming | IEEE Journal of Translational Engineering in Health and Medicine 10, 1800812 | |||
9 | 2022 | Improved Lite Audio-Visual Speech Enhancement | Shang-Yi Chuang; Hsin-Min Wang ; Yu Tsao | IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 1345-1359 | |||
10 | 2021 | Siamese Neural Network with Joint Bayesian Model Structure for Speaker Verification | X. Lu; P. Shen; Y. Tsao ; H. Kawai | ||||
11 | 2021 | MIMO Speech Compression and Enhancement Based on Convolutional Denoising Autoencoder | Y.-J. Li; S.-S. Wang; Y. Tsao ; B. Su | ||||
12 | 2021 | A Study on Speech Enhancement Based on Diffusion Probabilistic Model | Y.-J. Lu; Y. Tsao ; S. Watanabe | ||||
13 | 2021 | Estimation and Correction of Relative Transfer Function for Binaural Speech Separation Networks to Preserve Spatial Cues | Z. Feng; Yu Tsao ; F. Chen | ||||
14 | 2021 | Time Alignment Using Lip Images for Frame-Based Electrolaryngeal Voice Conversion | Yi-Syuan Liou; Wen-Chin Huang; Ming-Chi Yen; Shu-Wei Tsai; Yu-Huai Peng; Tomoki Toda; Yu Tsao ; Hsin-Min Wang | ||||
15 | 2021 | HASA-NET: A Non-Intrusive Hearing-Aid Speech Assessment Network | Hsin-Tien Chiang; Yi-Chiao Wu; Cheng Yu; Tomoki Toda; Hsin-Min Wang ; Yih-Chun Hu; Yu Tsao | ||||
16 | 2021 | An Exploration of Self-Supervised Pretrained Representations for End-to-End Speech Recognition | X. Chang; T. Maekaku; P. Guo; J. Shi; Y.-J. Lu; A. S. Subramanian; T. Wang; S.-w. Yang; Y. Tsao ; H.-y. Lee; S. Watanabe | ||||
17 | 2021 | Unsupervised Noise Adaptive Speech Enhancement by Discriminator-Constrained Optimal Transport | H.-Y. Lin; H.-H. Tseng; X. Lu; Yu Tsao | ||||
18 | 2021 | Mandarin Electrolaryngeal Speech Voice Conversion with Sequence-to-Sequence Modeling | Ming-Chi Yen; Wen-Chin Huang; Kazuhiro Kobayashi; Yu-Huai Peng; Shu-Wei Tsai; Yu Tsao ; Tomoki Toda; Jyh-Shing Jang; Hsin-Min Wang | ||||
19 | 2021 | Investigation of a Single-Channel Frequency-Domain Speech Enhancement Network to Improve End-to-End Bengali Automatic Speech Recognition Under Unseen Noisy Conditions | Md Mahbub E Noor|; Yen-Ju Lu; Syu-Siang Wang; Supratip Ghose; Chia-Yu Chang; Ryandhimas E. Zezario; Shafique Ahmed; Wei-Ho Chung; Yu Tsao ; Hsin-Min Wang | ||||
20 | 2021 | A Flexible and Extensible Framework for Multiple Answer Modes Question Answering | Cheng-Chung Fan; Keh-Yih Su ; Kuan-Yu Chen; Yu Tsao ; Jia-Zhi Guo; Shang-Bao Luo; Pei-Jun Liao; Kuang-Yu Chang; Chiao-Wei Hsu; Meng-Tse Wu; Shih-Hong Tsai; Tzu-Man Wu; Aleksandra Smolka; Chao-Chun Liang; Hsin-Min Wang | ||||
21 | 2021 | Relational Data Selection for Data Augmentation of Speaker-dependent Multi-band MelGAN Vocoder | Yi-Chiao Wu; Cheng-Hung Hu; Hung-Shin Lee; Yu-Huai Peng; Wen-Chin Huang; Yu Tsao ; Hsin-Min Wang ; Tomoki Toda | ||||
22 | 2021 | A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice Conversion | Wen-Chin Huang; Kazuhiro Kobayashi; Yu-Huai Peng; Ching-Feng Liu; Yu Tsao ; Hsin-Min Wang ; Tomoki Toda | ||||
23 | 2021 | Improving Perceptual Quality by Phone-Fortified Perceptual Loss using Wasserstein Distance for Speech Enhancement | T.-A. Hsieh; C. Yu; S.-W. Fu; X. Lu; Y. Tsao | ||||
24 | 2021 | QISTA-Net-Audio: Audio Super-Resolution via Non-Convex ℓ_q-Norm Minimization | Gang-Xuan Lin; Shih-Wei Hu; Yen-Ju Lu; Yu Tsao ; Chun-Shien Lu | ||||
25 | 2021 | MetricGAN+: An Improved Version of MetricGAN for Speech Enhancement | S.-W. Fu; C. Yu; T.-A. Hsieh; P. Plantinga; M. Ravanelli; X. Lu; Y. Tsao | ||||
26 | 2021 | Speech Enhancement with Zero-Shot Model Selection | Ryandhimas E. Zezario; Chiou-Shann Fuh; Hsin-Min Wang ; Yu Tsao | ||||
27 | 2021 | A Study of Incorporating Articulatory Movement Information in Speech Enhancement | Y.-W. Chen; K.-H. Hung; S.-Y. Chuang; J. Sherman; X. Lu; Y. Tsao | ||||
28 | 2021 | Instrumented Shoulder Functional Assessment using Inertial Measurement Units for Frozen Shoulder | T.-Y. Lu; K.-C. Liu; C.-Y. Hsieh; C.-Y. Chang; Y. Tsao ; C.-T. Chan | ||||
29 | 2021 | Attention-based multi-task learning for speech-enhancement and speaker-identification in multi-speaker dialogue scenario | C.-J. Peng; Y.-J. Chan; C. Yu; S.-S. Wang; Y. Tsao ; T.-S. Chi | ||||
30 | 2021 | EMA2S: An End-to-End Multimodal Articulatory-to-Speech System | Y.-W. Chen; K.-H. Hung; S.-Y. Chuang; J. Sherman; W.-C. Huang; X. Lu; Y. Tsao | ||||
31 | 2021 | MoEVC: A Mixture of Experts Voice Conversion System With Sparse Gating Mechanism for Online Computation Acceleration | Y.-T. Chang; Y.-H. Yang; Y.-H. Peng; S.-S. Wang; T.-S. Chi; Y. Tsao ; H.-M. Wang | ||||
32 | 2020 | Improving the Intelligibility of Speech for Simulated Electric and Acoustic Stimulation Using Fully Convolutional Neural Networks | Wang, Natalie Yu-Hsien; Wang, Hsiao-Lan Sharon; Wang, Tao-Wei; Fu, Szu-Wei; Lu, Xugan; Wang, Hsin-Min ; Tsao, Yu | IEEE Transactions on Neural Systems and Rehabilitation Engineering 29, 184-195 | |||
33 | 2020 | Subspace-based Representation and Learning for Phonotactic Spoken Language Recognition | Lee, Hung-Shin; Tsao, Yu ; Jeng, Shyh-Kang; Wang, Hsin-Min | IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 3065-3079 | |||
34 | 2020 | WaveCRN: An Efficient Convolutional Recurrent Neural Network for End-to-end Speech Enhancement | Tsun-An Hsieh; Hsin-Min Wang ; Xugang Lu; Yu Tsao | IEEE Signal Processing Letters 27, 2149-2153 | |||
35 | 2020 | Speech Enhancement based on Denoising Autoencoder with Multi-branched Encoders | Cheng Yu; Ryandhimas E. Zezario; Syu-Siang Wang; Jonathan Sherman; Yi-Yen Hsieh; Xugang Lu; Hsin-Min Wang ; Yu Tsao | IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 2756-2769 | |||
36 | 2020 | Blind monaural source separation on heart and lung sounds based on periodic-coded deep autoencoder | Kun-Hsi Tsai; Wei-Chien Wang; Chui-Hsuan Cheng; Chan-Yen Tsai; Jou-Kou Wang; Tzu-Hao Lin ; Shih-Hau Fang; Li-Chin Chen; Yu Tsao | IEEE Journal of Biomedical and Health Informatics 24(11), 3203-3214 | |||
37 | 2020 | Unsupervised Representation Disentanglement using Cross Domain Features and Adversarial Learning in Variational Autoencoder based Voice Conversion | Huang, Wen-Chin; Luo, Hao; Hwang, Hsin-Te; Lo, Chen-Chou; Peng, Yu-Huai; Tsao, Yu ; Wang, Hsin-Min | IEEE Transactions on Emerging Topics in Computational Intelligence 4(4), 468-479 | |||
38 | 2020 | Multichannel Speech Enhancement by Raw Waveform-mapping using Fully Convolutional Networks | Chang-Le Liu; Sze-Wei Fu; You-Jin Li; Jen-Wei Huang; Hsin-Min Wang ; Yu Tsao | IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 1888-1900 | |||
39 | 2020 | A Study of Joint Effect on Denoising Techniques and Visual Cues to Improve Speech Intelligibility in Cochlear Implant Simulation | Tseng, Rung-Yu; Wang, Tao-Wei; Fu, Szu-Wei; Fu, Szu-Wei; Lee, Chia-Ying; Tsao, Yu | IEEE Transactions on Cognitive and Developmental Systems (Early Access) | |||
40 | 2020 | Time-Domain Multi-Modal Bone/Air Conducted Speech Enhancement | Yu, Cheng; Hung, Kuo-Hsuan; Wang, Syu-Siang; Tsao, Yu ; Hung, Jeih-weih | IEEE Signal Processing Letters 27, 1035-1039 | |||
41 | 2020 | Multimodal Deep Learning Framework for Image Popularity Prediction on Social Media | Abousaleh, Fatma S,; Cheng, Wen-Huang; Yu, Neng-Hao; Tsao, Yu | IEEE Transactions on Cognitive and Developmental Systems 13(3), 679-692 | |||
42 | 2020 | Dress With Style: Learning Style From Joint Deep Embedding of Clothing Styles and Body Shapes | Hidayati, Shintami Chusnul; Goh, Ting Wei; Chan, Ji-Sheng Gary; Hsu, Cheng-Chun; See, John; Wong, Lai-Kuan; Hua, Kai-Lung; Tsao, Yu ; Cheng, Wen-Huang | IEEE Transactions on Multimedia 23, 365-377 |