Publications

Results 1-69 of 69 (Search time: 0.001 seconds).

Issue DateTitleAuthor(s)RelationscopusWOSFulltext/Archive link
12022EMGSE: Acoustic/EMG Fusion for Multimodal Speech EnhancementKuan-Chen Wang; Kai-Chun Liu; Hsin-Min Wang ; Yu Tsao 
22022Partially Fake Audio Detection by Self-attention-based Fake Span DiscoveryHaibin Wu; Heng-Cheng Kuo; Naijun Zheng; Kuo-Hsuan Hung; Hung-yi Lee; Yu Tsao ; Hsin-Min Wang ; Helen Meng
32021Time Alignment Using Lip Images for Frame-Based Electrolaryngeal Voice ConversionYi-Syuan Liou; Wen-Chin Huang; Ming-Chi Yen; Shu-Wei Tsai; Yu-Huai Peng; Tomoki Toda; Yu Tsao ; Hsin-Min Wang 
42021HASA-NET: A Non-Intrusive Hearing-Aid Speech Assessment NetworkHsin-Tien Chiang; Yi-Chiao Wu; Cheng Yu; Tomoki Toda; Hsin-Min Wang ; Yih-Chun Hu; Yu Tsao 
52021Answering Chinese Elementary School Social Study Multiple Choice QuestionsChao-Chun Liang; Daniel Lee; Meng-Tse Wu; Hsin-Min Wang ; Keh-Yih Su International Journal of Computational Linguistics and Chinese Language Processing 26(2), 67-84
62021Generation of Speaker Representations Using Heterogeneous Training Batch AssemblyYu-Huai Peng; Hung-Shin Lee; Pin-Tuan Huang; Hsin-Min Wang 
72021Improvement of Spatial Ambiguity in Multi-Channel Speech Separation Using Channel AttentionQian-Bei Hong; Chung-Hsien Wu; Thanh Binh Nguyen; Hsin-Min Wang 
82021Mandarin Electrolaryngeal Speech Voice Conversion with Sequence-to-Sequence ModelingMing-Chi Yen; Wen-Chin Huang; Kazuhiro Kobayashi; Yu-Huai Peng; Shu-Wei Tsai; Yu Tsao ; Tomoki Toda; Jyh-Shing Jang; Hsin-Min Wang 
92021Investigation of a Single-Channel Frequency-Domain Speech Enhancement Network to Improve End-to-End Bengali Automatic Speech Recognition Under Unseen Noisy ConditionsMd Mahbub E Noor|; Yen-Ju Lu; Syu-Siang Wang; Supratip Ghose; Chia-Yu Chang; Ryandhimas E. Zezario; Shafique Ahmed; Wei-Ho Chung; Yu Tsao ; Hsin-Min Wang 
102021SurpriseNet: Melody Harmonization Conditioning on User-controlled Surprise ContoursYi-Wei Chen; Hung-Shin Lee; Yen-Hsing Chen; Hsin-Min Wang 
112021A Flexible and Extensible Framework for Multiple Answer Modes Question AnsweringCheng-Chung Fan; Keh-Yih Su ; Kuan-Yu Chen; Yu Tsao ; Jia-Zhi Guo; Shang-Bao Luo; Pei-Jun Liao; Kuang-Yu Chang; Chiao-Wei Hsu; Meng-Tse Wu; Shih-Hong Tsai; Tzu-Man Wu; Aleksandra Smolka; Chao-Chun Liang; Hsin-Min Wang 
122021Mining Commonsense and Domain Knowledge from Math Word ProblemsShih-Hung Tsai; Chao-Chun Liang; Hsin-Min Wang ; Keh-Yih Su 
132021Dual-Path Filter Network: Speaker-Aware Modeling for Speech SeparationFan-Lin Wang; Yu-Huai Peng; Hung-Shin Lee; Hsin-Min Wang 
142021A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice ConversionWen-Chin Huang; Kazuhiro Kobayashi; Yu-Huai Peng; Ching-Feng Liu; Yu Tsao ; Hsin-Min Wang ; Tomoki Toda
152021Relational Data Selection for Data Augmentation of Speaker-dependent Multi-band MelGAN VocoderYi-Chiao Wu; Cheng-Hung Hu; Hung-Shin Lee; Yu-Huai Peng; Wen-Chin Huang; Yu Tsao ; Hsin-Min Wang ; Tomoki Toda
162021AlloST: Low-resource Speech Translation without Source TranscriptionYao-Fei Cheng; Hung-Shin Lee; Hsin-Min Wang 
172021Speech Enhancement with Zero-Shot Model SelectionRyandhimas E. Zezario; Chiou-Shann Fuh; Hsin-Min Wang ; Yu Tsao 
182021Sequence to General Tree: Knowledge-Guided Geometry Word Problem SolvingShih-hung Tsai; Chao-Chun Liang; Hsin-Min Wang ; Keh-Yih Su 
192021Speech Recognition by Simply Fine-Tuning BERTWen-Chin Huang; Chia-Hua Wu; Shang-Bao Luo; Kuan-Yu Chen; Hsin-Min Wang ; Tomoki Toda
202021Melody Harmonization Using Orderless NADE, Chord Balancing, and Blocked Gibbs SamplingChung-En Sun; Yi-Wei Chen; Hung-Shin Lee; Yen-Hsing Chen; Hsin-Min Wang 
212021MoEVC: A Mixture of Experts Voice Conversion System With Sparse Gating Mechanism for Online Computation AccelerationYu-Tao Chang; Yuan-Hong Yang; Yu-Huai Peng; Syu-Siang Wang; Tai-Shih Chi; Yu Tsao; Hsin-Min Wang 
222021Learning To Visualize Music Through Shot Sequence For Automatic Concert Video MashupWen-Li Wei; Jen-Chun Lin; Tyng-Luh Liu ; Hsiao-Rong Tyan; Hsin-Min Wang ; Hong-Yuan Mark Liao IEEE Transactions on Multimedia 23, 1731-1743
232020Improving the Intelligibility of Speech for Simulated Electric and Acoustic Stimulation Using Fully Convolutional Neural NetworksWang, Natalie Yu-Hsien; Wang, Hsiao-Lan Sharon; Wang, Tao-Wei; Fu, Szu-Wei; Lu, Xugan; Wang, Hsin-Min ; Tsao, Yu IEEE Transactions on Neural Systems and Rehabilitation Engineering 29, 184-195
242020STOI-Net: A Deep Learning based Non-Intrusive Speech Intelligibility Assessment ModelRyandhimas E. Zezario; Szu-Wei Fu; Chiou-Shann Fuh; Yu Tsao; Hsin-Min Wang 
252020Subspace-based Representation and Learning for Phonotactic Spoken Language RecognitionLee, Hung-Shin; Tsao, Yu ; Jeng, Shyh-Kang; Wang, Hsin-Min IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 3065-3079
262020WaveCRN: An Efficient Convolutional Recurrent Neural Network for End-to-end Speech EnhancementTsun-An Hsieh; Hsin-Min Wang ; Xugang Lu; Yu Tsao IEEE Signal Processing Letters 27, 2149-2153
272020ASVspoof 2019: a large-scale public database of synthetized, converted and replayed speechXin Wang; Junichi Yamagishi; Massimiliano Todisco; Hector Delgado; Andreas Nautsch; Nicholas Evans; Md Sahidullah; Ville Vestman; Tomi Kinnunen; Kong Aik Lee; Lauri Juvela; Paavo Alku; Yu-Huai Peng; Hsin-Te Hwang; Yu Tsao; Hsin-Min Wang ; Sebastien Le Maguer; Markus Becker; Fergus Henderson; Rob Clark; Yu Zhang; Quan Wang; Ye Jia; Kai Onuma; Koji Mushika; Takashi Kaneda; Yuan Jiang; Li-Juan Liu; Yi-Chiao Wu; Wen-Chin Huang; Tomoki Toda; Kou Tanaka; Hirokazu Kameoka; Ingmar Steiner; Driss Matrouf; Jean-Francois Bonastre; Avashna Govender; Srikanth Ronanki; Jing-Xuan Zhang; Zhen-Hua LingComputer Speech and Language 64, 101114
282020Using Taigi Dramas with Mandarin Chinese Subtitles to Improve Taigi Speech RecognitionPin-Yuan Chen; Chia-Hua Wu; Hung-Shin Lee; Shao-Kang Tsao; Ming-Tat Ko ; Hsin-Min Wang 
292020Joint Training of Guided Learning and Mean Teacher Models for Sound Event DetectionHao Yen; Pin-Jui Ku; Ming-Chi Yen; Hung-Shin Lee; Hsin-Min Wang 
302020Speech Enhancement based on Denoising Autoencoder with Multi-branched EncodersCheng Yu; Ryandhimas E. Zezario; Syu-Siang Wang; Jonathan Sherman; Yi-Yen Hsieh; Xugang Lu; Hsin-Min Wang ; Yu Tsao IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 2756-2769
312020SERIL: Noise Adaptive Speech Enhancement using Regularization-based Incremental LearningChi-Chang Lee; Yu-Chen Lin; Hsuan-Tien Lin; Hsin-Min Wang ; Yu Tsao
322020The Academia Sinica Systems of Voice Conversion for VCC2020Yu-Huai Peng; Cheng-Hung Hu; Alexander Kang; Hung-Shin Lee; Pin-Yuan Chen; Yu Tsao; Hsin-Min Wang 
332020Lite Audio-Visual Speech EnhancementShang-Yi Chuang; Yu Tsao; Chen-Chou Lo; Hsin-Min Wang 
342020Unsupervised Representation Disentanglement using Cross Domain Features and Adversarial Learning in Variational Autoencoder based Voice ConversionHuang, Wen-Chin; Luo, Hao; Hwang, Hsin-Te; Lo, Chen-Chou; Peng, Yu-Huai; Tsao, Yu ; Wang, Hsin-Min IEEE Transactions on Emerging Topics in Computational Intelligence 4(4), 468-479
352020Combining Deep Embeddings of Acoustic and Articulatory Features for Speaker IdentificationQian-Bei Hong; Chung-Hsien Wu; Hsin-Min Wang ; Chien-Lin Huang
362020Statistics Pooling Time Delay Neural Network Based on X-Vector for Speaker VerificationQian-Bei Hong; Chung-Hsien Wu; Hsin-Min Wang ; Chien-Lin Huang
372020Self-Supervised Denoising Autoencoder with Linear Regression Decoder for Speech EnhancementRyandhimas Edo Zezario; Tassadaq Hussain; Xugang Lu; Hsin-Min Wang ; Yu Tsao
382020The Academia Sinica Systems of Speech Recognition and Speaker Diarization for the CHiME-6 ChallengeHung-Shin Lee; Yu-Huai Peng; Pin-Tuan Huang; Ying-Chun Tseng; Chia-Hua Wu; Yu Tsao; Hsin-Min Wang 
392020Multichannel Speech Enhancement by Raw Waveform-mapping using Fully Convolutional NetworksChang-Le Liu; Sze-Wei Fu; You-Jin Li; Jen-Wei Huang; Hsin-Min Wang ; Yu Tsao IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 1888-1900
402020Diachronous initiation of post-collisional magmatism in the Arabia-Eurasia collision zoneLin, Yu-Chin; Chung, Sun-Lin ; Bingöl, A. Feyzi; Yang, Liekun; Okrostsvaridze, Avtandil; Pang, Kwan-Nang ; Lee, Hao-Yang ; Lin, Te-HsienLithos 356-357, 105394
412019Spoken Multiple-Choice Question Answering Using Multimodal Convolutional Neural NetworksShang-Bao Luo; Hung-Shin Lee; Kuan-Yu Chen; Hsin-Min Wang 
422019Compressed Multimodel Hierarchical Extreme Learning Machine for Speech EnhancementTassadaq Hussain; Yu Tsao; Hsin-Min Wang ; Jia-Ching Wang; Sabato Marco Siniscalchi; Wen-Hung Liao
432019Investigation of Neural Network Approaches for Unified Spectral and Prosodic Feature EnhancementWei-Cheng Lin; Yu Tsao; Hsin-Min Wang ; Fei Chen
442019Sequential Speaker Embedding and Transfer Learning for Text-Independent Speaker IdentificationQian-Bei Hong; Chung-Hsien Wu; Ming-Hsiang Su; Hsin-Min Wang 
452019Multi-task Learning for Mandarin Acoustic Modeling Using Articulatory AttributesYueh-Ting Lee; Xuan-Bo Chen; Hung-Shin Lee; Jyh-Shing Roger Jang; Hsin-Min Wang 
462019Improving Automatic Jazz Melody Generation by Transfer Learning TechniquesHsiao-Tzu Hung; Chung-Yang Wang; Yi-Hsuan Yang ; Hsin-Min Wang 
472019Mandarin Mispronunciation Detection and Diagnosis Feedback Using Articulatory Attributes Based Multi-task LearningXuan-Bo Chen; Yueh-Ting Lee; Hung-Shin Lee; Jyh-Shing Roger Jang; Hsin-Min Wang 
482019Influences of Prosodic Feature Replacement on the Perceived Singing Voice IdentityKuan-Yi Kang; Yi-Wen Liu; Hsin-Min Wang 
492019Audio-Visual Speech Enhancement using Hierarchical Extreme Learning MachineTassadaq Hussain; Yu Tsao; Hsin-Min Wang ; Jia-Ching Wang; Sabato Marco Siniscalchi; Wen-Hung Liao
502019Specialized Speech Enhancement Model Selection Based on Learned Non-Intrusive Quality Assessment MetricRyandhimas Edo Zezario; Szu-wei Fu; Xugang Lu; Hsin-Min Wang ; Yu Tsao
512019Investigation of F0 conditioning and Fully Convolutional Networks in Variational Autoencoder based Voice ConversionWen-Chin Huang; Yi-Chiao Wu; Chen-Chou Lo; Patrick Lumban Tobing; Tomoki Hayashi; Kazuhiro Kobayashi; Tomoki Toda; Yu Tsao; Hsin-Min Wang 
522019Exploring the Encoder Layers of Discriminative Autoencoders for LVCSRPin-Tuan Huang; Hung-Shin Lee; Syu-Siang Wang; Kuan-Yu Chen; Yu Tsao; Hsin-Min Wang 
532019MOSNet: Deep Learning based Objective Assessment for Voice ConversionChen-Chou Lo; Szu-Wei Fu; Wen-Chin Huang; Xin Wang; Junichi Yamagishi; Yu Tsao; Hsin-Min Wang 
542019Noise Adaptive Speech Enhancement using Domain Adversarial TrainingChien-Feng Liao; Yu Tsao; Hung-Yi Lee; Hsin-Min Wang 
552019Generalization of Spectrum Differential based Direct Waveform Modification for Voice ConversionWen-Chin Huang; Yi-Chiao Wu; Kazuhiro Kobayashi; Yu-Huai Peng; Hsin-Te Hwang; Patrick Lumban Tobing; Yu Tsao; Hsin-Min Wang ; Tomoki Toda
562019Reinforcement Learning based Speech Enhancement for Robust Speech RecognitionYih-Liang Shen; Chao-Yuan Huang; Syu-Siang Wang; Yu Tsao; Hsin-Min Wang ; Tai-Shih Chi
572019Bone-conducted Speech Enhancement using Hierarchical Extreme Learning MachineTassadaq Hussain; Yu Tsao; Sabato Marco Siniscalchi; Jia-Ching Wang; Hsin-Min Wang ; Wen-Hung Liao
582018SeeTheVoice: Learning from Music to Visual Storytelling of ShotsWen-Li Wei; Jen-Chun Lin; Tyng-Luh Liu ; Yi-Hsuan Yang ; Hsin-Min Wang ; Hsiao-Rong Tyan; Hong-Yuan Mark Liao 
592017Voice Conversion from Unaligned Corpora Using Variational Autoencoding Wasserstein Generative Adversarial NetworksChin-Cheng Hsu; Hsin-Te Hwang; Yi-Chiao Wu; Yu Tsao ; Hsin-Min Wang 
602017Discriminative Autoencoders for Acoustic ModelingMing-Han Yang; Hung-Shin Lee; Yu-Ding Lu; Kuan-Yu Chen; Yu Tsao ; Berlin Chen; Hsin-Min Wang 
612017A Post-filtering Approach Based on Locally Linear Embedding Difference Compensation for Speech EnhancementYi-Chiao Wu; Hsin-Te Hwang; Syu-Siang Wang; Chin-Cheng Hsu; Yu Tsao ; Hsin-Min Wang 
622017Wavelet Speech Enhancement Based on Robust Principal Component AnalysisChia-Lung Wu; Hsiang-Ping Hsu; Syu-Siang Wang; Jeih-Weih Hung; Ying-Hui Lai; Hsin-Min Wang ; Yu Tsao 
632017Discriminative Autoencoders for Speaker VerificationHung-Shin Lee; Yu-Ding Lu; Chin-Cheng Hsu; Yu Tsao ; Hsin-Min Wang ; Shyh-Kang Jeng
642017A Locally Linear Embbeding Based Postfiltering Approach for Speech EnhancementYi-Chiao Wu; Hsin-Te Hwang; Syu-Siang Wang; Chin-Cheng Hsu; Ying-Hui Lai; Yu Tsao ; Hsin-Min Wang 
652016Audio-Visual Speech Enhancement using Deep Neural NetworksJen-Cheng Hou; Syu-Siang Wang; Ying-Hui Lai; Jen-Chun Lin; Yu Tsao ; Hsiu-Wen Chang; Hsin-Min Wang 
662016Voice Conversion from Non-parallel Corpora Using Variational Auto-encoderChin-Cheng Hsu; Hsin-Te Hwang; Yi-Chiao Wu; Yu Tsao ; Hsin-Min Wang 
672016Locally Linear Embedding for Exemplar-Based Spectral ConversionYi-Chiao Wu; Hsin-Te Hwang; Chin-Cheng Hsu; Yu Tsao ; Hsin-Min Wang 
682016Minimization of Regression and Ranking Losses with Shallow Neural Networks on Automatic Sincerity EvaluationHung-Shin Lee; Yu Tsao ; Chi-Chun Lee; Hsin-Min Wang ; Wei-Cheng Lin; Wei-Chen Chen; Shan-Wen Hsiao; Shyh-Kang Jeng
692001Comparative Analysis for Data-Driven Temporal Filters Obtained via Principal Component AnalysisHung, Jeih-weih; Wang, Hsin-min ; Lee, Lin-shan