Publications

Results 1-87 of 87 (Search time: 0.004 seconds).

Issue DateTitleAuthor(s)RelationscopusWOSFulltext/Archive link
12023Generalization Ability Improvement of Speaker Representation and Anti-Interference for Speaker VerificationQian-Bei Hong; Chung-Hsien Wu; Hsin-Min Wang IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 486-499
22023Deep Learning-based Non-Intrusive Multi-Objective Speech Assessment Model with Cross-Domain FeaturesRyandhimas E. Zezario; Szu-Wei Fu; Fei Chen; Chiou-Shann Fuh; Hsin-Min Wang ; Yu Tsao IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 54-70
32022Speech-enhanced and Noise-aware Networks for Robust Speech RecognitionHung-Shin Lee; Pin-Yuan Chen; Yao-Fei Cheng; Yu Tsao ; Hsin-Min Wang 
42022Aligning Sentences in a Paragraph-Paraphrased Corpus with New Embedding-based Similarity MeasuresAleksandra Smolka; Hsin-Min Wang ; Jason S. Chang; Keh-Yih Su International Journal of Computational Linguistics and Chinese Language Processing(中文計算語言學期刊) 27(2), 1-30
52022Multimodal Forgery Detection Using Ensemble LearningAmmarah Hashmi; Sahibzada Adil Shahzad; Chia-Wen Lin; Yu Tsao; Hsin-Min Wang 
62022Lip Sync Matters: A Novel Multimodal Forgery DetectorSahibzada Adil Shahzad; Ammarah Hashmi; Sarwar Khan; Yan-Tsung Peng; Yu Tsao; Hsin-Min Wang 
72022Mandarin Singing Voice Synthesis with Denoising Diffusion Probabilistic Wasserstein GANYin-Ping Cho; Yu Tsao; Hsin-Min Wang ; Yi-Wen Liu
82022Speech Enhancement-Assisted Voice Conversion in Noisy EnvironmentsYun-Ju Chan; Chiang-Jen Peng; Syu-Siang Wang; Hsin-Min Wang ; Yu Tsao; Tai-Shih Chi
92022Chinese Movie Dialogue Question Answering DatasetShang-Bao Luo; Hsin-Min Wang ; Kuan-Yu Chen; Keh-Yih Su ; Yu Tsao ; Cheng-Chung Fan
102022Detecting Replay Attacks Using Single-Channel Audio: The Temporal Autocorrelation of SpeechShih-Kuang Lee; Yu Tsao; Hsin-Min Wang 
112022MTI-Net: A Multi-Target Speech Intelligibility Prediction ModelRyandhimas Edo Zezario; Szu-wei Fu; Fei Chen; Chiou-Shann Fuh; Hsin-Min Wang ; Yu Tsao
122022Chain-based Discriminative Autoencoders for Speech RecognitionHung-Shin Lee; Pin-Tuan Huang; Yao-Fei Cheng; Hsin-Min Wang 
132022Disentangling the Impacts of Language and Channel Variability on Speech Separation NetworksFan-Lin Wang; Hung-Shin Lee; Yu Tsao; Hsin-Min Wang 
142022MBI-Net: A Non-Intrusive Multi-Branched Speech Intelligibility Prediction Model for Hearing AidsRyandhimas Edo Zezario; Fei Chen; Chiou-Shann Fuh; Hsin-Min Wang ; Yu Tsao
152022The VoiceMOS Challenge 2022Wen Chin Huang; Erica Cooper; Yu Tsao; Hsin-Min Wang ; Tomoki Toda; Junichi Yamagishi
162022Filter-based Discriminative Autoencoders for Children Speech RecognitionChiang-Lin Tai; Hung-Shin Lee; Yu Tsao ; Hsin-Min Wang 
172022Partially Fake Audio Detection by Self-attention-based Fake Span DiscoveryHaibin Wu; Heng-Cheng Kuo; Naijun Zheng; Kuo-Hsuan Hung; Hung-yi Lee; Yu Tsao ; Hsin-Min Wang ; Helen Meng
182022EMGSE: Acoustic/EMG Fusion for Multimodal Speech EnhancementKuan-Chen Wang; Kai-Chun Liu; Hsin-Min Wang ; Yu Tsao 
192022Improved Lite Audio-Visual Speech EnhancementShang-Yi Chuang; Hsin-Min Wang ; Yu Tsao IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 1345-1359
202022SVSNet: An End-to-end Speaker Voice Similarity Assessment ModelCheng-Hung Hu; Yu-Huai Peng; Junichi Yamagishi; Yu Tsao ; Hsin-Min Wang IEEE Signal Processing Letters 29, 767-771
212021Time Alignment Using Lip Images for Frame-Based Electrolaryngeal Voice ConversionYi-Syuan Liou; Wen-Chin Huang; Ming-Chi Yen; Shu-Wei Tsai; Yu-Huai Peng; Tomoki Toda; Yu Tsao ; Hsin-Min Wang 
222021HASA-NET: A Non-Intrusive Hearing-Aid Speech Assessment NetworkHsin-Tien Chiang; Yi-Chiao Wu; Cheng Yu; Tomoki Toda; Hsin-Min Wang ; Yih-Chun Hu; Yu Tsao 
232021Generation of Speaker Representations Using Heterogeneous Training Batch AssemblyYu-Huai Peng; Hung-Shin Lee; Pin-Tuan Huang; Hsin-Min Wang 
242021Mandarin Electrolaryngeal Speech Voice Conversion with Sequence-to-Sequence ModelingMing-Chi Yen; Wen-Chin Huang; Kazuhiro Kobayashi; Yu-Huai Peng; Shu-Wei Tsai; Yu Tsao ; Tomoki Toda; Jyh-Shing Jang; Hsin-Min Wang 
252021Improvement of Spatial Ambiguity in Multi-Channel Speech Separation Using Channel AttentionQian-Bei Hong; Chung-Hsien Wu; Thanh Binh Nguyen; Hsin-Min Wang 
262021Answering Chinese Elementary School Social Study Multiple Choice QuestionsChao-Chun Liang; Daniel Lee; Meng-Tse Wu; Hsin-Min Wang ; Keh-Yih Su International Journal of Computational Linguistics and Chinese Language Processing 26(2), 67-84
272021Investigation of a Single-Channel Frequency-Domain Speech Enhancement Network to Improve End-to-End Bengali Automatic Speech Recognition Under Unseen Noisy ConditionsMd Mahbub E Noor|; Yen-Ju Lu; Syu-Siang Wang; Supratip Ghose; Chia-Yu Chang; Ryandhimas E. Zezario; Shafique Ahmed; Wei-Ho Chung; Yu Tsao ; Hsin-Min Wang 
282021SurpriseNet: Melody Harmonization Conditioning on User-controlled Surprise ContoursYi-Wei Chen; Hung-Shin Lee; Yen-Hsing Chen; Hsin-Min Wang 
292021A Flexible and Extensible Framework for Multiple Answer Modes Question AnsweringCheng-Chung Fan; Keh-Yih Su ; Kuan-Yu Chen; Yu Tsao ; Jia-Zhi Guo; Shang-Bao Luo; Pei-Jun Liao; Kuang-Yu Chang; Chiao-Wei Hsu; Meng-Tse Wu; Shih-Hong Tsai; Tzu-Man Wu; Aleksandra Smolka; Chao-Chun Liang; Hsin-Min Wang 
302021Mining Commonsense and Domain Knowledge from Math Word ProblemsShih-Hung Tsai; Chao-Chun Liang; Hsin-Min Wang ; Keh-Yih Su 
312021AlloST: Low-resource Speech Translation without Source TranscriptionYao-Fei Cheng; Hung-Shin Lee; Hsin-Min Wang 
322021Relational Data Selection for Data Augmentation of Speaker-dependent Multi-band MelGAN VocoderYi-Chiao Wu; Cheng-Hung Hu; Hung-Shin Lee; Yu-Huai Peng; Wen-Chin Huang; Yu Tsao ; Hsin-Min Wang ; Tomoki Toda
332021Dual-Path Filter Network: Speaker-Aware Modeling for Speech SeparationFan-Lin Wang; Yu-Huai Peng; Hung-Shin Lee; Hsin-Min Wang 
342021A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice ConversionWen-Chin Huang; Kazuhiro Kobayashi; Yu-Huai Peng; Ching-Feng Liu; Yu Tsao ; Hsin-Min Wang ; Tomoki Toda
352021Speech Enhancement with Zero-Shot Model SelectionRyandhimas E. Zezario; Chiou-Shann Fuh; Hsin-Min Wang ; Yu Tsao 
362021Sequence to General Tree: Knowledge-Guided Geometry Word Problem SolvingShih-hung Tsai; Chao-Chun Liang; Hsin-Min Wang ; Keh-Yih Su 
372021Speech Recognition by Simply Fine-Tuning BERTWen-Chin Huang; Chia-Hua Wu; Shang-Bao Luo; Kuan-Yu Chen; Hsin-Min Wang ; Tomoki Toda
382021Melody Harmonization Using Orderless NADE, Chord Balancing, and Blocked Gibbs SamplingChung-En Sun; Yi-Wei Chen; Hung-Shin Lee; Yen-Hsing Chen; Hsin-Min Wang 
392021MoEVC: A Mixture of Experts Voice Conversion System With Sparse Gating Mechanism for Online Computation AccelerationY.-T. Chang; Y.-H. Yang; Y.-H. Peng; S.-S. Wang; T.-S. Chi; Y. Tsao ; H.-M. Wang 
402021Learning To Visualize Music Through Shot Sequence For Automatic Concert Video MashupWen-Li Wei; Jen-Chun Lin; Tyng-Luh Liu ; Hsiao-Rong Tyan; Hsin-Min Wang ; Hong-Yuan Mark Liao IEEE Transactions on Multimedia 23, 1731-1743
412020Improving the Intelligibility of Speech for Simulated Electric and Acoustic Stimulation Using Fully Convolutional Neural NetworksWang, Natalie Yu-Hsien; Wang, Hsiao-Lan Sharon; Wang, Tao-Wei; Fu, Szu-Wei; Lu, Xugan; Wang, Hsin-Min ; Tsao, Yu IEEE Transactions on Neural Systems and Rehabilitation Engineering 29, 184-195
422020STOI-Net: A Deep Learning based Non-Intrusive Speech Intelligibility Assessment ModelRyandhimas E. Zezario; Szu-Wei Fu; Chiou-Shann Fuh; Yu Tsao; Hsin-Min Wang 
432020Subspace-based Representation and Learning for Phonotactic Spoken Language RecognitionLee, Hung-Shin; Tsao, Yu ; Jeng, Shyh-Kang; Wang, Hsin-Min IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 3065-3079
442020ASVspoof 2019: a large-scale public database of synthetized, converted and replayed speechXin Wang; Junichi Yamagishi; Massimiliano Todisco; Hector Delgado; Andreas Nautsch; Nicholas Evans; Md Sahidullah; Ville Vestman; Tomi Kinnunen; Kong Aik Lee; Lauri Juvela; Paavo Alku; Yu-Huai Peng; Hsin-Te Hwang; Yu Tsao; Hsin-Min Wang ; Sebastien Le Maguer; Markus Becker; Fergus Henderson; Rob Clark; Yu Zhang; Quan Wang; Ye Jia; Kai Onuma; Koji Mushika; Takashi Kaneda; Yuan Jiang; Li-Juan Liu; Yi-Chiao Wu; Wen-Chin Huang; Tomoki Toda; Kou Tanaka; Hirokazu Kameoka; Ingmar Steiner; Driss Matrouf; Jean-Francois Bonastre; Avashna Govender; Srikanth Ronanki; Jing-Xuan Zhang; Zhen-Hua LingComputer Speech and Language 64, 101114
452020Joint Training of Guided Learning and Mean Teacher Models for Sound Event DetectionHao Yen; Pin-Jui Ku; Ming-Chi Yen; Hung-Shin Lee; Hsin-Min Wang 
462020Using Taigi Dramas with Mandarin Chinese Subtitles to Improve Taigi Speech RecognitionPin-Yuan Chen; Chia-Hua Wu; Hung-Shin Lee; Shao-Kang Tsao; Ming-Tat Ko ; Hsin-Min Wang 
472020WaveCRN: An Efficient Convolutional Recurrent Neural Network for End-to-end Speech EnhancementTsun-An Hsieh; Hsin-Min Wang ; Xugang Lu; Yu Tsao IEEE Signal Processing Letters 27, 2149-2153
482020SERIL: Noise Adaptive Speech Enhancement using Regularization-based Incremental LearningChi-Chang Lee; Yu-Chen Lin; Hsuan-Tien Lin; Hsin-Min Wang ; Yu Tsao
492020Speech Enhancement based on Denoising Autoencoder with Multi-branched EncodersCheng Yu; Ryandhimas E. Zezario; Syu-Siang Wang; Jonathan Sherman; Yi-Yen Hsieh; Xugang Lu; Hsin-Min Wang ; Yu Tsao IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 2756-2769
502020The Academia Sinica Systems of Voice Conversion for VCC2020Yu-Huai Peng; Cheng-Hung Hu; Alexander Kang; Hung-Shin Lee; Pin-Yuan Chen; Yu Tsao; Hsin-Min Wang 
512020Lite Audio-Visual Speech EnhancementShang-Yi Chuang; Yu Tsao; Chen-Chou Lo; Hsin-Min Wang 
522020Unsupervised Representation Disentanglement using Cross Domain Features and Adversarial Learning in Variational Autoencoder based Voice ConversionHuang, Wen-Chin; Luo, Hao; Hwang, Hsin-Te; Lo, Chen-Chou; Peng, Yu-Huai; Tsao, Yu ; Wang, Hsin-Min IEEE Transactions on Emerging Topics in Computational Intelligence 4(4), 468-479
532020Combining Deep Embeddings of Acoustic and Articulatory Features for Speaker IdentificationQian-Bei Hong; Chung-Hsien Wu; Hsin-Min Wang ; Chien-Lin Huang
542020Self-Supervised Denoising Autoencoder with Linear Regression Decoder for Speech EnhancementRyandhimas Edo Zezario; Tassadaq Hussain; Xugang Lu; Hsin-Min Wang ; Yu Tsao
552020Statistics Pooling Time Delay Neural Network Based on X-Vector for Speaker VerificationQian-Bei Hong; Chung-Hsien Wu; Hsin-Min Wang ; Chien-Lin Huang
562020The Academia Sinica Systems of Speech Recognition and Speaker Diarization for the CHiME-6 ChallengeHung-Shin Lee; Yu-Huai Peng; Pin-Tuan Huang; Ying-Chun Tseng; Chia-Hua Wu; Yu Tsao; Hsin-Min Wang 
572020Multichannel Speech Enhancement by Raw Waveform-mapping using Fully Convolutional NetworksChang-Le Liu; Sze-Wei Fu; You-Jin Li; Jen-Wei Huang; Hsin-Min Wang ; Yu Tsao IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 1888-1900
582020Diachronous initiation of post-collisional magmatism in the Arabia-Eurasia collision zoneLin, Yu-Chin; Chung, Sun-Lin ; Bingöl, A. Feyzi; Yang, Liekun; Okrostsvaridze, Avtandil; Pang, Kwan-Nang ; Lee, Hao-Yang ; Lin, Te-HsienLithos 356-357, 105394
592019Spoken Multiple-Choice Question Answering Using Multimodal Convolutional Neural NetworksShang-Bao Luo; Hung-Shin Lee; Kuan-Yu Chen; Hsin-Min Wang 
602019Investigation of Neural Network Approaches for Unified Spectral and Prosodic Feature EnhancementWei-Cheng Lin; Yu Tsao; Hsin-Min Wang ; Fei Chen
612019Compressed Multimodel Hierarchical Extreme Learning Machine for Speech EnhancementTassadaq Hussain; Yu Tsao; Hsin-Min Wang ; Jia-Ching Wang; Sabato Marco Siniscalchi; Wen-Hung Liao
622019Multi-task Learning for Mandarin Acoustic Modeling Using Articulatory AttributesYueh-Ting Lee; Xuan-Bo Chen; Hung-Shin Lee; Jyh-Shing Roger Jang; Hsin-Min Wang 
632019Improving Automatic Jazz Melody Generation by Transfer Learning TechniquesHsiao-Tzu Hung; Chung-Yang Wang; Yi-Hsuan Yang ; Hsin-Min Wang 
642019Sequential Speaker Embedding and Transfer Learning for Text-Independent Speaker IdentificationQian-Bei Hong; Chung-Hsien Wu; Ming-Hsiang Su; Hsin-Min Wang 
652019Influences of Prosodic Feature Replacement on the Perceived Singing Voice IdentityKuan-Yi Kang; Yi-Wen Liu; Hsin-Min Wang 
662019Mandarin Mispronunciation Detection and Diagnosis Feedback Using Articulatory Attributes Based Multi-task LearningXuan-Bo Chen; Yueh-Ting Lee; Hung-Shin Lee; Jyh-Shing Roger Jang; Hsin-Min Wang 
672019MOSNet: Deep Learning based Objective Assessment for Voice ConversionChen-Chou Lo; Szu-Wei Fu; Wen-Chin Huang; Xin Wang; Junichi Yamagishi; Yu Tsao; Hsin-Min Wang 
682019Noise Adaptive Speech Enhancement using Domain Adversarial TrainingChien-Feng Liao; Yu Tsao; Hung-Yi Lee; Hsin-Min Wang 
692019Audio-Visual Speech Enhancement using Hierarchical Extreme Learning MachineTassadaq Hussain; Yu Tsao; Hsin-Min Wang ; Jia-Ching Wang; Sabato Marco Siniscalchi; Wen-Hung Liao
702019Specialized Speech Enhancement Model Selection Based on Learned Non-Intrusive Quality Assessment MetricRyandhimas Edo Zezario; Szu-wei Fu; Xugang Lu; Hsin-Min Wang ; Yu Tsao
712019Investigation of F0 conditioning and Fully Convolutional Networks in Variational Autoencoder based Voice ConversionWen-Chin Huang; Yi-Chiao Wu; Chen-Chou Lo; Patrick Lumban Tobing; Tomoki Hayashi; Kazuhiro Kobayashi; Tomoki Toda; Yu Tsao; Hsin-Min Wang 
722019Exploring the Encoder Layers of Discriminative Autoencoders for LVCSRPin-Tuan Huang; Hung-Shin Lee; Syu-Siang Wang; Kuan-Yu Chen; Yu Tsao; Hsin-Min Wang 
732019Generalization of Spectrum Differential based Direct Waveform Modification for Voice ConversionWen-Chin Huang; Yi-Chiao Wu; Kazuhiro Kobayashi; Yu-Huai Peng; Hsin-Te Hwang; Patrick Lumban Tobing; Yu Tsao; Hsin-Min Wang ; Tomoki Toda
742019Reinforcement Learning based Speech Enhancement for Robust Speech RecognitionYih-Liang Shen; Chao-Yuan Huang; Syu-Siang Wang; Yu Tsao; Hsin-Min Wang ; Tai-Shih Chi
752019Bone-conducted Speech Enhancement using Hierarchical Extreme Learning MachineTassadaq Hussain; Yu Tsao; Sabato Marco Siniscalchi; Jia-Ching Wang; Hsin-Min Wang ; Wen-Hung Liao
762018SeeTheVoice: Learning from Music to Visual Storytelling of ShotsWen-Li Wei; Jen-Chun Lin; Tyng-Luh Liu ; Yi-Hsuan Yang ; Hsin-Min Wang ; Hsiao-Rong Tyan; Hong-Yuan Mark Liao 
772017Discriminative Autoencoders for Acoustic ModelingMing-Han Yang; Hung-Shin Lee; Yu-Ding Lu; Kuan-Yu Chen; Yu Tsao ; Berlin Chen; Hsin-Min Wang 
782017A Post-filtering Approach Based on Locally Linear Embedding Difference Compensation for Speech EnhancementYi-Chiao Wu; Hsin-Te Hwang; Syu-Siang Wang; Chin-Cheng Hsu; Yu Tsao ; Hsin-Min Wang 
792017Voice Conversion from Unaligned Corpora Using Variational Autoencoding Wasserstein Generative Adversarial NetworksChin-Cheng Hsu; Hsin-Te Hwang; Yi-Chiao Wu; Yu Tsao ; Hsin-Min Wang 
802017Wavelet Speech Enhancement Based on Robust Principal Component AnalysisChia-Lung Wu; Hsiang-Ping Hsu; Syu-Siang Wang; Jeih-Weih Hung; Ying-Hui Lai; Hsin-Min Wang ; Yu Tsao 
812017Discriminative Autoencoders for Speaker VerificationHung-Shin Lee; Yu-Ding Lu; Chin-Cheng Hsu; Yu Tsao ; Hsin-Min Wang ; Shyh-Kang Jeng
822017A Locally Linear Embbeding Based Postfiltering Approach for Speech EnhancementYi-Chiao Wu; Hsin-Te Hwang; Syu-Siang Wang; Chin-Cheng Hsu; Ying-Hui Lai; Yu Tsao ; Hsin-Min Wang 
832016Audio-Visual Speech Enhancement using Deep Neural NetworksJen-Cheng Hou; Syu-Siang Wang; Ying-Hui Lai; Jen-Chun Lin; Yu Tsao ; Hsiu-Wen Chang; Hsin-Min Wang 
842016Voice Conversion from Non-parallel Corpora Using Variational Auto-encoderChin-Cheng Hsu; Hsin-Te Hwang; Yi-Chiao Wu; Yu Tsao ; Hsin-Min Wang 
852016Minimization of Regression and Ranking Losses with Shallow Neural Networks on Automatic Sincerity EvaluationHung-Shin Lee; Yu Tsao ; Chi-Chun Lee; Hsin-Min Wang ; Wei-Cheng Lin; Wei-Chen Chen; Shan-Wen Hsiao; Shyh-Kang Jeng
862016Locally Linear Embedding for Exemplar-Based Spectral ConversionYi-Chiao Wu; Hsin-Te Hwang; Chin-Cheng Hsu; Yu Tsao ; Hsin-Min Wang 
872001Comparative Analysis for Data-Driven Temporal Filters Obtained via Principal Component AnalysisHung, Jeih-weih; Wang, Hsin-min ; Lee, Lin-shan