Title: | SpeechCLIP+: Self-supervised multi-task representation learning for speech via CLIP and speech-image data |
Authors: | Hsuan-Fu Wang, Yi-Jen Shih, Heng-Jui Chang, Layne Berry, Puyuan Peng, Hung-yi Lee, Hsin-Min Wang, David Harwath |
Issue Date: | 2024-04-14 |
Conference: | IEEE ICASSP 2024 Workshop: Self-supervision in Audio, Speech and Beyond |
URI: | http://ir.sinica.edu.tw/handle/201000000A/90263 |
Appears in Collections: | Institute of Information Science (資訊科學研究所) |