Skip navigation
  • 中文
  • English

DSpace CRIS

  • DSpace logo
  • Home
  • Organizations
  • Researchers
  • Research Outputs
  • Projects
  • Explore by
    • Organizations
    • Researchers
    • Research Outputs
    • Projects
  • Academic & Publications
  • Sign in
  • 中文
  • English
  1. Scholars Hub of the Academia Sinica
  2. 數理科學組
  3. 資訊科學研究所
Please use this identifier to cite or link to this item: http://ir.sinica.edu.tw/handle/201000000A/90263
Title: SpeechCLIP : Self-supervised multi-task representation learning for speech via CLIP and speech-image data
Authors: Hsuan-Fu Wang
Yi-Jen Shih
Heng-Jui Chang
Layne Berry
Puyuan Peng
Hung-yi Lee
Hsin-Min Wang 
David Harwath
Issue Date: 2024-04-14
Conference: IEEE ICASSP 2024 Workshop: Self-supervision in Audio, Speech and Beyond
URI: http://ir.sinica.edu.tw/handle/201000000A/90263
Appears in Collections:資訊科學研究所

Show full item record

Page view(s)

10
checked on May 21, 2025

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Explore by
  • Academic & Publications
  • Organizations
  • Researchers
  • Research Outputs
  • Projects
Build with DSpace-CRIS - Extension maintained and optimized by Logo 4SCIENCE Feedback