CoLLAT: On Adding Fine-grained Audio Understanding to Language Models Using Token-Level Locked-Language Tuning
NeurIPS 2023
Keywords
Audio Understanding, Contrastive Learning, Audio-Language Grounding