Qwen-Audio: Advancing Universal Audio Understanding Via Unified Large-Scale Audio-Language Models
arXiv (Cornell University)(2023)
Key words
Audio Event Detection,Acoustic Modeling,Audio-Visual Speech Recognition,Speech Enhancement,Environmental Sound Recognition
AI Read Science
Must-Reading Tree
Example

Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined