Qwen-Audio: Advancing Universal Audio Understanding Via Unified Large-Scale Audio-Language Models

arXiv (Cornell University)（2023）

Cited 313|Views2202

Key words

Audio Event Detection,Acoustic Modeling,Audio-Visual Speech Recognition,Speech Enhancement,Environmental Sound Recognition

AI Read Science

Must-Reading Tree

Example

Generate MRT to find the research sequence of this paper

Chat Paper

Summary is being generated by the instructions you defined