CLIPSonic: Text-to-Audio Synthesis with Unlabeled Videos and Pretrained Language-Vision Models

2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) (2023)

Citations: 4 | Views: 30
Keywords
Sound synthesis, audio generation, multimodal learning, diffusion models, neural networks, machine learning