IV-Bench: A Benchmark for Image-Grounded Video Perception and Reasoning in Multimodal LLMs
David Ma,Yuanxing Zhang, Jincheng Ren, Jarvis Guo, Yifan Yao, Zhenlin Wei, Zhenzhu Yang, Zhongyuan Peng, Boyu Feng, Jun Ma, Xiao Gu, Zhoufutu Wen, King Zhu, Yancheng He,Meng Cao,Shiwen Ni,Jiaheng Liu,Wenhao Huang, Ge Zhang,Xiaojie Jin arxiv(2025)
AI 理解论文
溯源树
样例
