OneEval: Benchmarking LLM Knowledge-intensive Reasoning over Diverse Knowledge Bases
Yongrui Chen, Zhiqiang Liu, Jing Yu, Lin Ren,Nan Hu, Xinbang Dai,Jiajun Liu, Jiazhen Kang, Shenyu Zhang, Xinda Wang, Keyan Ding, Pengfei Shen, Haolei Zhu, Hongjie Deng, Yisong Wang,Tongtong Wu, Sheng Bi,Wen Zhang,Tianxing Wu,Qiu Ji,Haofen Wang,Wenliang Chen,Huajun Chen,Guilin Qi arxiv(2025)
AI 理解论文
溯源树
样例
