General Scales Unlock AI Evaluation with Explanatory and Predictive Power
Lexin Zhou, Lorenzo Pacchiardi, Fernando Martínez-Plumed, Katherine M. Collins, Yael Moros-Daval, Seraphina Zhang, Qinlin Zhao, Yitian Huang, Luning Sun, Jonathan E. Prunty, Zongqian Li, Pablo Sánchez-García, Kexin Jiang Chen, Pablo A. M. Casares, Jiyun Zu, John Burden, Behzad Mehrbakhsh, David Stillwell, Manuel Cebrian, Jindong Wang, Peter Henderson, Sherry Tongshuang Wu, Patrick C. Kyllonen, Lucy Cheke, Xing Xie, José Hernández-Orallo
CoRR (2025)
