BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions
Terry Yue Zhuo,Minh Chien Vu,Jenny Chim,Han Hu,Wenhao Yu,Ratnadira Widyasari,Imam Nur Bani Yusuf,Haolan Zhan,Junda He,Indraneil Paul, Simon Brunner, Chen GONG,James Hoang,Armel Zebaze, Xiaoheng Hong,Wen-Ding Li,Jean Kaddour, Ming Xu,Zhihan Zhang,Prateek Yadav,Naman Jain,Alex Gu,Zhoujun Cheng,Jiawei Liu,Qian Liu,Zijian Wang,David Lo,Binyuan Hui,Niklas Muennighoff,Daniel Fried,Xiaoning Du,Harm de Vries,Leandro Von Werra ICLR 2025(2025)
AI 理解论文
溯源树
样例
