Towards Best Practices for Open Datasets for LLM Training

Stefan Baack, Stella Biderman, Kasia Odrozek, Aviya Skowron, Ayah Bdeir, Jillian Bommarito, Jennifer Ding, Maximilian Gahntz, Paul Keller, Pierre-Carl Langlais, Greg Lindahl, Sebastian Majstorovic, Nik Marda, Guilherme Penedo, Maarten Van Segbroeck, Jennifer Wang, Leandro von Werra, Mitchell Baker, Julie Belião, Kasia Chmielinski, Marzieh Fadaee, Lisa Gutermuth, Hynek Kydlíček, Greg Leppert, EM Lewis-Jong, Solana Larsen, Shayne Longpre, Angela Oduor Lungati, Cullen Miller, Victor Miller, Max Ryabinin, Kathleen Siminyu, Andrew Strait, Mark Surman, Anna Tumadóttir, Maurice Weber, Rebecca Weiss, Lee White, Thomas Wolf

CoRR (2025)
