AgentHarm: A Benchmark for Measuring Harmfulness of LLM Agents
Maksym Andriushchenko, Alexandra Souly, Mateusz Dziemian, Derek Duenas, Maxwell Lin, Justin Wang, Dan Hendrycks, Andy Zou, J Kolter, Matt Fredrikson, Yarin Gal, Xander Davies
ICLR 2025