Text2Reward: Automated Dense Reward Function Generation for Reinforcement LearningTianbao Xie,Siheng Zhao,Chen Wu,Yitao Liu, Qing Luo,Victor Zhong, Y.B. Yang,Yu ChenarXiv (Cornell University)(2023)引用 5|浏览43关键词Reinforcement Learning,RefactoringAI 理解论文溯源树样例生成溯源树,研究论文发展脉络Chat Paper正在生成论文摘要