Adversarial Preference Learning for Robust LLM Alignment

Yuanfu Wang, Pengyu Wang, Chenyang Xi, Bo Tang, Junyi Zhu, Wenqiang Wei, Chen, Chao Yang, Jingfeng Zhang, Chaochao Lu, Yijun Niu, Keming Mao, Zhiyu Li, Feiyu Xiong, Jie Hu, Mingchuan Yang

arXiv (2025)
