Feb 01, 2024 We released Aligner: a new efficient alignment paradigm, bypasses the whole RLHF process.
Jan 16, 2024 Two papers get accepted to ICLR 2024. Safe RLHF (Spotlight), SafeDreamer.
Dec 05, 2023 One paper get accepted to JMLR 2023! Heterogeneous-Agent Reinforcement Learning.
Nov 01, 2023 Big News! We released AI Alignment: A Comprehensive Survey.
Oct 21, 2023 We released Safe RLHF: Safe Reinforcement Learning from Human Feedback.
GitHub Repo Stars AK's Daily Papers
Oct 14, 2023 We released SafeDreamer: a novel algorithm for low-dimensional and vision-only safety tasks.
Sep 25, 2023 I contributed to Baichuan’s model fine-tuning in RLHF as a core member and earned 1W+ ⭐.
Baichuan-7B GitHub Repo Stars Baichuan-13B GitHub Repo Stars Baichuan2 GitHub Repo Stars
Sep 22, 2023 Three papers get accepted to NeurIPS 2023!
May 16, 2023 We released Safe-RLHF: Constrained Value Alignment for LLMs.
GitHub Repo Stars 机器之心报道:国内首个可复现的RLHF基准,北大团队开源 PKU-Beaver
Jan 27, 2023 We released Safety-Gymnasium: a highly scalable safeRL environment library.
GitHub Repo Stars
Nov 17, 2022 First Place in NeurIPS 2022 Challenge Track, MyoChallenge Page
相关报道: [北京大学前沿计算研究中心] [北京大学人工智能研究院] [北京大学] [中国青年报]
Oct 20, 2022 We released OmniSafe: An Infrastructure for Accelerating SafeRL Research.
GitHub Repo Stars