news
Feb 01, 2024 |
We released Aligner: a new efficient alignment paradigm, bypasses the whole RLHF process.
无需RLHF显著提升GPT-4/Llama2性能,北大团队提出Aligner对齐新范式 |
---|---|
Jan 16, 2024 | Two papers get accepted to ICLR 2024. Safe RLHF (Spotlight), SafeDreamer. |
Dec 05, 2023 | One paper get accepted to JMLR 2023! Heterogeneous-Agent Reinforcement Learning. |
Nov 01, 2023 | Big News! We released AI Alignment: A Comprehensive Survey. |
Oct 21, 2023 |
We released Safe RLHF: Safe Reinforcement Learning from Human Feedback.
|
Oct 14, 2023 | We released SafeDreamer: a novel algorithm for low-dimensional and vision-only safety tasks. |
Sep 25, 2023 |
I contributed to Baichuan’s model fine-tuning in RLHF as a core member and earned 1W+ ⭐.
Baichuan-7B |
Sep 22, 2023 | Three papers get accepted to NeurIPS 2023! |
May 16, 2023 |
We released Safe-RLHF: Constrained Value Alignment for LLMs.
|
Jan 27, 2023 |
We released Safety-Gymnasium: a highly scalable safeRL environment library.
|
Nov 17, 2022 |
First Place in NeurIPS 2022 Challenge Track, MyoChallenge Page
相关报道: [北京大学前沿计算研究中心] [北京大学人工智能研究院] [北京大学] [中国青年报] |
Oct 20, 2022 |
We released OmniSafe: An Infrastructure for Accelerating SafeRL Research.
|