arxiv:2310.12773
Ruiyang Sun
RuiyangSun
AI & ML interests
RL
Recent Activity
liked a dataset about 1 month ago
programbench/ProgramBench-Tests
authored a paper over 2 years ago
Safe RLHF: Safe Reinforcement Learning from Human Feedback
authored a paper almost 3 years ago
Baichuan 2: Open Large-scale Language Models