Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
4
5
7
Hejian Sang
pb09204048
Follow
Jibbscript's profile picture
webxos's profile picture
m0m0chen's profile picture
10 followers
ยท
6 following
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
28 days ago
On-Policy Self-Distillation for Reasoning Compression
submitted
a paper
28 days ago
On-Policy Self-Distillation for Reasoning Compression
authored
a paper
about 1 month ago
Overconfident Errors Need Stronger Correction: Asymmetric Confidence Penalties for Reinforcement Learning
View all activity
Organizations
Articles
1
Article
69
Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective
Papers
2
arxiv:
2602.21420
arxiv:
2510.00237
models
0
None public yet
datasets
0
None public yet