Haoxiang Zhang's picture

Haoxiang Zhang

IPF

·

https://isaacghx.github.io/about

AI & ML interests

None yet

Recent Activity

commentedon a paper 1 day ago

RLCSD: Reinforcement Learning with Contrastive On-Policy Self-Distillation

upvoted a paper 1 day ago

RLCSD: Reinforcement Learning with Contrastive On-Policy Self-Distillation

updated a model 2 days ago

Eubiota/eubiota-14b-step22

View all activity

Organizations

commented a paper 1 day ago

RLCSD: Reinforcement Learning with Contrastive On-Policy Self-Distillation

Paper • 2606.11709 • Published 18 days ago • 1 •

commented a paper 24 days ago

Masking Stale Observations Helps Search Agents -- Until It Doesn't: A Regime Map and Its Mechanism

Paper • 2606.00408 • Published 30 days ago • 65 •

New activity in IPF/AIME25-CoT-CN 10 months ago

Update README.md for meta data

#3 opened 10 months ago by