Submitted by
Chaojun XIAO
AI & ML interests
Large Language Models
Recent Activity
View all activity
Papers
Beyond Reward Engineering: A Data Recipe for Long-Context Reinforcement Learning
Rethinking the Role of Efficient Attention in Hybrid Architectures