Yuanming-Li
Lymann
AI & ML interests
Computer Vision; Action Understanding
Recent Activity
upvoted a paper 1 day ago
Beyond Scalar Rewards by Internalizing Reasoning into Score Distributions upvoted a paper 3 days ago
Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models upvoted a paper 3 days ago
Rethinking the Divergence Regularization in LLM RL