Yuanming-Li's picture

Open to Work

Yuanming-Li

Lymann

·

AI & ML interests

Computer Vision; Action Understanding

Recent Activity

upvoted a paper 1 day ago

Beyond Scalar Rewards by Internalizing Reasoning into Score Distributions

upvoted a paper 3 days ago

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

upvoted a paper 3 days ago

Rethinking the Divergence Regularization in LLM RL

View all activity

Organizations

New activity in ByteDance-Seed/Seed-X-RM-7B 4 months ago

model.load_state_dict(state_dict, strict=False) is very slow! Why?

#3 opened 11 months ago by