Jamessmun

jamessmun

19 25

AI & ML interests

None yet

Recent Activity

liked a dataset 6 days ago

hariikk/gt

upvoted a paper 11 days ago

ShortOPD: Recovering Pruned LLMs with Short-to-Long On-Policy Distillation

upvoted a paper 13 days ago

Trust Region Policy Distillation

View all activity

Organizations

None yet

liked a dataset 6 days ago

hariikk/gt

Updated 16 minutes ago • 2.74k • 1

upvoted a paper 11 days ago

ShortOPD: Recovering Pruned LLMs with Short-to-Long On-Policy Distillation

Paper • 2607.13124 • Published 17 days ago • 19

upvoted a paper 13 days ago

Trust Region Policy Distillation

Paper • 2607.04751 • Published 25 days ago • 35

upvoted a paper 15 days ago

UP: Unbounded Positive Asymmetric Optimization for Breaking the Exploration-Stability Dilemma

Paper • 2607.06987 • Published 23 days ago • 9

liked a model 19 days ago

koivualeksi/sanitizer

Updated 19 days ago • 2

liked a model 24 days ago

Walles777/swahili-english-translator

1.51M • Updated 24 days ago • 36 • 1

liked a dataset 28 days ago

banned-historical-archives/banned-historical-archives

Viewer • Updated Oct 19, 2025 • 1 • 1.33M • 65

liked a model about 1 month ago

anggasspm/infravision-backend

Updated Jun 20 • 1

liked 2 datasets about 2 months ago

jacob-valdez/synthux-economy-r9-w141

Viewer • Updated Jun 3 • 1.03k • 20 • 1

cadene/droid_1.0.1

Updated Mar 20, 2025 • 326k • 51

liked a dataset 2 months ago

HaptalAI/robolang

Viewer • Updated Jun 9 • 600 • 77 • 1

liked a Space 2 months ago

ProtectBirds

🏃

892

Protect Birds

liked a model 2 months ago

tencent/Hy-MT2-1.8B

Translation • 2B • Updated May 26 • 162k • • 1.16k

upvoted 2 papers 2 months ago

Boosting Omni-Modal Language Models: Staged Post-Training with Visually Debiased Evaluation

Paper • 2605.12034 • Published May 13 • 6

Multi-Objective and Mixed-Reward Reinforcement Learning via Reward-Decorrelated Policy Optimization

Paper • 2605.13641 • Published May 13 • 51

liked a model 2 months ago

Catter58/CASELLM-8b-evaluation

8B • Updated May 18 • 6 • 1

liked a model 3 months ago

Serkan007/moondream2-multiformat

Updated May 14 • 495 • 1

upvoted a paper 3 months ago

Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning

Paper • 2605.06130 • Published May 7 • 116

liked a model 3 months ago

Bhuvanesh0195/phi35-sap-ax-combined-gguf

4B • Updated May 7 • 8 • 1

upvoted a paper 3 months ago

Improving Robustness of Tabular Retrieval via Representational Stability

Paper • 2604.24040 • Published Apr 27 • 3

Jamessmun

AI & ML interests

Recent Activity

Organizations

jamessmun's activity

ProtectBirds