Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
41
222
53
KABI
dongguanting
Follow
vanillaOVO's profile picture
AicyXxgzbd's profile picture
varuy322's profile picture
68 followers
·
106 following
https://dongguanting.github.io/
kakakbibibi
dongguanting
AI & ML interests
Reasoning and Alignment for Large Language Models
Recent Activity
upvoted
a
paper
about 16 hours ago
SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization
commented
on
a paper
about 20 hours ago
The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook
upvoted
a
paper
about 20 hours ago
The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook
View all activity
Organizations
dongguanting
's datasets
11
Sort: Recently updated
dongguanting/ARPO-RL-DeepSearch-1K
Viewer
•
Updated
Oct 17, 2025
•
1.07k
•
67
•
6
dongguanting/ARPO-RL-Reasoning-10K
Viewer
•
Updated
Oct 17, 2025
•
10k
•
143
•
4
dongguanting/ARPO-SFT-54K
Viewer
•
Updated
Oct 17, 2025
•
54.6k
•
149
•
14
dongguanting/RAG-Error-Critic-100K
Viewer
•
Updated
Jun 28, 2025
•
100k
•
26
•
3
dongguanting/Tool-Star-SFT-54K
Viewer
•
Updated
May 29, 2025
•
54k
•
202
•
10
dongguanting/Multi-Tool-RL-10K
Viewer
•
Updated
May 25, 2025
•
10k
•
81
•
5
dongguanting/RAG-QA-40K
Viewer
•
Updated
Dec 27, 2024
•
32.8k
•
42
•
2
dongguanting/ShareGPT-12K
Viewer
•
Updated
Dec 27, 2024
•
12.9k
•
79
•
1
dongguanting/VIF-RAG-QA-110K
Viewer
•
Updated
Dec 27, 2024
•
111k
•
47
•
7
dongguanting/DotamathQA
Viewer
•
Updated
Dec 26, 2024
•
574k
•
67
•
2
dongguanting/VIF-RAG-QA-20K
Viewer
•
Updated
Nov 1, 2024
•
20k
•
7
•
4