arxiv:2606.23543
Kai Zheng
tangmen
AI & ML interests
None yet
Recent Activity
submitted a paper about 18 hours ago
VeriEvol: Scaling Multimodal Mathematical Reasoning via Verifiable Evol-Instruct
authored a paper about 23 hours ago
RubricBench: Aligning Model-Generated Rubrics with Human Standards
authored a paper about 23 hours ago
OffSeeker: Online Reinforcement Learning Is Not All You Need for Deep Research Agents