Kai Zheng's picture

Kai Zheng

tangmen

·

AI & ML interests

None yet

Recent Activity

submitted a paper 4 days ago

VeriEvol: Scaling Multimodal Mathematical Reasoning via Verifiable Evol-Instruct

authored a paper 5 days ago

RubricBench: Aligning Model-Generated Rubrics with Human Standards

authored a paper 5 days ago

OffSeeker: Online Reinforcement Learning Is Not All You Need for Deep Research Agents

View all activity

Organizations

Papers 9

arxiv:2606.23543

arxiv:2603.01571

arxiv:2603.01562

arxiv:2601.18467

models 5

tangmen/zephyr-7b-dpo-qlora

Updated Jan 21, 2024

tangmen/zephyr-7b-dpo-full

Updated Jan 21, 2024

tangmen/WizardVerseV1

Updated Dec 7, 2023

tangmen/WizardVerse

Updated Dec 7, 2023

tangmen/chatV

Updated Oct 3, 2023

datasets 0

None public yet