-
Socratic-Geo: Synthetic Data Generation and Geometric Reasoning via Multi-Agent Interaction
Paper • 2602.03414 • Published -
MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero Data
Paper • 2603.09206 • Published • 53 -
Tool-R0: Self-Evolving LLM Agents for Tool-Learning from Zero Data
Paper • 2602.21320 • Published • 12 -
ChartVerse: Scaling Chart Reasoning via Reliable Programmatic Synthesis from Scratch
Paper • 2601.13606 • Published • 11
Collections
Discover the best community collections!
Collections including paper arxiv:2603.09206
-
lusxvr/nanoVLM-222M
Image-Text-to-Text • 0.2B • Updated • 419 • 99 -
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Paper • 2503.09516 • Published • 39 -
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
Paper • 2505.24863 • Published • 97 -
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning
Paper • 2505.17667 • Published • 88
-
mHC: Manifold-Constrained Hyper-Connections
Paper • 2512.24880 • Published • 321 -
Fantastic Reasoning Behaviors and Where to Find Them: Unsupervised Discovery of the Reasoning Process
Paper • 2512.23988 • Published • 19 -
SpaceTimePilot: Generative Rendering of Dynamic Scenes Across Space and Time
Paper • 2512.25075 • Published • 15 -
Guiding a Diffusion Transformer with the Internal Dynamics of Itself
Paper • 2512.24176 • Published • 8
-
ADS-Edit: A Multimodal Knowledge Editing Dataset for Autonomous Driving Systems
Paper • 2503.20756 • Published • 7 -
BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset
Paper • 2505.09568 • Published • 99 -
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Paper • 2508.18265 • Published • 217 -
Qwen3-Omni Technical Report
Paper • 2509.17765 • Published • 153
-
Socratic-Geo: Synthetic Data Generation and Geometric Reasoning via Multi-Agent Interaction
Paper • 2602.03414 • Published -
MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero Data
Paper • 2603.09206 • Published • 53 -
Tool-R0: Self-Evolving LLM Agents for Tool-Learning from Zero Data
Paper • 2602.21320 • Published • 12 -
ChartVerse: Scaling Chart Reasoning via Reliable Programmatic Synthesis from Scratch
Paper • 2601.13606 • Published • 11
-
mHC: Manifold-Constrained Hyper-Connections
Paper • 2512.24880 • Published • 321 -
Fantastic Reasoning Behaviors and Where to Find Them: Unsupervised Discovery of the Reasoning Process
Paper • 2512.23988 • Published • 19 -
SpaceTimePilot: Generative Rendering of Dynamic Scenes Across Space and Time
Paper • 2512.25075 • Published • 15 -
Guiding a Diffusion Transformer with the Internal Dynamics of Itself
Paper • 2512.24176 • Published • 8
-
lusxvr/nanoVLM-222M
Image-Text-to-Text • 0.2B • Updated • 419 • 99 -
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Paper • 2503.09516 • Published • 39 -
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
Paper • 2505.24863 • Published • 97 -
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning
Paper • 2505.17667 • Published • 88
-
ADS-Edit: A Multimodal Knowledge Editing Dataset for Autonomous Driving Systems
Paper • 2503.20756 • Published • 7 -
BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset
Paper • 2505.09568 • Published • 99 -
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Paper • 2508.18265 • Published • 217 -
Qwen3-Omni Technical Report
Paper • 2509.17765 • Published • 153