LatentUM: Unleashing the Potential of Interleaved Cross-Modal Reasoning via a Latent-Space Unified Model Paper • 2604.02097 • Published 2 days ago • 20
daVinci-MagiHuman 🎬 Space • Running on CPU Upgrade • 118 • Generate short videos from an image and text prompt
Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model Paper • 2603.21986 • Published 12 days ago • 120
Data Darwinism Part I: Unlocking the Value of Scientific Data for Pre-training Paper • 2602.07824 • Published Feb 8 • 18
AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security Paper • 2601.18491 • Published Jan 26 • 125
AgentDoG Collection A Diagnostic Guardrail Framework for AI Agent Safety and Security • 9 items • Updated 11 days ago • 107
daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently Paper • 2602.02619 • Published Feb 2 • 53
UniReason 1.0: A Unified Reasoning Framework for World Knowledge Aligned Image Generation and Editing Paper • 2602.02437 • Published Feb 2 • 80
Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models Paper • 2601.22060 • Published Jan 29 • 155