Om AI Lab

company

https://github.com/om-ai-lab

OmAI_lab

om-ai-lab

AI & ML interests

Multimodal AI, Agents

Recent Activity

tianchez submitted a paper 1 day ago

Which Pretraining Paradigm Better Serves Spatial Intelligence? An Empirical Comparison of Vision-Language and Video Generation Models

kyusonglee updated a model about 2 months ago

omlab/opentrackvla-qwen06b

Zilun updated a dataset 4 months ago

omlab/SARDet_REC6_NORM-FS

Papers

Which Pretraining Paradigm Better Serves Spatial Intelligence? An Empirical Comparison of Vision-Language and Video Generation Models

VLM-FO1: Bridging the Gap Between High-Level Reasoning and Fine-Grained Perception in VLMs

Articles

Trials, Errors, and Breakthroughs: Our Rocky Road to OVD SOTA with Reinforcement Learning

Mar 25, 2025

• 2

Improving Object Detection through Reinforcement Learning with VLM-R1

Mar 25, 2025

• 3

Organization Card

Om AI Lab is a passionate group building multimodal AI agents that reshape our work and life.

Collections 4

spaces 5

Open Agent Leaderboard

🥇

Open Agent Leaderboard

VLM R1 Referral Expression

💬

Mark regions in images based on text descriptions

OmAgent

💬

Process and answer questions about webpage videos

VLM R1 OVD

👁

VLM-R1 model for Open-Vocabulary Object Detection

models 9

datasets 12

omlab/SARDet_REC6_NORM-FS

Viewer • Updated Feb 4 • 968 • 16

omlab/SARDet_REC6-FS

Viewer • Updated Feb 4 • 968 • 6

omlab/SARDet3-FS

Viewer • Updated Feb 1 • 270 • 19

omlab/Cross_DIOR-RSVG

Viewer • Updated Oct 2, 2025 • 7.42k • 73

omlab/Cross_RRSIS-D

Viewer • Updated Oct 2, 2025 • 3.48k • 24

omlab/VRSBench-FS

Viewer • Updated Oct 2, 2025 • 16.6k • 315 • 1

omlab/NWPU-FS

Viewer • Updated Oct 2, 2025 • 39 • 32

omlab/EarthReason-FS

Viewer • Updated Oct 2, 2025 • 3.39k • 14

omlab/VLM-R1

Preview • Updated Apr 23, 2025 • 369 • 18

omlab/RS5M

Viewer • Updated Mar 16, 2025 • 7.25M • 2.1k • 1

Team members 5
