arxiv:2511.04670
Shusheng Yang PRO
ShushengYang
AI & ML interests
computer vision, vision language model
Recent Activity
updated a dataset about 8 hours ago
nyu-visionx/VSI-590K-MetaInfo published a dataset about 8 hours ago
nyu-visionx/VSI-590K-MetaInfo upvoted a paper about 1 month ago
Beyond Language Modeling: An Exploration of Multimodal Pretraining