MuCo: Multi-turn Contrastive Learning for Multimodal Embedding Model Paper β’ 2602.06393 β’ Published Feb 6 β’ 3
Grounding World Simulation Models in a Real-World Metropolis Paper β’ 2603.15583 β’ Published 21 days ago β’ 153
Language-only Efficient Training of Zero-shot Composed Image Retrieval Paper β’ 2312.01998 β’ Published Dec 4, 2023
CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion Paper β’ 2303.11916 β’ Published Mar 21, 2023