Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
baidu 's Collections
ERNIE-Image
ERNIE 4.5
Qianfan-VL

Qianfan-VL

updated Mar 18

Qianfan-vl model series. The models are mainly domain enhanced vision language model, targeting enterprise level multi modal understanding scenarios.

Upvote
29

  • baidu/Qianfan-OCR

    Image-Text-to-Text • 5B • Updated Apr 29 • 240k • 1.18k

  • baidu/Qianfan-VL-70B

    Image-Text-to-Text • 72B • Updated Apr 19 • 71 • 39

  • baidu/Qianfan-VL-8B

    Image-Text-to-Text • 9B • Updated Apr 19 • 8.02k • 41

  • baidu/Qianfan-VL-3B

    Image-Text-to-Text • 4B • Updated Sep 19, 2025 • 146 • 28

  • Running
    Agents
    8

    Qianfan VL Demo

    💬
    8

    Domain-Enhanced Universal Vision-Language Models

Upvote
29
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs