AXERA-TECH's Collections

Multimodal Models

updated 4 days ago

  • AXERA-TECH/lcm-lora-sdv1-5

    Updated Jun 23 • 7 • 1

  • AXERA-TECH/InternVL3-2B

    Visual Question Answering • Updated 12 days ago • 15 • 2

  • AXERA-TECH/Qwen2.5-VL-3B-Instruct

    Image-Text-to-Text • Updated 10 days ago • 19

  • AXERA-TECH/InternVL3-1B

    Image-Text-to-Text • Updated Jun 28 • 9

  • AXERA-TECH/SmolVLM2-500M-Video-Instruct

    Visual Question Answering • Updated Jul 14 • 6 • 2

  • AXERA-TECH/InternVL2_5-1B-MPO

    Image-Text-to-Text • Updated 8 days ago • 6

  • AXERA-TECH/InternVL2_5-1B

    Image-Text-to-Text • Updated Apr 4 • 5 • 1

  • AXERA-TECH/Janus-Pro-1B

    Visual Question Answering • Updated Apr 14 • 5 • 2

  • AXERA-TECH/SmolVLM-256M-Instruct

    Updated Apr 4 • 16 • 2

  • AXERA-TECH/YOLO-World-V2

    Object Detection • Updated Mar 23 • 6

  • AXERA-TECH/LivePortrait

    Image-to-Video • Updated Jun 21 • 2 • 4

  • AXERA-TECH/cnclip

    Updated 12 days ago • 6 • 1

  • AXERA-TECH/clip

    Updated 12 days ago • 5

  • AXERA-TECH/Qwen2.5-VL-7B-Instruct

    Image-Text-to-Text • Updated 10 days ago • 7