deepakkumar07's Collections
vision-llm

updated about 3 hours ago

  • Running
    102

    Vision Papers

    💻

    All paper summaries read by Merve


  • Running on Zero
    18

    Ovis2 1B

    🦫

    Small model can do big things.


  • AIDC-AI/Ovis2-8B-GPTQ-Int4

    Image-Text-to-Text • Updated Mar 25 • 306 • 2

  • AIDC-AI/Ovis2-1B

    Image-Text-to-Text • Updated Feb 27 • 18.5k • 87

  • Running on Zero
    12

    Ovis2 8B

    🦫

    Ovis2-8B


  • lambdalabs/Llama-3.3-70B-Instruct-AWQ-4bit

    Updated Dec 10, 2024 • 1.93k • 4

  • microsoft/GUI-Actor-7B-Qwen2-VL

    Image-Text-to-Text • Updated 9 days ago • 819 • 29

  • lambdalabs/sd-image-variations-diffusers

    Image-to-Image • Updated Feb 8, 2023 • 5.22k • 445