VITA-MLLM's Collections

VITA-Audio

updated 4 days ago

  • VITA-MLLM/VITA-Audio-Boost

    Updated 15 days ago • 16 • 1

  • VITA-MLLM/VITA-Audio-Balance

    Updated 15 days ago • 18 • 1

  • VITA-MLLM/VITA-Audio-Plus-Vanilla

    Updated 6 days ago • 114 • 2

  • VITA-MLLM/VITA-Audio-Data

    Preview • Updated 4 days ago • 113 • 1

  • VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model

    Paper • 2505.03739 • Published 6 days ago • 8