Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

openbmb
/
MiniCPM-o-2_6

Any-to-Any
Transformers
Safetensors
multilingual
minicpmo
feature-extraction
minicpm-o
omni
vision
ocr
multi-image
video
custom_code
audio
speech
voice cloning
live Streaming
realtime speech conversation
asr
tts
Model card Files Files and versions
xet
Community
54
MiniCPM-o-2_6 / assets
69.8 MB
  • 14 contributors
History: 6 commits
3v324v23's picture
3v324v23
update readme, add audio input examples
7063fbc 8 months ago
  • chattts_tokenizer
    add assets 8 months ago
  • input_examples
    update readme, add audio input examples 8 months ago
  • Skiing.mp4
    8.53 MB
    xet
    add omni case for inference 8 months ago
  • Vocos.pt

    Detected Pickle imports (3)

    • "collections.OrderedDict",
    • "torch.FloatStorage",
    • "torch._utils._rebuild_tensor_v2"

    What is a pickle import?

    54.4 MB
    xet
    add assets 8 months ago
  • demo.wav
    1.45 MB
    xet
    update readme 8 months ago
  • mimick.wav
    384 kB
    xet
    update audio demo 8 months ago
  • qa.wav
    115 kB
    xet
    updata audio demo 8 months ago