HuggingFaceTB/SmolVLM2-500M-Video-Instruct-mlx

This model was converted to MLX format from HuggingFaceTB/SmolVLM2-500M-Video-Instruct using mlx-vlm version 0.1.13. Refer to the original model card for more details on the model.

Use with mlx

pip install -U mlx-vlm
python -m mlx_vlm.generate --model mlx-community/SmolVLM2-500M-Video-Instruct-mlx-8bit-skip-vision --image https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/bee.jpg --prompt "Can you describe this image?"
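
The same command can also be driven from a script. The following is a minimal sketch, not part of mlx-vlm itself: it only assembles the `--model`, `--image` and `--prompt` flags shown above and hands them to a subprocess. The helper names `build_generate_command` and `run_generate` are hypothetical, and it assumes mlx-vlm is installed in the interpreter that runs the script.

```python
import shlex
import subprocess
import sys

MODEL = "mlx-community/SmolVLM2-500M-Video-Instruct-mlx-8bit-skip-vision"
IMAGE = "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/bee.jpg"


def build_generate_command(model, image, prompt):
    """Return the argv list for the mlx_vlm.generate command shown above.

    Hypothetical helper: it uses only the flags that appear in this card.
    """
    return [
        sys.executable, "-m", "mlx_vlm.generate",
        "--model", model,
        "--image", image,
        "--prompt", prompt,
    ]


def run_generate(model, image, prompt):
    """Run the command and return its stdout (assumes mlx-vlm is installed)."""
    result = subprocess.run(
        build_generate_command(model, image, prompt),
        check=True,            # raise CalledProcessError on a non-zero exit
        capture_output=True,   # collect stdout/stderr instead of printing
        text=True,             # decode output as str
    )
    return result.stdout


if __name__ == "__main__":
    # Print the command only; call run_generate(...) to actually execute it.
    command = build_generate_command(MODEL, IMAGE, "Can you describe this image?")
    print(shlex.join(command))
```

Passing the arguments as a list avoids shell quoting problems when the prompt contains spaces or quotes. Call `run_generate(MODEL, IMAGE, "...")` on a machine with mlx-vlm installed to get the generated text.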

Model size: 205M params (Safetensors). Tensor types: FP16, U32.