Nanonets-OCR2-3B-4bit / README.md

ljoana

Upload folder using huggingface_hub

fe1396c verified 25 days ago

preview code

raw

history blame contribute delete

681 Bytes

metadata

language:
  - multilingual
base_model:
  - Qwen/Qwen2.5-VL-3B-Instruct
tags:
  - OCR
  - image-to-text
  - pdf2markdown
  - VQA
  - mlx
pipeline_tag: image-text-to-text
library_name: transformers

mlx-community/Nanonets-OCR2-3B-4bit

This model was converted to MLX format from nanonets/Nanonets-OCR2-3B using mlx-vlm version 0.3.3. Refer to the original model card for more details on the model.

Use with mlx

pip install -U mlx-vlm

python -m mlx_vlm.generate --model mlx-community/Nanonets-OCR2-3B-4bit --max-tokens 100 --temperature 0.0 --prompt "Describe this image." --image <path_to_image>