Molmo-7B-D NF4 Quant

Only the LLM portion was quantized; the CLIP encoder remains as is.
Model size: 30GB -> 7GB
Approx. 12GB of VRAM required.
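
As a rough sanity check on the size figures above, here is a minimal back-of-the-envelope sketch. The helper `weight_size_gb` and the round parameter count `ASSUMED_LLM_PARAMS` are hypothetical and used for illustration only; they are not taken from the base model's documentation. Only the 30GB and 7GB figures come from this card.

```python
def weight_size_gb(num_params: float, bits_per_param: float) -> float:
    """Approximate storage for `num_params` weights at a given bit width, in GB (1e9 bytes)."""
    return num_params * bits_per_param / 8 / 1e9


# Figures stated on this card.
ORIGINAL_GB = 30
QUANTIZED_GB = 7

reduction_factor = ORIGINAL_GB / QUANTIZED_GB    # how many times smaller
saved_fraction = 1 - QUANTIZED_GB / ORIGINAL_GB  # share of storage saved

# ASSUMPTION: a round 7e9 parameters for the LLM portion, for illustration only.
ASSUMED_LLM_PARAMS = 7e9
fp32_gb = weight_size_gb(ASSUMED_LLM_PARAMS, 32)  # 32-bit floats
nf4_gb = weight_size_gb(ASSUMED_LLM_PARAMS, 4)    # 4-bit NF4, ignoring scale metadata

print(f"reduction: {reduction_factor:.1f}x, saved: {saved_fraction:.0%}")
print(f"assumed LLM weights: {fp32_gb:.1f} GB at 32-bit, {nf4_gb:.1f} GB at 4-bit")
```

The quantized download is larger than a pure 4-bit estimate would suggest, which is consistent with the CLIP encoder being left unquantized and with NF4 storing per-block scaling factors alongside the 4-bit weights. The VRAM requirement is higher than the file size because activations and the KV cache also need memory at inference time.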
See the base model for more information:
https://huggingface.co/allenai/Molmo-7B-D-0924