A 4-bit OmniQuant-quantized version of Dolphin3.0-Llama3.2-3B, for inference with Private LLM.
Base model