arxiv:2411.02355
Eldar Kurtić
ekurtic
AI & ML interests
Efficient inference
Recent Activity
updated
a model
2 days ago
RedHatAI/NVIDIA-Nemotron-Nano-9B-v2-quantized.w4a16
published
a model
2 days ago
RedHatAI/NVIDIA-Nemotron-Nano-9B-v2-quantized.w4a16
updated
a model
3 days ago
nm-testing/NVIDIA-Nemotron-Nano-9B-v2-quantized.w4a16