- neuralmagic/Sparse-Llama-3.1-8B-gsm8k-2of4
  Text Generation • Updated • 57 • 1
- neuralmagic/Sparse-Llama-3.1-8B-evolcodealpaca-2of4-FP8-dynamic
  Text Generation • Updated • 21
- neuralmagic/Sparse-Llama-3.1-8B-2of4
  Text Generation • Updated • 940 • 60
- neuralmagic/Sparse-Llama-3.1-8B-gsm8k-2of4-FP8-dynamic
  Text Generation • Updated • 216 • 1
NM Testing
company
Collections
3
- neuralmagic/Nous-Hermes-2-Yi-34B-pruned50
  Text Generation • Updated • 15
- neuralmagic/Nous-Hermes-2-Yi-34B-pruned2.4
  Text Generation • Updated • 14
- neuralmagic/Nous-Hermes-2-SOLAR-10.7B-pruned2.4
  Text Generation • Updated • 19
- neuralmagic/OpenHermes-2.5-Mistral-7B-pruned50
  Text Generation • Updated • 152 • 1
models
287
nm-testing/TinyLlama-1.1B-Chat-v1.0-sparse2of4_only-e2e • Updated • 16
nm-testing/TinyLlama-1.1B-Chat-v1.0-kv_cache_default_tinyllama-e2e • Updated • 8
nm-testing/TinyLlama-1.1B-Chat-v1.0-sparse2of4_fp8_dynamic-e2e • Updated • 4
nm-testing/Phi-3-mini-4k-instruct-kv_cache_default_phi3-e2e • Updated • 8
nm-testing/TinyLlama-1.1B-Chat-v1.0-sparse2of4_fp8_dyname-e2e • Updated
nm-testing/llama2.c-stories42M-quantized-fp8-Dynamic • Updated • 5
nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A8-Dynamic-Per-Token-uncompressed • Updated • 10
nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A8-Dynamic-Per-Token-compressed • Updated • 23
nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A16-G128-uncompressed • Updated • 15
nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A16-G128-compressed • Updated • 31