llm-jp/optimal-sparsity-math-d2048-E32-k2-13.6B-A1.5B Text Generation • 14B • Updated 5 days ago • 20
llm-jp/optimal-sparsity-math-d2048-E64-k2-26.4B-A1.5B Text Generation • 26B • Updated 5 days ago • 18
llm-jp/optimal-sparsity-math-d2048-E32-k16-13.6B-A7.1B Text Generation • 14B • Updated 5 days ago • 19
llm-jp/optimal-sparsity-math-d2048-E32-k8-13.6B-A3.9B Text Generation • 14B • Updated 5 days ago • 19
llm-jp/optimal-sparsity-math-d2048-E64-k16-26.4B-A7.1B Text Generation • 26B • Updated 5 days ago • 16
llm-jp/optimal-sparsity-math-d2048-E128-k2-52.2B-A1.5B Text Generation • 52B • Updated 5 days ago • 22
llm-jp/optimal-sparsity-math-d2048-E64-k8-26.4B-A3.9B Text Generation • 26B • Updated 5 days ago • 18
llm-jp/optimal-sparsity-math-d1024-E128-k16-13.2B-A1.9B Text Generation • 13B • Updated 5 days ago • 26
llm-jp/optimal-sparsity-math-d2048-E128-k16-52.2B-A7.1B Text Generation • 52B • Updated 5 days ago • 22
llm-jp/optimal-sparsity-math-d512-E16-k16-520M-A520M Text Generation • 0.5B • Updated 5 days ago • 18