Text Generation
Safetensors
qwen2
conversational

image/png

Pushing the limits of the Zeta-2-Dataset. New: Zeta 3

Zeta 3 is a new LLM that challenges itself to outperform Zeta-2, even though it uses the same Zeta-2-Dataset.

Ollama: DiamondGotCat/Zeta-3

Quantized Model (GGUF)

Prompt Template

{{ if .System }}{{ .System }}{{ end }}
{{ if .Prompt }}<USER>{{ .Prompt }}</USER>{{ end }}
<ASSISTANT>

Stop Token

</ASSISTANT>

Computer Spec

Machine: RunPod VM(GPU, NVIDIA A100 PCIe)

This time, I used the RunPod service to study more efficiently.

Thanks to RunPod, I was able to use CUDA and had the optimization options available.

Dataset

Details of the dataset used can be found here

Links

GitHub: Zeta


Zeta is just a small SLM. But don't forget that it has big dreams inside.

Downloads last month
3
Safetensors
Model size
464M params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Zeta-LLM/Zeta-3

Quantizations
2 models

Dataset used to train Zeta-LLM/Zeta-3