The Prime X
Collection
math, code
•
5 items
•
Updated
BetaCeti-Beta-4B-Prime1 is a compact, coding-optimized language model built on the Qwen3-4B architecture, tailored for high-accuracy code generation, debugging, and technical reasoning. With 4 billion parameters, it strikes a balance between performance and efficiency, making it an ideal assistant for developers, educators, and engineers working in constrained environments or requiring fast inference.
File Name | Precision | Size |
---|---|---|
BetaCeti-Beta-4B-Prime1.BF16.gguf | BF16 | 8.05 GB |
BetaCeti-Beta-4B-Prime1.F16.gguf | FP16 | 8.05 GB |
BetaCeti-Beta-4B-Prime1.F32.gguf | FP32 | 16.1 GB |
BetaCeti-Beta-4B-Prime1.Q2_K.gguf | Q2_K | 1.67 GB |
BetaCeti-Beta-4B-Prime1.Q3_K_M.gguf | Q3_K_M | 2.08 GB |
BetaCeti-Beta-4B-Prime1.Q4_K_M.gguf | Q4_K_M | 2.50 GB |
BetaCeti-Beta-4B-Prime1.Q5_K_M.gguf | Q5_K_M | 2.89 GB |
BetaCeti-Beta-4B-Prime1.Q8_0.gguf | Q8_0 | 4.28 GB |
config.json | Config File | 31 Bytes |
(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants)
Here is a handy graph by ikawrakow comparing some lower-quality quant types (lower is better):
2-bit
3-bit
4-bit
5-bit
8-bit
16-bit
32-bit
Base model
Qwen/Qwen3-4B-Base