Text Generation
Transformers
GGUF
English
esper
esper-3
valiant
valiant-labs
qwen
qwen-3
qwen-3-4b
4b
reasoning
code
code-instruct
python
javascript
dev-ops
jenkins
terraform
scripting
powershell
azure
aws
gcp
cloud
problem-solving
architect
engineer
developer
creative
analytical
expert
rationality
conversational
chat
instruct
llama-cpp
gguf-my-repo
Update README.md
Browse files
README.md
CHANGED
@@ -51,6 +51,14 @@ license: apache-2.0
|
|
51 |
This model was converted to GGUF format from [`ValiantLabs/Qwen3-4B-Esper3`](https://huggingface.co/ValiantLabs/Qwen3-4B-Esper3) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
|
52 |
Refer to the [original model card](https://huggingface.co/ValiantLabs/Qwen3-4B-Esper3) for more details on the model.
|
53 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
54 |
## Use with llama.cpp
|
55 |
Install llama.cpp through brew (works on Mac and Linux)
|
56 |
|
|
|
51 |
This model was converted to GGUF format from [`ValiantLabs/Qwen3-4B-Esper3`](https://huggingface.co/ValiantLabs/Qwen3-4B-Esper3) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
|
52 |
Refer to the [original model card](https://huggingface.co/ValiantLabs/Qwen3-4B-Esper3) for more details on the model.
|
53 |
|
54 |
+
---
|
55 |
+
Esper 3 is a coding, architecture, and DevOps reasoning specialist built on Qwen 3.
|
56 |
+
|
57 |
+
- Finetuned on our DevOps and architecture reasoning and code reasoning data generated with Deepseek R1!
|
58 |
+
- Improved general and creative reasoning to supplement problem-solving and general chat performance.
|
59 |
+
- Small model sizes allow running on local desktop and mobile, plus super-fast server inference!
|
60 |
+
|
61 |
+
---
|
62 |
## Use with llama.cpp
|
63 |
Install llama.cpp through brew (works on Mac and Linux)
|
64 |
|