Update README.md
README.md CHANGED
@@ -72,4 +72,6 @@ weighting the knowledge dataset `1.5` and the identity dataset `0.5` and setting
The result is a fast Qwen 3 model that seems to retain the updated knowledge base it was trained on, while lacking much of the personality I hoped for. I'm currently researching ways to weight the identity data more effectively. I've also noticed the model can get a little manpage-obsessed, with a particular focus on Perl (unfortunately), since the bulk of the manpages generated on my system (me, a developer who doesn't use Perl tooling; oh my god, how much of what we do touches Perl at some point) are Perl documentation.

+I've made [8bit](https://huggingface.co/dougiefresh/jade_qwen3_4b_mlx_8bit) and [4bit](https://huggingface.co/dougiefresh/jade_qwen3_4b_mlx_4bit) MLX quantizations of this bf16 model available.
+
❤️
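For anyone who wants to try the quantized checkpoints mentioned in the added line above, here is a minimal sketch (not part of the README) of loading the 4bit repo with the mlx-lm package. The prompt and `max_tokens` value are placeholders, and the exact `generate` keyword arguments can vary between mlx-lm releases, so check the docs for your installed version.

```python
# Minimal sketch: load the 4-bit MLX quantization from Hugging Face and run a
# short generation with mlx-lm. The prompt and max_tokens are arbitrary
# placeholders; generate() kwargs may differ across mlx-lm versions.
from mlx_lm import load, generate

# Downloads and caches the repo from Hugging Face on first use.
model, tokenizer = load("dougiefresh/jade_qwen3_4b_mlx_4bit")

prompt = "Explain what a manpage is in one sentence."  # placeholder prompt
response = generate(model, tokenizer, prompt=prompt, max_tokens=128)
print(response)
```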