Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
nanowell
/
uniform-instructed-base-qwen-3b
like
1
Safetensors
qwen2
Model card
Files
Files and versions
Community
main
uniform-instructed-base-qwen-3b
/
README.md
nanowell
Update README.md
7a2e3f4
verified
about 1 month ago
preview
code
|
raw
Copy download link
history
blame
contribute
delete
104 Bytes
Novel training procedure to deslopify instruct/assistant models.
No SFT.
Pure RL with a good signal.