nanowell/uniform-instructed-base-qwen-3b
Safetensors
qwen2
Update README.md
nanowell committed on Mar 20
Commit 7a2e3f4 (verified) · 1 parent: 37c4d38
README.md (+3 -2)
@@ -1,5 +1,6 @@
-Novel training procedure to deslopify models.
+Novel training procedure to deslopify instruct/assistant models.
 
 No SFT.
 
-Pure RL with a good signal.
+Pure RL with a good signal.
+