nanowell
/

uniform-instructed-base-qwen-3b

Model card Files Files and versions Community

uniform-instructed-base-qwen-3b / README.md

nanowell's picture

Update README.md

7a2e3f4 verified about 1 month ago

|

history blame contribute delete

104 Bytes

Novel training procedure to deslopify instruct/assistant models.

No SFT.

Pure RL with a good signal.