Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
CohenQu
/
Joint-Train-deepscalar_RL_easy_500_verl_0.35_0.001_0.001_16_12k_4
like
0
Safetensors
qwen3
Model card
Files
Files and versions
Community
main
Joint-Train-deepscalar_RL_easy_500_verl_0.35_0.001_0.001_16_12k_4
Commit History
Training in progress, step 180
f512d0e
verified
CohenQu
commited on
5 days ago
Training in progress, step 150
dea24fe
verified
CohenQu
commited on
5 days ago
Training in progress, step 120
5b9c831
verified
CohenQu
commited on
5 days ago
Training in progress, step 90
d6b9bad
verified
CohenQu
commited on
5 days ago
Training in progress, step 60
810a4ba
verified
CohenQu
commited on
5 days ago
Training in progress, step 30
651e517
verified
CohenQu
commited on
5 days ago
Add tokenizer from base model
87c01fa
verified
CohenQu
commited on
5 days ago
initial commit
4e3a8ce
verified
CohenQu
commited on
5 days ago