open-r1/OpenR1-Qwen-7B

Text Generation

Generated from Trainer

text-generation-inference

gradient_accumulation_steps/batchsize

#7 opened 5 months ago by

I'm removing this model from my HDD and this is the reason.

#6 opened 5 months ago by

sft time

#5 opened 5 months ago by

About Training Detail

#4 opened 5 months ago by

Different max_position_embeddings and rope_theta in OpenR1-Qwen-7B-SFT and its base Qwen2.5-Math-7B-Instruct?

#3 opened 5 months ago by

About initial Model

#2 opened 6 months ago by

training code

#1 opened 6 months ago by