FuseAI
/

FuseO1-DeepSeekR1-QwQ-SkyT1-32B-Preview

Model card Files Files and versions

Resources

View closed (3)

RL finetuning on this merge leads to model collapse

#11 opened 5 months ago by

非常喜欢这个模型

#9 opened 6 months ago by

Add comparison with 70B distilled R1 model

#8 opened 6 months ago by

Update model card

#7 opened 6 months ago by

Temperature's effect on the performance of long chain reasoning models. Why was 0.7 used for the evals?

#6 opened 7 months ago by

License of your model

#4 opened 7 months ago by

Evaluation

#3 opened 7 months ago by

Merge with 32b coder?

#2 opened 7 months ago by