- RL finetuning on this merge leads to model collapse (#11, opened 11 months ago by radna)
- Really like this model (#9, opened 12 months ago by SongXiaoMao)
- Add comparison with 70B distilled R1 model (#8, opened 12 months ago by blankohagen7)
- Update model card (#7, opened about 1 year ago by minpeter)
- Temperature's effect on the performance of long-chain reasoning models: why was 0.7 used for the evals? (#6, opened about 1 year ago by j456; 👍 1)
- License of your model (#4, opened about 1 year ago by chewkokwah; 🤝 1)
- Evaluation (#3, opened about 1 year ago by PSM24; 🤝 1)
- Merge with 32b coder? (#2, opened about 1 year ago by RDson; 👀 1, 14 replies)