Adding Evaluation Results
#15 opened about 1 year ago
by
leaderboard-pr-bot
请问deepspeed zero3的参数是怎么配置的
1
#14 opened about 1 year ago
by
Aibet
loss震荡幅度比较大是正常的嘛,loss是在3个epoch的哪个时候开始下降并保持稳定的呢
3
#13 opened about 1 year ago
by
Aibet
请问eval data 应该如何获取
#11 opened over 1 year ago
by
endNone
模型效果超出预期,很棒!!
1
#8 opened over 1 year ago
by
oscar325
请问这个sft用到了哪些数据,总共是多少量级?
1
#7 opened over 1 year ago
by
Kuaixueshiqing
Why you have the most py files in the configfile ?
#3 opened over 1 year ago
by
Wulolx