Tony W
tonyaw
AI & ML interests
None yet
Organizations
None yet
Does the RL lead to this model to prefer to give answers in a certain length scope?
#4 opened 3 months ago
by
tonyaw
Does the RL lead to this model to prefer to give answers in a certain length scope?
#1 opened 3 months ago
by
tonyaw
Does the RL lead to this model to prefer to give answers in a certain length scope?
#65 opened 3 months ago
by
tonyaw
Incorrect vocab size?
👍
2
12
#2 opened over 1 year ago
by
claudiuv
How to use PEFT+LoRA to fine-tune starchat-alpha
1
#17 opened about 2 years ago
by
tonyaw