Tony W's picture

10

Tony W

tonyaw

AI & ML interests

None yet

Organizations

None yet

New activity in nvidia/Llama-3.1-Nemotron-70B-Reward 3 months ago

Does the RL lead to this model to prefer to give answers in a certain length scope?

#4 opened 3 months ago by

New activity in joshmiller656/Llama-3.1-Nemotron-70B-Instruct-AWQ-INT4 3 months ago

Does the RL lead to this model to prefer to give answers in a certain length scope?

#1 opened 3 months ago by

New activity in nvidia/Llama-3.1-Nemotron-70B-Instruct-HF 3 months ago

Does the RL lead to this model to prefer to give answers in a certain length scope?

#65 opened 3 months ago by

New activity in ise-uiuc/Magicoder-S-DS-6.7B over 1 year ago

Incorrect vocab size?

#2 opened over 1 year ago by

New activity in HuggingFaceH4/starchat-alpha about 2 years ago

How to use PEFT+LoRA to fine-tune starchat-alpha

#17 opened about 2 years ago by