yuansui
AI & ML interests
None yet
Organizations
None yet
yuansui/llama3.1_8b_instruct_sft-v2
8B
•
Updated
•
2
yuansui/llama3.1_8b_instruct_sft_dpo
8B
•
Updated
•
2
yuansui/llama3.1_8b_instruct_sft
8B
•
Updated
•
2
yuansui/llama-160m-PPO-tuned
Reinforcement Learning
•
Updated
•
3
yuansui/Meta-Llama-3.1-8B-Instruct-PPO-tuned
Reinforcement Learning
•
Updated
•
2
yuansui/TinyLLama-v0-PPO-tuned
Reinforcement Learning
•
Updated
•
2
yuansui/llama3-8b-instruct-PPO-tuned
Updated
yuansui/llama2_7b_instruct_sft_dpo
Text Generation
•
7B
•
Updated
•
2
yuansui/bert-finetuned-ner-accelerate
Updated
yuansui/bert-finetuned-ner
Updated