Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
1
7
Yuan Sui
yuansui
Follow
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 1 month ago
Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models
liked
a dataset
4 months ago
TIGER-Lab/TheoremQA
liked
a dataset
9 months ago
aburns4/WikiWeb2M
View all activity
Organizations
None yet
yuansui
's models
10
Sort: Recently updated
yuansui/llama3.1_8b_instruct_sft-v2
Updated
Sep 14, 2024
•
11
yuansui/llama3.1_8b_instruct_sft_dpo
Updated
Sep 14, 2024
•
32
yuansui/llama3.1_8b_instruct_sft
Updated
Sep 14, 2024
•
10
yuansui/llama-160m-PPO-tuned
Reinforcement Learning
•
Updated
Sep 11, 2024
•
10
yuansui/Meta-Llama-3.1-8B-Instruct-PPO-tuned
Reinforcement Learning
•
Updated
Sep 6, 2024
•
13
yuansui/TinyLLama-v0-PPO-tuned
Reinforcement Learning
•
Updated
Sep 6, 2024
•
8
yuansui/llama3-8b-instruct-PPO-tuned
Updated
Sep 6, 2024
yuansui/llama2_7b_instruct_sft_dpo
Text Generation
•
Updated
Aug 25, 2024
•
18
yuansui/bert-finetuned-ner-accelerate
Updated
Apr 12, 2022
yuansui/bert-finetuned-ner
Updated
Apr 12, 2022