yssss

yuansui

AI & ML interests

None yet

Recent Activity

liked a model 27 days ago

Qwen/Qwen3-30B-A3B-Instruct-2507

upvoted an article about 1 month ago

We Got Claude to Fine-Tune an Open Source LLM

authored a paper about 1 month ago

Table Meets LLM: Can Large Language Models Understand Structured Table Data? A Benchmark and Empirical Study

yuansui's models (19)

yuansui/wia-qwen3-14b-grpo

15B • Updated Aug 22, 2025 • 1

yuansui/wia-qwen3-8b-sft-epoch2

8B • Updated Aug 22, 2025 • 4

yuansui/wia-qwen3-8b-sft-epoch2-grpo-440steps

8B • Updated Aug 22, 2025 • 1

yuansui/wia-qwen3-14b-sft-epoch3-grpo-460steps

15B • Updated Aug 22, 2025 • 3

yuansui/wia-qwen3-8b-grpo

8B • Updated Aug 22, 2025 • 2

yuansui/wia-qwen3-14b-sft-epoch3

15B • Updated Aug 22, 2025 • 1

yuansui/qwen3-14b

Text Generation • 15B • Updated Aug 22, 2025 • 1

yuansui/wia-qwen3-14b-sft-epoch2

15B • Updated Aug 22, 2025 • 2

yuansui/qwen3-8b

Text Generation • 8B • Updated Aug 22, 2025 • 2

yuansui/llama3.1_8b_instruct_sft-v2

8B • Updated Sep 14, 2024 • 2

yuansui/llama3.1_8b_instruct_sft_dpo

8B • Updated Sep 14, 2024 • 2

yuansui/llama3.1_8b_instruct_sft

8B • Updated Sep 14, 2024 • 5

yuansui/llama-160m-PPO-tuned

Reinforcement Learning • Updated Sep 11, 2024 • 5

yuansui/Meta-Llama-3.1-8B-Instruct-PPO-tuned

Reinforcement Learning • Updated Sep 6, 2024 • 3

yuansui/TinyLLama-v0-PPO-tuned

Reinforcement Learning • Updated Sep 6, 2024 • 1

yuansui/llama3-8b-instruct-PPO-tuned

Updated Sep 6, 2024

yuansui/llama2_7b_instruct_sft_dpo

Text Generation • 7B • Updated Aug 25, 2024 • 2

yuansui/bert-finetuned-ner-accelerate

Updated Apr 12, 2022

yuansui/bert-finetuned-ner

Updated Apr 12, 2022