KABI's picture

KABI

dongguanting

·

https://dongguanting.github.io/

AI & ML interests

Reasoning and Alignment for Large Language Models

Recent Activity

upvoted a paper 2 days ago

MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

upvoted an article 15 days ago

Create, Evaluate, and Connect AI Skills | SkillNet: A Large-Scale Agentic "Skill Graph" Knowledge Base

new activity 16 days ago

dongguanting/Tool-Star-Qwen-1.5B:Update README.md

View all activity

Organizations

dongguanting 's models 16

dongguanting/Tool-Star-Qwen-1.5B

Text Generation • 2B • Updated 16 days ago • 23 • 2

dongguanting/Qwen3-8B-AEPO-DeepSearch

Text Generation • 8B • Updated Dec 20, 2025 • 1 • 2

dongguanting/QwQ-32B-AEPO-DeepSearch

Text Generation • 33B • Updated Dec 20, 2025 • 1 • 1

dongguanting/QwQ-32B-ARPO-DeepSearch

33B • Updated Dec 20, 2025 • 2 • 1

dongguanting/aepo_light

8B • Updated Nov 3, 2025 • 1

dongguanting/Qwen2.5-7B-AEPO

Text Generation • 8B • Updated Oct 27, 2025 • 5 • 1

dongguanting/Qwen3-14B-AEPO-DeepSearch

Robotics • 15B • Updated Oct 21, 2025 • 5 • 1

dongguanting/Qwen2.5-7B-ARPO

Text Generation • 8B • Updated Aug 19, 2025 • 5 • 2

dongguanting/Llama3.1-8B-ARPO

Text Generation • 8B • Updated Aug 12, 2025 • 1 • 1

dongguanting/Qwen2.5-3B-ARPO

Text Generation • 3B • Updated Aug 12, 2025 • 6 • 3

dongguanting/Qwen3-14B-ARPO-DeepSearch

Text Generation • 15B • Updated Aug 12, 2025 • 1 • 5

dongguanting/Qwen3-8B-ARPO-DeepSearch

8B • Updated Jul 29, 2025 • 2.52k • 2

dongguanting/Tool-Star-Qwen-7B

Text Generation • 8B • Updated Jun 30, 2025 • 7 • 2

dongguanting/RAG-Critic-3B

Text Generation • 3B • Updated Jun 28, 2025 • 3 • 4

dongguanting/Tool-Star-Qwen-0.5B

Text Generation • 0.6B • Updated Jun 6, 2025 • 5 • 1

dongguanting/Tool-Star-Qwen-3B

Text Generation • 3B • Updated May 25, 2025 • 7 • 5