deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B Text Generation • 2B • Updated Feb 24 • 731k • 1.31k
Jackrong/GPT-OSS-20B-Distilled-Reasoning-Mini Viewer • Updated 10 days ago • 1.96k • 569 • 16
Large Language Model Agent: A Survey on Methodology, Applications and Challenges Paper • 2503.21460 • Published Mar 27 • 79
Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving Paper • 2507.06229 • Published Jul 8 • 73
Cognitive Kernel-Pro: A Framework for Deep Research Agents and Agent Foundation Models Training Paper • 2508.00414 • Published 20 days ago • 86
A Survey of Context Engineering for Large Language Models Paper • 2507.13334 • Published Jul 17 • 245
Pre-Trained Policy Discriminators are General Reward Models Paper • 2507.05197 • Published Jul 7 • 39