Unlocking Healthcare AI: I'm Releasing State-of-the-Art Medical Models for Free. Forever. By MaziyarPanahi • 13 days ago • 124
LLMGameHub: How We Won the Gradio Agents & MCP Hackathon 2025 By kikikita and 1 other • 1 day ago • 12
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 195
OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models By nvidia and 3 others • 11 days ago • 47
AI Companionship: Why We Need to Evaluate How AI Systems Handle Emotional Bonds By giadap and 2 others • 8 days ago • 7
Detecting Beyond Sight: Building AI-Enabled SAR Intelligence with Synthetic Data By DualityAI-RebekahBogdanoff • 4 days ago • 4
We're open-sourcing "The Amazing Hand", a fully 3D printed robotic hand for less than $200 ✌️✌️✌️ By pollen-robotics and 2 others • 22 days ago • 34
Kimina-Prover: Applying Test-time RL Search on Large Formal Reasoning Models By AI-MO and 17 others • 19 days ago • 46
Understanding Model Reasoning Through Thought Anchors: A Comparative Study of Qwen3 and DeepSeek-R1 By codelion • 7 days ago • 3
Unlocking Healthcare AI: I'm Releasing State-of-the-Art Medical Models for Free. Forever. By MaziyarPanahi • 13 days ago • 124
LLMGameHub: How We Won the Gradio Agents & MCP Hackathon 2025 By kikikita and 1 other • 1 day ago • 12
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 195
OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models By nvidia and 3 others • 11 days ago • 47
AI Companionship: Why We Need to Evaluate How AI Systems Handle Emotional Bonds By giadap and 2 others • 8 days ago • 7
Detecting Beyond Sight: Building AI-Enabled SAR Intelligence with Synthetic Data By DualityAI-RebekahBogdanoff • 4 days ago • 4
We're open-sourcing "The Amazing Hand", a fully 3D printed robotic hand for less than $200 ✌️✌️✌️ By pollen-robotics and 2 others • 22 days ago • 34
Kimina-Prover: Applying Test-time RL Search on Large Formal Reasoning Models By AI-MO and 17 others • 19 days ago • 46
Understanding Model Reasoning Through Thought Anchors: A Comparative Study of Qwen3 and DeepSeek-R1 By codelion • 7 days ago • 3