What Open-Source Developers Need to Know about the EU AI Act's Rules for GPAI Models By yjernite and 5 others • 7 days ago • 23
LLM agent experiment with a purpose-built RPG and tool calls. (Work in progress) By neph1 • 6 days ago • 7
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 201
Building Enterprise-Ready Text Classifiers in Minutes with Adaptive Learning By codelion • 2 days ago • 6
What I Learned Upscaling a Long-distance Midjourney Photo w/ Stable Diffusion PLUS unboxing Qwen Image & Wan 2.2 By jasonhargrove • 2 days ago • 5
RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation By Alibaba-DAMO-Academy and 9 others • about 2 hours ago • 5
makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch By AviSoori1x • May 7, 2024 • 93
OpenAI just dropped two massive open-weight models — *but how do we separate the reality from the hype?* By stefanwebb and 2 others • 1 day ago • 4