Multi-Agent Design: Optimizing Agents with Better Prompts and Topologies Paper • 2502.02533 • Published Feb 4 • 2
Reward Bench 2 Collection Datasets, spaces, and models for Reward Bench 2 benchmark and paper! • 11 items • Updated 11 days ago • 11
Dreamland: Controllable World Creation with Simulator and Generative Models Paper • 2506.08006 • Published 4 days ago • 7
view article Article From OpenAI to Open LLMs with Messages API By andrewrreed and 3 others • Feb 8, 2024 • 20
view article Article KV Cache from scratch in nanoVLM By ariG23498 and 4 others • 10 days ago • 67
Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math Paper • 2504.21233 • Published Apr 30 • 46
view article Article DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 152
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems Paper • 2504.01990 • Published Mar 31 • 287
view article Article You could have designed state of the art positional encoding By FL33TW00D-HF • Nov 25, 2024 • 300
NV-Embed Collection NV-Embed is a generalist embedding model encompassing retrieval, reranking, classification, clustering, STS tasks. • 3 items • Updated 2 days ago • 14
SimpleRL-Zoo Collection The collection for the Paper "SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild" • 13 items • Updated May 5 • 7
A Comprehensive Survey on Long Context Language Modeling Paper • 2503.17407 • Published Mar 20 • 49
view article Article SmolVLM2: Bringing Video Understanding to Every Device By orrzohar and 6 others • Feb 20 • 266