view article Article No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL By toslali-ibm and 5 others • Jun 3 • 81
view article Article SmolLM3: smol, multilingual, long-context reasoner By loubnabnl and 22 others • 28 days ago • 606
Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning Paper • 2507.00432 • Published Jul 1 • 72
ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models Paper • 2505.24864 • Published May 30 • 133
view article Article Mitigating False Negatives in Multiple Negatives Ranking Loss for Retriever Training By dragonkue • May 25 • 11
Korean Embedding Models Collection A collection of embedding models that are optimized for understanding and representing Korean text. • 4 items • Updated Jun 10 • 2