Article The Transformers Library: standardizing model definitions By lysandre and 3 others • 2 days ago • 73
Article Vision Language Models (Better, Faster, Stronger) By merve and 4 others • 5 days ago • 298
Scalable Chain of Thoughts via Elastic Reasoning Paper • 2505.05315 • Published 9 days ago • 23
Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs Paper • 2504.17432 • Published 23 days ago • 38
March 2025 - Open releases from the Chinese community Collection 32 items • Updated about 22 hours ago • 13
MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents Paper • 2503.01935 • Published Mar 3 • 27
TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation Paper • 2503.04872 • Published Mar 6 • 15
Article #89: AI in Action: How AI Engineers, Self-Optimizing Models, and Humanoid Robots Are Reshaping 2025 By Kseniase • Feb 25 • 4
Article What is test-time compute and how to scale it? By Kseniase and 1 other • Feb 6 • 86
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4 • 229
Article Open-source DeepResearch - Freeing our search agents By m-ric and 4 others • Feb 4 • 1.24k
Article How to deploy and fine-tune DeepSeek models on AWS By pagezyhf and 2 others • Jan 30 • 52
Article Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference By mfuntowicz and 1 other • Jan 16 • 73
Article Topic 23: What is LLM Inference, it's challenges and solutions for it By Kseniase • Jan 17 • 5
Centurio Collection Artifacts of the paper "Centurio: On Drivers of Multilingual Ability of Large Vision-Language Model" • 6 items • Updated Feb 4 • 4