Executable Code Actions Elicit Better LLM Agents Paper β’ 2402.01030 β’ Published Feb 1, 2024 β’ 150
view article Article Introducing smolagents: simple agents that write actions in code. By m-ric and 2 others β’ Dec 31, 2024 β’ 1.06k
view article Article π¦Έπ»#14: What Is MCP, and Why Is Everyone β Suddenly!β Talking About It? By Kseniase β’ Mar 17 β’ 291
view article Article Decoding Strategies in Large Language Models By mlabonne β’ Oct 29, 2024 β’ 67
view article Article SmolLM - blazingly fast and remarkably powerful By loubnabnl and 2 others β’ Jul 16, 2024 β’ 378
Instruction Pre-Training: Language Models are Supervised Multitask Learners Paper β’ 2406.14491 β’ Published Jun 20, 2024 β’ 94
view article Article Open-R1: a fully open reproduction of DeepSeek-R1 By eliebak and 2 others β’ Jan 28 β’ 868
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models Paper β’ 2409.17146 β’ Published Sep 25, 2024 β’ 119
view article Article PaliGemma β Google's Cutting-Edge Open Vision Language Model By merve and 2 others β’ May 14, 2024 β’ 253
view article Article Illustrated LLM OS: An Implementational Perspective By shivance β’ Dec 3, 2023 β’ 20
MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark Paper β’ 2409.02813 β’ Published Sep 4, 2024 β’ 32
view article Article Welcome Gemma 2 - Google's new open LLM By philschmid and 5 others β’ Jun 27, 2024 β’ 129
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. β’ 26 items β’ Updated May 1 β’ 571