rStar2-Agent: Agentic Reasoning Technical Report Paper β’ 2508.20722 β’ Published 12 days ago β’ 101
view article Article No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL By toslali-ibm and 5 others β’ Jun 3 β’ 86
ZeroGPU Spaces Collection ZeroGPU Spaces made by the community β’ 17 items β’ Updated Jun 6, 2024 β’ 245
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 β’ 11 items β’ Updated Jul 21 β’ 535
Sibyl: Simple yet Effective Agent Framework for Complex Real-world Reasoning Paper β’ 2407.10718 β’ Published Jul 15, 2024 β’ 19
view article Article Introducing Idefics2: A Powerful 8B Vision-Language Model for the community By Leyo and 2 others β’ Apr 15, 2024 β’ 186
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training Paper β’ 2403.09611 β’ Published Mar 14, 2024 β’ 130
Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs Paper β’ 2404.05719 β’ Published Apr 8, 2024 β’ 83
Restoration by Generation with Constrained Priors Paper β’ 2312.17161 β’ Published Dec 28, 2023 β’ 4