view article Article Introducing smolagents: simple agents that write actions in code. Dec 31, 2024 • 734
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 138
DataGemma Release Collection A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated Dec 13, 2024 • 85
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 3 items • Updated 26 days ago • 359
DeepSeek R1 (All Versions) Collection DeepSeek R1 - the most powerful reasoning open-source model - available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 29 items • Updated 14 days ago • 192
Running on Zero 1.8k 1.8k Chat With Janus-Pro-7B 🌍 A unified multimodal understanding and generation model.