MassiveDS Collection Data, embedding, and index of MassiveDS by "Scaling Retrieval-Based Language Models with a Trillion-Token Datastore" • 5 items • Updated Oct 22, 2024 • 2
Scaling Retrieval-Based Language Models with a Trillion-Token Datastore Paper • 2407.12854 • Published Jul 9, 2024 • 31
🔱 Sailor2 Language Models Collection Sailing in South-East Asia with Inclusive Multilingual LLMs • 34 items • Updated 2 days ago • 24
Lucie LLM Collection Open source LLM for French, English, German, Spanish and Italian • 7 items • Updated 4 days ago • 20
MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency Paper • 2502.09621 • Published 8 days ago • 26
🇫🇷 Calme-3 Collection Here you can find all the new Calme-3 models • 27 items • Updated 12 days ago • 13
FlashVideo:Flowing Fidelity to Detail for Efficient High-Resolution Video Generation Paper • 2502.05179 • Published 14 days ago • 22
Kyro-n1 Collection Kyro-n1, Open-Neo's very first reasoning model with light to medium reasoning and powerful performance compared to other models in the same size. • 8 items • Updated 4 days ago • 7
Granite Data Collection This collection has a set of artifacts which are related to curating and evaluating datasets used for Granite models • 7 items • Updated 2 days ago • 2
🧠 Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 12 items • Updated 1 day ago • 77
view article Article Announcing the winners of the Frugal AI Challenge 🌱 By frugal-ai-challenge and 1 other • 10 days ago • 6
SmolVLM Collection State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct • 5 items • Updated 1 day ago • 34
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published 17 days ago • 187
CodeSteer: Symbolic-Augmented Language Models via Code/Text Guidance Paper • 2502.04350 • Published 17 days ago • 11