view article Article FineWeb-C: A Community-Driven Dataset for Educational Quality Annotations in 122 Languages By davanstrien and 5 others β’ Jul 8 β’ 30
view article Article SmolLM3: smol, multilingual, long-context reasoner By loubnabnl and 22 others β’ Jul 8 β’ 647
view article Article Fixing Open LLM Leaderboard with Math-Verify By hynky and 3 others β’ Feb 14 β’ 30
view article Article FineWeb2-C: Help Build Better Language Models in Your Language By davanstrien and 5 others β’ Dec 23, 2024 β’ 21
view article Article π¨πΏ BenCzechMark - Can your LLM Understand Czech? By mfajcik and 12 others β’ Oct 1, 2024 β’ 22