The Bitter Lesson Learned from 2,000+ Multilingual Benchmarks Paper β’ 2504.15521 β’ Published 18 days ago β’ 63
kaitchup/DeepSeek-R1-Distill-Qwen-14B-AutoRound-GPTQ-4bit Text Generation β’ Updated Jan 27 β’ 810 β’ 6
Light-R1 Collection Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond β’ 7 items β’ Updated Mar 13 β’ 11
SIFT: Grounding LLM Reasoning in Contexts via Stickers Paper β’ 2502.14922 β’ Published Feb 19 β’ 31
TransMLA: Multi-head Latent Attention Is All You Need Paper β’ 2502.07864 β’ Published Feb 11 β’ 50