Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective Paper • 2506.14965 • Published 16 days ago • 46
ELECTRA release Collection This collection regroups the ELECTRA models released by the Google team. • 6 items • Updated May 30 • 10
view article Article Open-R1: a fully open reproduction of DeepSeek-R1 By eliebak and 2 others • Jan 28 • 870
LINKS: English-English Mnemonics Collection Investigate the potential of mining linguistic knowledge/reasoning from LLM to generate mnemonic devices that aid vocabulary learning. • 6 items • Updated May 9 • 1
Gemma 3 QAT Collection Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 15 items • Updated May 30 • 201
Tools 4 learning AI Collection This is a collection of tools on the hub that teachers and students can use to learn AI! • 10 items • Updated 8 days ago • 67
view article Article You could have designed state of the art positional encoding By FL33TW00D-HF • Nov 25, 2024 • 306
view article Article Welcome Llama 4 Maverick & Scout on Hugging Face! By burtenshaw and 6 others • Apr 5 • 145
view article Article SmolLM - blazingly fast and remarkably powerful By loubnabnl and 2 others • Jul 16, 2024 • 384
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM By ariG23498 and 3 others • Mar 12 • 439
view article Article SmolVLM - small yet mighty Vision Language Model By andito and 4 others • Nov 26, 2024 • 326
view article Article Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models By loubnabnl and 2 others • Mar 20, 2024 • 96
RAG vs Fine-tuning: Pipelines, Tradeoffs, and a Case Study on Agriculture Paper • 2401.08406 • Published Jan 16, 2024 • 37