Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective Paper • 2506.14965 • Published 16 days ago • 46
LINKS: English-English Mnemonics Collection Investigate the potential of mining linguistic knowledge/reasoning from LLM to generate mnemonic devices that aid vocabulary learning. • 6 items • Updated May 9 • 1
ELECTRA release Collection This collection regroups the ELECTRA models released by the Google team. • 6 items • Updated May 30 • 10
Article Open-R1: a fully open reproduction of DeepSeek-R1 By eliebak and 2 others • Jan 28 • 870
Gemma 3 QAT Collection Quantization-Aware Trained (QAT) Gemma 3 checkpoints. The models preserve quality similar to half precision while using 3x less memory • 15 items • Updated May 30 • 201
Tools 4 learning AI Collection This is a collection of tools on the hub that teachers and students can use to learn AI! • 10 items • Updated 8 days ago • 67
Article You could have designed state of the art positional encoding By FL33TW00D-HF • Nov 25, 2024 • 306
Article Welcome Llama 4 Maverick & Scout on Hugging Face! By burtenshaw and 6 others • Apr 5 • 145