2000+ Run LLMs here - Directly in your browser Collection Try out LLMs here , there are also a few utilities, & image based models are further down in this list. See "Run Image Gen models..." collection too. • 27 items • Updated 18 days ago • 6
MathRL Collection Note: The solution may not be in `solution` or `answer` columns, but inside /boxed/{ANSWER} • 13 items • Updated 5 days ago • 1
view article Article StackLLaMA: A hands-on guide to train LLaMA with RLHF By edbeeching and 6 others • Apr 5, 2023 • 43
Reward Models Collection Nemotron reward models. For use in RLHF pipelines and LLM-as-a-Judge • 8 items • Updated 6 days ago • 20
Models I WIll GGUF Collection MODELS MUST BE <=22B. To add to this open this link: https://huggingface.co/collections/ReallyFloppyPenguin/models2gguflater-68503439edc1aa25cce7c79b • 0 items • Updated Jun 23 • 1
On Path to Multimodal Generalist: General-Level and General-Bench Paper • 2505.04620 • Published May 7 • 83
Absolute Zero: Reinforced Self-play Reasoning with Zero Data Paper • 2505.03335 • Published May 6 • 182
ZeroGPU Spaces Collection ZeroGPU Spaces made by the community • 17 items • Updated Jun 6, 2024 • 245