In a Training Loop 🔄

20 286

Gurvaah Singh

ReallyFloppyPenguin

·

https://gurvaahsingh.vercel.app/

AI & ML interests

AI, GGUFing AI, AI, Running AI, Thinking about AI, and so on

Recent Activity

liked a dataset 13 days ago

Lichess/standard-chess-games

liked a Space about 1 month ago

Qwen/Qwen-Image-Layered

liked a Space 3 months ago

mteb/leaderboard

Organizations

upvoted a collection 6 months ago

DeepSeek-V3.1

4 items • Updated Nov 27, 2025 • 259

upvoted an article 6 months ago

Article

Create Mixtures of Experts with MergeKit

Mar 28, 2024 • 27

upvoted a collection 6 months ago

MathRL

Note: The solution may not be in `solution` or `answer` columns, but inside /boxed/{ANSWER} • 13 items • Updated Aug 16, 2025 • 1

upvoted a paper 6 months ago

R-Zero: Self-Evolving Reasoning LLM from Zero Data

Paper • 2508.05004 • Published Aug 7, 2025 • 130

upvoted a collection 6 months ago

oh

13 items • Updated Mar 5, 2025 • 1

upvoted an article 6 months ago

Article

StackLLaMA: A hands-on guide to train LLaMA with RLHF

+5

Apr 5, 2023 • 48

upvoted a collection 7 months ago

Reward Models 06-2025

Nemotron reward models. For use in RLHF pipelines and LLM-as-a-Judge • 8 items • Updated 1 day ago • 23

upvoted 2 collections 8 months ago

Llama 4

Llama 4 release • 13 items • Updated Apr 29, 2025 • 690

GGUFs

21 items • Updated Jul 9, 2025 • 1

upvoted 2 changelogs 8 months ago

Changelog

New Model Filtering Options on the Hub

Jun 16, 2025 • 76

Changelog

Add MCP-Compatible Spaces to Your Tools

Jun 17, 2025 • 86

upvoted a collection 8 months ago

Models I WIll GGUF

MODELS MUST BE <=22B. To add to this open this link: https://huggingface.co/collections/ReallyFloppyPenguin/models2gguflater-68503439edc1aa25cce7c79b • 0 items • Updated Jun 23, 2025 • 1

upvoted a changelog 8 months ago

Changelog

Connect Your MCP Client to the Hugging Face Hub

Jun 6, 2025 • 112

upvoted a paper 9 months ago

Visual Planning: Let's Think Only with Images

Paper • 2505.11409 • Published May 16, 2025 • 57

upvoted a collection 9 months ago

Interesting Papers

4 items • Updated Jun 23, 2025 • 1

upvoted 3 papers 9 months ago

On Path to Multimodal Generalist: General-Level and General-Bench

Paper • 2505.04620 • Published May 7, 2025 • 82

System Prompt Optimization with Meta-Learning

Paper • 2505.09666 • Published May 14, 2025 • 71

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6, 2025 • 189

upvoted a collection 9 months ago

ZeroGPU Spaces

ZeroGPU Spaces made by the community • 17 items • Updated Jun 6, 2024 • 247