Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference Providers
Select all
Fireworks
Together AI
Novita
Cerebras
Nscale
Nebius AI Studio
SambaNova
Replicate
Cohere
fal
Hyperbolic
HF Inference API
Misc
Reset Misc
GRPO
Inference Endpoints
text-generation-inference
Merge
4-bit precision
custom_code
Misc with no match
Eval Results
8-bit precision
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
98
Full-text search
Edit filters
Sort: Trending
Active filters:
GRPO
Clear all
Nitral-AI/Captain-Eris_Violet-GRPO-v0.420
Text Generation
•
Updated
Apr 14
•
125
•
•
22
Ihor/Text2Graph-R1-Qwen2.5-0.5b
Text Generation
•
Updated
Jan 30
•
865
•
20
prithivMLmods/Bellatrix-Tiny-1B-R1
Text Generation
•
Updated
Feb 2
•
17
•
1
mradermacher/Bellatrix-Tiny-1B-R1-GGUF
Updated
Feb 3
•
116
mradermacher/Bellatrix-Tiny-1B-R1-i1-GGUF
Updated
Feb 3
•
181
Novaciano/Bellatrix-1B-R1_Erotiquant3_IQ4_XS-GGUF
Text Generation
•
Updated
Feb 3
•
4
Novaciano/Bellatrix-1B-R1_Erotiquant3_Q5_K_M-GGUF
Text Generation
•
Updated
Feb 3
•
5
Triangle104/Bellatrix-Tiny-1B-R1-Q4_K_S-GGUF
Text Generation
•
Updated
Feb 3
•
19
Triangle104/Bellatrix-Tiny-1B-R1-Q4_K_M-GGUF
Text Generation
•
Updated
Feb 3
•
3
Triangle104/Bellatrix-Tiny-1B-R1-Q5_K_S-GGUF
Text Generation
•
Updated
Feb 3
•
10
Triangle104/Bellatrix-Tiny-1B-R1-Q5_K_M-GGUF
Text Generation
•
Updated
Feb 3
•
12
Triangle104/Bellatrix-Tiny-1B-R1-Q6_K-GGUF
Text Generation
•
Updated
Feb 3
•
10
Triangle104/Bellatrix-Tiny-1B-R1-Q8_0-GGUF
Text Generation
•
Updated
Feb 3
•
7
tecosys/Nutaan-RL1
Reinforcement Learning
•
Updated
Feb 7
•
258
mradermacher/Text2Graph-R1-Qwen2.5-0.5b-GGUF
Updated
Feb 9
•
109
mradermacher/Text2Graph-R1-Qwen2.5-0.5b-i1-GGUF
Updated
Feb 9
•
102
alpha-ai/Deep-Reason-SMALL-V0-GGUF
Updated
Feb 26
•
61
•
1
alpha-ai/Deep-Reason-SMALL-V0
Text Generation
•
Updated
Feb 26
•
16
•
2
mradermacher/Deep-Reason-SMALL-V0-GGUF
Updated
Feb 9
•
40
•
2
mradermacher/Deep-Reason-SMALL-V0-i1-GGUF
Updated
Feb 9
•
116
•
1
alpha-ai/qwen2.5-reason-thought-lite-GGUF
Updated
Apr 28
•
46
alpha-ai/qwen2.5-reason-thought-lite
Text Generation
•
Updated
Apr 28
•
11
alpha-ai/llama-3.2-3B-Reason-Reflect-Lite-GGUF
Updated
Feb 26
•
37
•
1
alpha-ai/llama-3.2-3B-Reason-Reflect-Lite
Text Generation
•
Updated
Feb 26
•
13
Daemontatox/Cogito-R1
Text Generation
•
Updated
Feb 19
•
11
•
5
mradermacher/Cogito-R1-GGUF
Updated
Feb 12
•
76
accuracy-maker/Llama-3.2-1B-GRPO-gsm8k
Text Generation
•
Updated
Feb 12
•
26
mradermacher/Cogito-R1-i1-GGUF
Updated
Feb 13
•
438
AaryanK/Qwen_2.5_3B_GRPO_Reasoning_XIOSERV
Updated
Feb 17
•
33
•
1
prithivMLmods/SmolLM2_135M_Grpo_Gsm8k
Text Generation
•
Updated
Feb 17
•
72
•
7
Previous
1
2
3
4
Next