vinhnx90's Collections
Gemma 3 GRPO Fine Tuning
Updated 3 days ago
My collection of Gemma 3 1B models fine-tuned with reinforcement learning using the GRPO technique.
vinhnx90/gemma-3-1b-thinking-v2 • Text Generation • Updated 3 days ago • 17 • 1
vinhnx90/gemma-3-1b-thinking-v2-mlx-4Bit • Text Generation • Updated 3 days ago • 7 • 1
vinhnx90/gemma3-1b-thinking • Updated 3 days ago • 5
vinhnx90/gemma-3-1b-thinking-v2-base-mlx-8Bit • Text Generation • Updated 3 days ago • 4 • 1
vinhnx90/gemma-3-1b-thinking-v2-Q8_0-GGUF • Updated 3 days ago • 70 • 1
vinhnx90/gemma-3-1b-thinking-v2-Q4_K_M-GGUF • Updated 3 days ago • 40 • 2
vinhnx90/gemma-3-1b-thinking-v2-Q6_K-GGUF • Updated 3 days ago • 24
vinhnx90/gemma-3-1b-thinking-v2-Q5_K_M-GGUF • Updated 3 days ago • 25
vinhnx90/gemma-3-1b-thinking-v2-mlx-6Bit • Text Generation • Updated 3 days ago • 4