vinhnx90's Collections
Gemma 3 GRPO Fine Tuning
Updated Mar 22
My collection of Gemma 3 1B RL fine-tunes using the GRPO technique.
vinhnx90/gemma-3-1b-thinking-v2
Text Generation
•
Updated
Mar 22
•
11
•
1
vinhnx90/gemma-3-1b-thinking-v2-mlx-4Bit
Text Generation
•
Updated
Mar 22
•
18
•
1
vinhnx90/gemma3-1b-thinking
Updated
Mar 22
•
5
vinhnx90/gemma-3-1b-thinking-v2-base-mlx-8Bit
Text Generation
•
Updated
Mar 22
•
17
•
1
vinhnx90/gemma-3-1b-thinking-v2-Q8_0-GGUF
Updated
Mar 22
•
3
•
1
vinhnx90/gemma-3-1b-thinking-v2-Q4_K_M-GGUF
Updated
Mar 22
•
20
•
3
vinhnx90/gemma-3-1b-thinking-v2-Q6_K-GGUF
Updated
Mar 22
•
1
vinhnx90/gemma-3-1b-thinking-v2-Q5_K_M-GGUF
Updated
Mar 22
vinhnx90/gemma-3-1b-thinking-v2-mlx-6Bit
Text Generation
•
Updated
Mar 22
•
16