Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
vinhnx90 's Collections
Orpheus TTS Fine Tune
Phi GRPO Fine Tuning
Qwen GRPO Fine Tuning
Gemma 3 GRPO Fine Tuning
Models
Datasets
Spaces
Research Papers

Gemma 3 GRPO Fine Tuning

updated Mar 22

My collecions of Gemma 3 1B RL fine-tuning using GPRO technique.

Upvote
-

  • vinhnx90/gemma-3-1b-thinking-v2

    Text Generation • 1.0B • Updated Mar 22 • 25 • 1

  • vinhnx90/gemma-3-1b-thinking-v2-mlx-4Bit

    Text Generation • 0.2B • Updated Mar 22 • 38 • 1

  • vinhnx90/gemma3-1b-thinking

    Updated Mar 22 • 5

  • vinhnx90/gemma-3-1b-thinking-v2-base-mlx-8Bit

    Text Generation • 0.4B • Updated Mar 22 • 31 • 1

  • vinhnx90/gemma-3-1b-thinking-v2-Q8_0-GGUF

    1.0B • Updated Mar 22 • 10 • 1

  • vinhnx90/gemma-3-1b-thinking-v2-Q4_K_M-GGUF

    1.0B • Updated Mar 22 • 30 • 3

  • vinhnx90/gemma-3-1b-thinking-v2-Q6_K-GGUF

    1.0B • Updated Mar 22 • 10

  • vinhnx90/gemma-3-1b-thinking-v2-Q5_K_M-GGUF

    1.0B • Updated Mar 22 • 6

  • vinhnx90/gemma-3-1b-thinking-v2-mlx-6Bit

    Text Generation • 0.3B • Updated Mar 22 • 28
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs