DARJYO/persadian_14B-GRPO

Reinforcement Learning
Transformers
GGUF
English
text-generation-inference
trl
vllm
datasets
  • 1 contributor
History: 6 commits
Latest commit: darjyo, "Upload persadian_14B_GRPO.gguf" (8ea353c, verified), 6 months ago
  • .gitattributes
    1.52 kB
    initial commit 6 months ago
  • README.md
    767 Bytes
    Update README.md 6 months ago
  • persadian_14B_GRPO.gguf
    480 kB
    Upload persadian_14B_GRPO.gguf 6 months ago
  • persadian_14B_GRPO.ipynb
    480 kB
    Upload 2 files 6 months ago
  • persadian_14b_grpo.py
    19.7 kB
    Upload 2 files 6 months ago