Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
DARJYO
/
persadian_14B-GRPO
like
0
Follow
DARJYO
1
Reinforcement Learning
Transformers
GGUF
English
text-generation-inference
trl
vllm
datasets
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
persadian_14B-GRPO
Ctrl+K
Ctrl+K
1 contributor
History:
6 commits
darjyo
Upload persadian_14B_GRPO.gguf
8ea353c
verified
6 months ago
.gitattributes
Safe
1.52 kB
initial commit
6 months ago
README.md
767 Bytes
Update README.md
6 months ago
persadian_14B_GRPO.gguf
480 kB
Upload persadian_14B_GRPO.gguf
6 months ago
persadian_14B_GRPO.ipynb
480 kB
Upload 2 files
6 months ago
persadian_14b_grpo.py
19.7 kB
Upload 2 files
6 months ago