gmongaras/Llama3.1_8B_Instruct_GRPO_gsm8k
gmongaras committed on Apr 15
Commit c4c372d (verified) · 1 parent: 84e2723
Create README.md
Files changed (1)
README.md (added, +1 -0)
@@ -0,0 +1 @@
+Model trained using scripts from [https://github.com/gmongaras/GRPO-DAPO-Tests](https://github.com/gmongaras/GRPO-DAPO-Tests)