byh711/Qwen2.5-3B-MATH-GRPO-KOR

Reinforcement Learning

1 contributor

History: 5 commits

Latest commit: Update README.md (a89a1c9, verified), 6 days ago

File                       Size           Last commit                                                      Updated
.gitattributes             1.57 kB        Upload tokenizer (Trained with Unsloth)                          6 days ago
README.md                  2.27 kB        Update README.md                                                 6 days ago
adapter_config.json        817 Bytes      Initial upload of Korean Math GRPO model (Trained with Unsloth)  6 days ago
adapter_model.safetensors  479 MB (LFS)   Initial upload of Korean Math GRPO model (Trained with Unsloth)  6 days ago
added_tokens.json          605 Bytes      Upload tokenizer (Trained with Unsloth)                          6 days ago
chat_template.jinja        2.51 kB        Upload tokenizer (Trained with Unsloth)                          6 days ago
merges.txt                 1.67 MB        Upload tokenizer (Trained with Unsloth)                          6 days ago
special_tokens_map.json    614 Bytes      Upload tokenizer (Trained with Unsloth)                          6 days ago
tokenizer.json             11.4 MB (LFS)  Upload tokenizer (Trained with Unsloth)                          6 days ago
tokenizer_config.json      4.71 kB        Upload tokenizer (Trained with Unsloth)                          6 days ago
vocab.json                 2.78 MB        Upload tokenizer (Trained with Unsloth)                          6 days ago