Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
matthewchung74
/
Qwen2.5_3B-GRPO-medical-reasoning
like
0
Text Generation
Transformers
Safetensors
qwen2
unsloth
trl
grpo
medical-reasoning
qwen
conversational
text-generation-inference
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
Qwen2.5_3B-GRPO-medical-reasoning
Commit History
Update README.md
b308a4f
verified
matthewchung74
commited on
Feb 23
Update README.md
00b6cfe
verified
matthewchung74
commited on
Feb 23
Update README.md
71355f1
verified
matthewchung74
commited on
Feb 23
Trained with Unsloth
4e0e490
verified
matthewchung74
commited on
Feb 23
Trained with Unsloth
7e742e6
verified
matthewchung74
commited on
Feb 14
Upload tokenizer
21985eb
verified
matthewchung74
commited on
Feb 14
Upload tokenizer
118665d
verified
matthewchung74
commited on
Feb 13
Delete vocab.json
e34dcb7
verified
matthewchung74
commited on
Feb 13
Delete tokenizer_config.json
4f2b557
verified
matthewchung74
commited on
Feb 13
Delete tokenizer.json
9973920
verified
matthewchung74
commited on
Feb 13
Delete special_tokens_map.json
65818c6
verified
matthewchung74
commited on
Feb 13
Delete merges.txt
e750a4f
verified
matthewchung74
commited on
Feb 13
Delete added_tokens.json
523e377
verified
matthewchung74
commited on
Feb 13
Delete adapter_model.safetensors
8d51e8f
verified
matthewchung74
commited on
Feb 13
Delete adapter_config.json
c675a4e
verified
matthewchung74
commited on
Feb 13
Delete README.md
2046f08
verified
matthewchung74
commited on
Feb 13
Delete .gitattributes
ad81a06
verified
matthewchung74
commited on
Feb 13
Upload tokenizer
0972793
verified
matthewchung74
commited on
Feb 13
Upload model trained with Unsloth
03f611d
verified
matthewchung74
commited on
Feb 13
Trained with Unsloth
a1667b9
verified
matthewchung74
commited on
Feb 13
Trained with Unsloth
1562f95
verified
matthewchung74
commited on
Feb 13
Upload tokenizer
009ca62
verified
matthewchung74
commited on
Feb 13
initial commit
67bdf4e
verified
matthewchung74
commited on
Feb 13