matthewchung74
/

Qwen2.5_3B-GRPO-medical-reasoning

Text Generation

medical-reasoning

text-generation-inference

Model card Files Files and versions Community

matthewchung74 commited on Feb 23

Commit

b308a4f

·

verified ·

1 Parent(s): 00b6cfe

Update README.md

Files changed (1) hide show

README.md +3 -3

README.md CHANGED Viewed

@@ -10,7 +10,7 @@ tags:
 # Qwen2.5 3B GRPO Medical Reasoning Model
-A fine-tuned version of Qwen2.5 3B Instruct model using Generalized Reinforcement Policy Optimization (GRPO) for medical reasoning tasks. This model is intended for education purposes only.
 ## Model Details
@@ -33,11 +33,11 @@ This model is a fine-tuned version of Qwen2.5 3B Instruct, optimized for medical
 ### Direct Use
-This model is intended for education purposes only.
 ### Downstream Use
-This model is intended for education purposes only.
 ### Out-of-Scope Use

 # Qwen2.5 3B GRPO Medical Reasoning Model
+A fine-tuned version of Qwen2.5 3B Instruct model using Generalized Reinforcement Policy Optimization (GRPO) for medical reasoning tasks. This model is intended for education purposes only and not intended as medical advice.
 ## Model Details
 ### Direct Use
+This model is intended for education purposes only and not intended as providing medical advice.
 ### Downstream Use
+This model is intended for education purposes only and not intended as providing medical advice.
 ### Out-of-Scope Use