Update README.md
Browse files
README.md
CHANGED
@@ -10,7 +10,7 @@ tags:
|
|
10 |
|
11 |
# Qwen2.5 3B GRPO Medical Reasoning Model
|
12 |
|
13 |
-
A fine-tuned version of Qwen2.5 3B Instruct model using Generalized Reinforcement Policy Optimization (GRPO) for medical reasoning tasks. This model is intended for education purposes only.
|
14 |
|
15 |
## Model Details
|
16 |
|
@@ -33,11 +33,11 @@ This model is a fine-tuned version of Qwen2.5 3B Instruct, optimized for medical
|
|
33 |
|
34 |
### Direct Use
|
35 |
|
36 |
-
This model is intended for education purposes only.
|
37 |
|
38 |
### Downstream Use
|
39 |
|
40 |
-
This model is intended for education purposes only.
|
41 |
|
42 |
### Out-of-Scope Use
|
43 |
|
|
|
10 |
|
11 |
# Qwen2.5 3B GRPO Medical Reasoning Model
|
12 |
|
13 |
+
A fine-tuned version of Qwen2.5 3B Instruct model using Generalized Reinforcement Policy Optimization (GRPO) for medical reasoning tasks. This model is intended for education purposes only and not intended as medical advice.
|
14 |
|
15 |
## Model Details
|
16 |
|
|
|
33 |
|
34 |
### Direct Use
|
35 |
|
36 |
+
This model is intended for education purposes only and not intended as providing medical advice.
|
37 |
|
38 |
### Downstream Use
|
39 |
|
40 |
+
This model is intended for education purposes only and not intended as providing medical advice.
|
41 |
|
42 |
### Out-of-Scope Use
|
43 |
|