Update README.md
Browse files
README.md
CHANGED
@@ -42,6 +42,10 @@ For the Reinforcement Learning (RL) stage, we designed a two-stage training proc
|
|
42 |
|
43 |
## III. Evaluation Results
|
44 |
|
|
|
|
|
|
|
|
|
45 |
Our II-Medical-8B-1706 model achieved a 46.8% score on [HealthBench](https://openai.com/index/healthbench/), a comprehensive open-source benchmark evaluating the performance and safety of large language models in healthcare. This performance is comparable to MedGemma-27B from Google. We provide a comparison to models available in ChatGPT below.
|
46 |
|
47 |
<!--  -->
|
|
|
42 |
|
43 |
## III. Evaluation Results
|
44 |
|
45 |
+

|
46 |
+
|
47 |
+

|
48 |
+
|
49 |
Our II-Medical-8B-1706 model achieved a 46.8% score on [HealthBench](https://openai.com/index/healthbench/), a comprehensive open-source benchmark evaluating the performance and safety of large language models in healthcare. This performance is comparable to MedGemma-27B from Google. We provide a comparison to models available in ChatGPT below.
|
50 |
|
51 |
<!--  -->
|