HANI-LAB
/

Med-REFL-MedReason-8B-lora

Question Answering

medical-reasoning

Model card Files Files and versions Community

CityU-Zongxian commited on Jun 19

Commit

b048e80

·

verified ·

1 Parent(s): 779462d

Update README.md

Files changed (1) hide show

README.md +11 -0

README.md CHANGED Viewed

@@ -31,6 +31,17 @@ Instead of focusing solely on the final answer, Med-REFL improves the model's in
 This repository contains the LoRA weights produced by the Med-REFL framework for various base models.
 # <span>Available Weights</span>
 The Med-REFL LoRA weights can be applied to the following base models to enhance their medical reasoning abilities.

 This repository contains the LoRA weights produced by the Med-REFL framework for various base models.
+# <span>MedReason-8B Model Performance</span>
+The following table shows the performance of the MedReason-8B model on In-Domain and Out-of-Domain benchmarks before and after applying Med-REFL.
+| Domain | Benchmark | Original | **+ Med-REFL** |
+| :--- | :--- | :--- | :--- |
+| **In-Domain** | MedQA-USMLE | 66.27 | **70.16** <span style="color: #2E8B57; font-size: small;">(+3.89)</span> |
+| **Out-of-Domain**| MedMCQA | 58.98 | **59.78** <span style="color: #2E8B57; font-size: small;">(+0.80)</span> |
+| **Out-of-Domain**| GPQA (Med+) | 45.64 | **49.84** <span style="color: #2E8B57; font-size: small;">(+4.20)</span> |
+| **Out-of-Domain**| MMLU-Pro (Med+) | 59.14 | **62.51** <span style="color: #2E8B57; font-size: small;">(+3.37)</span> |
 # <span>Available Weights</span>
 The Med-REFL LoRA weights can be applied to the following base models to enhance their medical reasoning abilities.