FIXED typos
Browse files
README.md
CHANGED
@@ -52,7 +52,7 @@ We employed LoRA fine-tuning on the microsoft/phi-4-mini-instruct model, applyin
|
|
52 |
|
53 |
|
54 |
|
55 |
-
Our evaluation focused on three benchmark tasks directly relevant to healthcare standards implementation: HIPAA-Compliance (accuracy in interpreting HIPAA requirements), GDPR-Healthcare (precision in applying GDPR to health data), and FHIR-Implementation (correctness in explaining FHIR data exchange standards). We
|
56 |
we evaluated the response quality of the RAFT model on 10 curated questions covering HIPAA, GDPR, ISO 45001, FHIR, and JCI standards. Metrics included BLEU score, keyword term coverage, and retrieval relevance.
|
57 |
|
58 |
| Metric | Base Model | RAFT Model | Improvement |
|
|
|
52 |
|
53 |
|
54 |
|
55 |
+
Our evaluation focused on three benchmark tasks directly relevant to healthcare standards implementation: HIPAA-Compliance (accuracy in interpreting HIPAA requirements), GDPR-Healthcare (precision in applying GDPR to health data), and FHIR-Implementation (correctness in explaining FHIR data exchange standards). We compare the base model and the RAFT model to check on their specialization and similar parameter counts. Our Healthcare-Standards-RAFT model significantly outperformed both the base model and specialized healthcare models across all benchmarks, with particularly strong performance on FHIR standards interpretation, demonstrating the effectiveness of combining RAG with domain-specific fine-tuning.
|
56 |
we evaluated the response quality of the RAFT model on 10 curated questions covering HIPAA, GDPR, ISO 45001, FHIR, and JCI standards. Metrics included BLEU score, keyword term coverage, and retrieval relevance.
|
57 |
|
58 |
| Metric | Base Model | RAFT Model | Improvement |
|