Create README.md
Browse files
README.md
ADDED
@@ -0,0 +1,146 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
license: apache-2.0
|
3 |
+
language:
|
4 |
+
- en
|
5 |
+
- ro
|
6 |
+
base_model: google/gemma-3-4b-it
|
7 |
+
datasets:
|
8 |
+
- nicoboss/medra-medical
|
9 |
+
tags:
|
10 |
+
- text-generation
|
11 |
+
- medical-ai
|
12 |
+
- question-answering
|
13 |
+
- summarization
|
14 |
+
- dermatology
|
15 |
+
- gemma-3
|
16 |
+
- fine-tuned
|
17 |
+
pipeline_tag: text-generation
|
18 |
+
|
19 |
+
|
20 |
+
## Medra: Your Compact Medical Reasoning Partner
|
21 |
+
|
22 |
+
Model Size: 4b
|
23 |
+
Version: Medra v1 (Gemma Edition)
|
24 |
+
Format: GGUF (Q4, Q8, BF16)
|
25 |
+
License: Apache 2.0
|
26 |
+
Author: Dr. Alexandru Lupoi & @nicoboss
|
27 |
+
|
28 |
+
---
|
29 |
+
|
30 |
+
|
31 |
+

|
32 |
+
|
33 |
+
## Overview
|
34 |
+
|
35 |
+
**Medra** is a purpose-built, lightweight medical language model designed to assist in clinical reasoning, education, and dialogue modeling.
|
36 |
+
Built on top of **Gemma 3**, Medra is the first step in a long-term project to create deployable, interpretable, and ethically aligned AI support systems for medicine.
|
37 |
+
|
38 |
+
It is compact enough to run on consumer hardware.
|
39 |
+
Capable enough to support nuanced medical prompts.
|
40 |
+
And principled enough to never pretend to replace human judgment.
|
41 |
+
|
42 |
+
Medra is not a chatbot.
|
43 |
+
It is a **cognitive tool**—a reasoning companion for students, clinicians, and researchers exploring how AI can help illuminate the complexity of care without oversimplifying it.
|
44 |
+
|
45 |
+
---
|
46 |
+
|
47 |
+
## Purpose & Philosophy
|
48 |
+
|
49 |
+
Medra was developed to fill a crucial gap in the current AI landscape:
|
50 |
+
|
51 |
+
While many general-purpose LLMs excel at open-domain conversation, very few are optimized for **structured, medically relevant reasoning.**
|
52 |
+
Even fewer can run **locally**, offline, and in real-time—particularly in environments where access to massive models is impractical or unethical.
|
53 |
+
|
54 |
+
Medra aims to provide:
|
55 |
+
- Interpretable outputs for case simulation and review
|
56 |
+
- Support for differential diagnosis exploration
|
57 |
+
- A reflective partner for medical students
|
58 |
+
- A framework for reasoning refinement in applied clinical contexts
|
59 |
+
|
60 |
+
This project is rooted in the belief that AI in healthcare must be **transparent**, **educational**, and **augmentative**—not autonomous, extractive, or misleading.
|
61 |
+
|
62 |
+
---
|
63 |
+
|
64 |
+
## Key Capabilities
|
65 |
+
|
66 |
+
- **Lightweight Clinical Reasoning Core**
|
67 |
+
Medra is fine-tuned to support structured medical queries, diagnostic steps, SOAP formatting, and clinical questioning strategies.
|
68 |
+
|
69 |
+
- **Local and Mobile Friendly**
|
70 |
+
Offered in GGUF (Q4, Q8, BF16), Medra can run on local devices via Ollama, LM Studio, KoboldCpp, and other local inference engines—no API needed.
|
71 |
+
|
72 |
+
- **Data & Alignment**
|
73 |
+
Trained on medical content including PubMed-derived literature, reasoning datasets (e.g. R1 distilled), clinical notes, and prompt structures modeled after real-world physician interactions.
|
74 |
+
|
75 |
+
- **High Interpretability**
|
76 |
+
Designed for transparency and reflection—not black-box decision-making. Medra works best when prompted like a partner, not a prophet.
|
77 |
+
|
78 |
+
- **Designed for Ethical Integration**
|
79 |
+
Built with the explicit goal of remaining aligned, cautious, and useful for **human-in-the-loop** medical settings.
|
80 |
+
|
81 |
+
---
|
82 |
+
|
83 |
+
## Intended Use
|
84 |
+
|
85 |
+
- Medical education and exam-style reasoning
|
86 |
+
- Case-based learning simulation
|
87 |
+
- AI health assistant prototyping
|
88 |
+
- Dialogue modeling in therapeutic or diagnostic contexts
|
89 |
+
- As a tool for **thinking alongside**, not thinking *instead of*
|
90 |
+
|
91 |
+
---
|
92 |
+
|
93 |
+
## Limitations
|
94 |
+
|
95 |
+
- Medra is **not a licensed medical professional.**
|
96 |
+
It is not intended for real-world diagnosis, treatment planning, or patient interaction without human oversight.
|
97 |
+
|
98 |
+
- The model may hallucinate, oversimplify, or present outdated medical knowledge in edge cases.
|
99 |
+
|
100 |
+
- Medra is not currently equipped with long-term memory, real-world clinical data access, or the authority to guide care.
|
101 |
+
|
102 |
+
- It is a prototype. A foundation. Not a finished replacement for expertise.
|
103 |
+
|
104 |
+
---
|
105 |
+
|
106 |
+
## Technical Details
|
107 |
+
|
108 |
+
- **Base model:** Gemma 3
|
109 |
+
- **Fine-tuning stages:** Instructional tuning (STF); RLHF planned in upcoming release
|
110 |
+
- **Data domains:** Medical Q&A, differential diagnosis formats, clinical conversation datasets, PubMed-derived material
|
111 |
+
- **Supported inference engines:** Ollama, LM Studio, KoboldCpp, GGML-compatible platforms
|
112 |
+
- **Quantization formats:** Q4, Q8, BF16
|
113 |
+
|
114 |
+
---
|
115 |
+
|
116 |
+
## License
|
117 |
+
|
118 |
+
Apache 2.0
|
119 |
+
|
120 |
+
---
|
121 |
+
|
122 |
+
## The Medra Family
|
123 |
+
|
124 |
+
Medra is part of a growing family of medical reasoning models:
|
125 |
+
|
126 |
+
- **Medra** — Gemma-based compact model for lightweight local inference
|
127 |
+
- **MedraQ** — Qwen 3-based, multilingual and adaptive version
|
128 |
+
- **MedraOmni** — Future flagship model built on Qwen 2.5 Omni with full multimodal support
|
129 |
+
|
130 |
+
Each model in the series is purpose-built, ethically scoped, and focused on responsible augmentation of healthcare knowledge—not its replacement.
|
131 |
+
|
132 |
+
---
|
133 |
+
|
134 |
+
## Final Note
|
135 |
+
|
136 |
+
Medra exists because medicine deserves tools that reflect **care**, not just computation.
|
137 |
+
It is small, but intentional.
|
138 |
+
Experimental, but serious.
|
139 |
+
And it was built with one purpose:
|
140 |
+
|
141 |
+
> To make intelligent care more accessible, more transparent, and more aligned with the human beings it’s meant to serve.
|
142 |
+
# Uploaded finetuned model
|
143 |
+
|
144 |
+
- **Developed by:** drwlf & nicoboss
|
145 |
+
- **License:** apache-2.0
|
146 |
+
- **Finetuned from model :** unsloth/gemma-3-4b-it-unsloth-bnb-4bit
|