khalednabawi11's picture
Update README.md
a960399 verified
---
datasets:
- hongrui/mimic_chest_xray_v_1
---
# 🩺 CheXNet-MedScan-Report-Gen
**CheXNet-MedScan-Report-Gen** is an image captioning model for generating diagnostic text reports from chest X-ray images. It combines the power of a pretrained CheXNet encoder (based on DenseNet121) and a bidirectional LSTM decoder to produce sequence-based textual descriptions.
---
## 🧠 Model Architecture
- **Encoder:** DenseNet121 (CheXNet) with classifier removed
- **Decoder:** Bidirectional LSTM with dropout
- **Feature dimension:** 1024
- **Embedding dimension:** 256
- **Hidden dimension:** 512
- **Vocabulary size:** 5000
- **Dropout:** 0.5
---
## 🔧 Usage
You can load the model using the Hugging Face Transformers library:
```python
from transformers import AutoModel, AutoConfig
config = AutoConfig.from_pretrained("khalednabawi11/Chexnet-MedScan-Report-Gen", trust_remote_code=True)
model = AutoModel.from_pretrained("khalednabawi11/Chexnet-MedScan-Report-Gen", config=config, trust_remote_code=True)