馃 DeBERTa-NCERT-Biology-QA

This model is a fine-tuned version of microsoft/deberta-v3-small on a chunk of the NCERT Class 11 Biology dataset. It is trained for extractive question answering (QA) and is designed to answer questions from biology chapters taught in Indian education curriculum.


馃摎 Dataset

The dataset was created from the official NCERT Class 11 Biology book, specifically:

  • Chunk Range: chunk_3000 to chunk_3143
  • Data Format: CSV with context-question-answer triplets
  • Task: Extractive QA (start & end position of answer in context)

鈿欙笍 Model Details

  • Base Model: microsoft/deberta-v3-small
  • Task: question-answering
  • Tokenizer: SentencePiece (spm.model) with custom vocabulary
  • Framework: 馃 Transformers + PyTorch
  • Optimized For: Low-resource devices (OpenVINO conversion available)

馃搱 Performance

Metric Value
Exact Match (EM) 87.5%
F1 Score 91.2%
Avg Confidence ~0.99 after fine-tuning
Loss Trend Decreasing steadily from 1.6 to 0.3
Epochs 2

馃煝 Confidence before training: ~0.006
馃煝 Confidence after training: ~0.99

Downloads last month
53
Safetensors
Model size
141M params
Tensor type
F32
Inference Providers NEW
This model isn't deployed by any Inference Provider. 馃檵 Ask for provider support