adaamko commited on
Commit
942a47e
·
verified ·
1 Parent(s): ae6bdef

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +38 -3
README.md CHANGED
@@ -1,3 +1,38 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ language:
4
+ - en
5
+ base_model:
6
+ - answerdotai/ModernBERT-base
7
+ pipeline_tag: token-classification
8
+ tags:
9
+ - token classification
10
+ - hallucination detection
11
+ - transformers
12
+ ---
13
+
14
+ # LettuceDetect: Hallucination Detection Model
15
+
16
+ **Model Name:** lettucedect-base-modernbert-en-v1
17
+ **Organization:** KRLabsOrg
18
+
19
+ ## Overview
20
+
21
+ LettuceDetect is a transformer-based model for hallucination detection on context and answer pairs, designed for Retrieval-Augmented Generation (RAG) applications. This model is built on **ModernBERT**, which has been specifically chosen and trained becasue of its extended context support (up to **8192 tokens**). This long-context capability is critical for tasks where detailed and extensive documents need to be processed to accurately determine if an answer is supported by the provided context.
22
+
23
+ ## Model Details
24
+
25
+ - **Architecture:** ModernBERT (Base) with extended context support (up to 8192 tokens)
26
+ - **Task:** Token Classification / Hallucination Detection
27
+ - **Training Dataset:** RagTruth (with potential extensions to biomedical datasets)
28
+ - **Language:** English
29
+
30
+ ## How It Works
31
+
32
+ The model is trained to identify tokens in the answer text that are not supported by the given context. During inference, the model returns token-level predictions which are then aggregated into spans. This allows users to see exactly which parts of the answer are considered hallucinated.
33
+
34
+ ## Usage
35
+
36
+ ### Installation
37
+
38
+ Install the 'lettucedetect' repository