tasksource
/

ModernBERT-base-nli

Zero-Shot Classification

text-classification

natural-language-inference

Model card Files Files and versions Community

sileod commited on Jan 6

Commit

de4ab7e

·

verified ·

1 Parent(s): 1a7bbb4

Update README.md

Files changed (1) hide show

README.md +3 -0

README.md CHANGED Viewed

@@ -24,6 +24,9 @@ The model was trained for 200k steps on an Nvidia A30 GPU.
 It is very good at reasoning tasks (better than llama 3.1 8B Instruct on ANLI and FOLIO), long context reasoning, sentiment analysis and zero-shot classification with new labels.
 | test_name                             |   test_accuracy |
 |:--------------------------------------|----------------:|
 | glue/mnli                             |            0.87 |

 It is very good at reasoning tasks (better than llama 3.1 8B Instruct on ANLI and FOLIO), long context reasoning, sentiment analysis and zero-shot classification with new labels.
+The following table shows model test accuracy. These are the scores for the same single transformer with different classification heads on top. Further gains can be obtained by fine-tuning on a single-task, e.g. SST, but it this checkpoint is great for zero-shot classification and natural language inference (contradiction/entailment/neutral classification).
 | test_name                             |   test_accuracy |
 |:--------------------------------------|----------------:|
 | glue/mnli                             |            0.87 |