UMCU commited on
Commit
1cb557b
·
verified ·
1 Parent(s): 22a1883

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -2
README.md CHANGED
@@ -12,5 +12,7 @@ tags:
12
  ---
13
 
14
 
15
- We used GPT4.1-nano to classify generic texts from OSCAR as non-medical/medical. We labeled 400.000 texts, with about 40.000 labeled as positive.
16
- We then trained a SequenceClassifier on 80.000 samples with a 50/50 class ratio.
 
 
 
12
  ---
13
 
14
 
15
+ We used GPT4.1-nano to classify generic texts from OSCAR as non-medical/medical using [PubScience](https://github.com/bramiozo/PubScience/tree/main/pubscience/label). We labeled 400.000 texts, with about 40.000 labeled as positive.
16
+ We then trained a SequenceClassifier on 80.000 samples with a 50/50 class ratio.
17
+
18
+ This can be used e.g.