Update README.md
README.md
CHANGED
@@ -12,5 +12,7 @@ tags:
 ---
 
 
-We used GPT4.1-nano to classify generic texts from OSCAR as non-medical/medical. We labeled 400.000 texts, with about 40.000 labeled as positive.
-We then trained a SequenceClassifier on 80.000 samples with a 50/50 class ratio.
+We used GPT4.1-nano to classify generic texts from OSCAR as non-medical/medical using [PubScience](https://github.com/bramiozo/PubScience/tree/main/pubscience/label). We labeled 400.000 texts, with about 40.000 labeled as positive.
+We then trained a SequenceClassifier on 80.000 samples with a 50/50 class ratio.
+
+This can be used e.g.
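
The README text in this change describes drawing a 50/50 training set from a labeled pool in which only about one in ten texts is positive. A minimal sketch of that kind of balanced downsampling is shown here. It is an illustration under stated assumptions, not the authors' pipeline: the function name `balanced_sample`, the 0/1 label encoding, and the toy data are all hypothetical.

```python
import random


def balanced_sample(texts, labels, n_total, seed=0):
    """Return n_total (text, label) pairs with equal counts of labels 0 and 1.

    Hypothetical helper: assumes binary labels encoded as 0 (non-medical)
    and 1 (medical).
    """
    per_class = n_total // 2
    rng = random.Random(seed)

    # Split the pool by label.
    pos = [t for t, y in zip(texts, labels) if y == 1]
    neg = [t for t, y in zip(texts, labels) if y == 0]
    if len(pos) < per_class or len(neg) < per_class:
        raise ValueError("not enough examples in one class for a 50/50 split")

    # Draw the same number from each class without replacement, then shuffle.
    sample = [(t, 1) for t in rng.sample(pos, per_class)]
    sample += [(t, 0) for t in rng.sample(neg, per_class)]
    rng.shuffle(sample)
    return sample


# Toy pool with a 10% positive rate, scaled down from the sizes in the README.
texts = [f"doc {i}" for i in range(400)]
labels = [1 if i % 10 == 0 else 0 for i in range(400)]
subset = balanced_sample(texts, labels, n_total=80)
```

With the sizes quoted in the README, the same idea would keep all of the roughly 40.000 positives and draw an equal number of negatives from the remaining pool.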