Update README.md
README.md
CHANGED
@@ -52,4 +52,13 @@ sequences = pipeline(
 )
 for seq in sequences:
     print(f"Result: {seq['generated_text']}")
-```
+```
+
+#### Eval
+| Model | Pretrain Tokens | HellaSwag | Obqa | WinoGrande | ARC_c | ARC_e | boolq | piqa | avg |
+|-------------------------------------------|-----------------|-----------|------|------------|-------|-------|-------|------|-----|
+| Pythia-1.0B | 300B | 47.16 | 31.40| 53.43 | 27.05 | 48.99 | 60.83 | 69.21 | 48.30 |
+| TinyLlama-1.1B-intermediate-step-50K-104b | 103B | 43.50 | 29.80| 53.28 | 24.32 | 44.91 | 59.66 | 67.30 | 46.11|
+| TinyLlama-1.1B-intermediate-step-240k-503b| 503B | 49.56 |31.40 |55.80 |26.54 |48.32 |56.91 |69.42 | 48.28 |
+| TinyLlama-1.1B-intermediate-step-480k-1007B | 1007B | 52.54 | 33.40 | 55.96 | 27.82 | 52.36 | 59.54 | 69.91 | 50.22 |
+| TinyLlama-1.1B-intermediate-step-715k-1.5T | 1.49T | 53.68 | 35.20 | 58.33 | 29.18 | 51.89 | 59.08 | 71.65 | 51.29 |
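
Review note: the added table does not say how `avg` is computed. It appears to be the unweighted mean of the seven task columns, rounded to two decimals. The sketch below recomputes it under that assumption; the `ROWS` dictionary and the `recompute_avg` helper are illustrative names, not part of the README, and the scores are copied from the table by hand.

```python
from statistics import mean

# Scores copied from the Eval table, in column order:
# HellaSwag, Obqa, WinoGrande, ARC_c, ARC_e, boolq, piqa.
# The second tuple element is the avg value reported in the table.
ROWS = {
    "Pythia-1.0B": ([47.16, 31.40, 53.43, 27.05, 48.99, 60.83, 69.21], 48.30),
    "TinyLlama-1.1B-intermediate-step-50K-104b": ([43.50, 29.80, 53.28, 24.32, 44.91, 59.66, 67.30], 46.11),
    "TinyLlama-1.1B-intermediate-step-240k-503b": ([49.56, 31.40, 55.80, 26.54, 48.32, 56.91, 69.42], 48.28),
    "TinyLlama-1.1B-intermediate-step-480k-1007B": ([52.54, 33.40, 55.96, 27.82, 52.36, 59.54, 69.91], 50.22),
    "TinyLlama-1.1B-intermediate-step-715k-1.5T": ([53.68, 35.20, 58.33, 29.18, 51.89, 59.08, 71.65], 51.29),
}


def recompute_avg(scores):
    """Unweighted mean of the per-task scores, rounded to two decimals.

    Assumption: this is the convention behind the table's avg column.
    """
    return round(mean(scores), 2)


for model, (scores, reported) in ROWS.items():
    recomputed = recompute_avg(scores)
    # Allow for rounding in the reported value.
    status = "matches" if abs(recomputed - reported) < 0.005 else "differs"
    print(f"{model}: reported {reported:.2f}, recomputed {recomputed:.2f} ({status})")
```

Under this assumption, all five rows agree with the reported `avg` column to two decimals.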