Update README.md
README.md
CHANGED
@@ -16,6 +16,17 @@ tags:
|
|
16 |
This model was converted to GGUF format from [`allenai/OLMo-2-0425-1B-Instruct`](https://huggingface.co/allenai/OLMo-2-0425-1B-Instruct) using llama.cpp via ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
|
17 |
Refer to the [original model card](https://huggingface.co/allenai/OLMo-2-0425-1B-Instruct) for more details on the model.
|
18 |
|
19 |
## Use with llama.cpp
|
20 |
Install llama.cpp through brew (works on Mac and Linux)
|
21 |
|
|
|
16 |
This model was converted to GGUF format from [`allenai/OLMo-2-0425-1B-Instruct`](https://huggingface.co/allenai/OLMo-2-0425-1B-Instruct) using llama.cpp via ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
|
17 |
Refer to the [original model card](https://huggingface.co/allenai/OLMo-2-0425-1B-Instruct) for more details on the model.
|
18 |
|
19 |
+
---
|
20 |
+
OLMo 2 1B Instruct April 2025 is a post-trained variant of the allenai/OLMo-2-0425-1B-RLVR1 model, which has undergone supervised finetuning on an OLMo-specific variant of the Tülu 3 dataset, further DPO training on this dataset, and final RLVR training on this dataset.
|
21 |
+
Tülu 3 is designed for state-of-the-art performance on a diversity of
|
22 |
+
tasks in addition to chat, such as MATH, GSM8K, and IFEval.
|
23 |
+
Check out the OLMo 2 paper or Tülu 3 paper for more details!
|
24 |
+
|
25 |
+
|
26 |
+
OLMo is a series of Open Language Models designed to enable the science of language models.
|
27 |
+
These models are trained on the Dolma dataset. We are releasing all code, checkpoints, logs, and associated training details.
|
28 |
+
|
29 |
+
---
|
30 |
## Use with llama.cpp
|
31 |
Install llama.cpp through brew (works on Mac and Linux)
|
32 |
|