Triangle104 commited on
Commit
f71f3dc
·
verified ·
1 Parent(s): b445602

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +11 -0
README.md CHANGED
@@ -16,6 +16,17 @@ tags:
16
  This model was converted to GGUF format from [`allenai/OLMo-2-0425-1B-Instruct`](https://huggingface.co/allenai/OLMo-2-0425-1B-Instruct) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
17
  Refer to the [original model card](https://huggingface.co/allenai/OLMo-2-0425-1B-Instruct) for more details on the model.
18
 
 
 
 
 
 
 
 
 
 
 
 
19
  ## Use with llama.cpp
20
  Install llama.cpp through brew (works on Mac and Linux)
21
 
 
16
  This model was converted to GGUF format from [`allenai/OLMo-2-0425-1B-Instruct`](https://huggingface.co/allenai/OLMo-2-0425-1B-Instruct) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
17
  Refer to the [original model card](https://huggingface.co/allenai/OLMo-2-0425-1B-Instruct) for more details on the model.
18
 
19
+ ---
20
+ OLMo 2 1B Instruct April 2025 is post-trained variant of the allenai/OLMo-2-0425-1B-RLVR1 model, which has undergone supervised finetuning on an OLMo-specific variant of the Tülu 3 dataset, further DPO training on this dataset, and final RLVR training on this dataset.
21
+ Tülu 3 is designed for state-of-the-art performance on a diversity of
22
+ tasks in addition to chat, such as MATH, GSM8K, and IFEval.
23
+ Check out the OLMo 2 paper or Tülu 3 paper for more details!
24
+
25
+
26
+ OLMo is a series of Open Language Models designed to enable the science of language models.
27
+ These models are trained on the Dolma dataset. We are releasing all code, checkpoints, logs, and associated training details.
28
+
29
+ ---
30
  ## Use with llama.cpp
31
  Install llama.cpp through brew (works on Mac and Linux)
32