Update README.md
Browse files
README.md
CHANGED
@@ -79,6 +79,14 @@ Evaluations were performed using [LM Eval Harness](https://github.com/EleutherAI
|
|
79 |
| jpacifico/bitnet-dpo-merged-modelstock7 | **51,62** |
|
80 |
|
81 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
82 |
## Last checkpoint
|
83 |
### Merge Method
|
84 |
|
@@ -111,6 +119,15 @@ tokenizer_source: jpacifico/bitnet-dpo-merged-modelstock-retrain
|
|
111 |
|
112 |
```
|
113 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
114 |
- **Developed by:** Jonathan Pacifico, 2025
|
115 |
- **Model type:** LLM
|
116 |
- **Language(s) (NLP):** French, English
|
|
|
79 |
| jpacifico/bitnet-dpo-merged-modelstock7 | **51,62** |
|
80 |
|
81 |
|
82 |
+
|
83 |
+
## Usage
|
84 |
+
|
85 |
+
You can run this model using my [Colab notebook](https://github.com/jpacifico/Chocolatine-LLM/blob/main/Chocolatine_14B_inference_test_colab.ipynb)
|
86 |
+
|
87 |
+
You can also run this model using the following code:
|
88 |
+
|
89 |
+
|
90 |
## Last checkpoint
|
91 |
### Merge Method
|
92 |
|
|
|
119 |
|
120 |
```
|
121 |
|
122 |
+
## Limitations
|
123 |
+
|
124 |
+
Not tuned for coding or formal math; prefer specialized variants if those are critical.
|
125 |
+
No explicit chain-of-thought training; improvements come from bilingual DPO + merging.
|
126 |
+
|
127 |
+
**Disclamer**
|
128 |
+
This model is intended for research and development purposes only and should not be used in commercial or real-world applications without further testing. While the Microsoft Research team has applied SFT and DPO to align the BitNet base model, it may still produce unexpected, biased, or inaccurate outputs. Please use responsibly.
|
129 |
+
|
130 |
+
|
131 |
- **Developed by:** Jonathan Pacifico, 2025
|
132 |
- **Model type:** LLM
|
133 |
- **Language(s) (NLP):** French, English
|