Text Generation
Transformers
Safetensors
English
French
bitnet
mergekit
Merge
conversational
custom_code
jpacifico commited on
Commit
135a742
·
verified ·
1 Parent(s): 478f1d7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +17 -0
README.md CHANGED
@@ -79,6 +79,14 @@ Evaluations were performed using [LM Eval Harness](https://github.com/EleutherAI
79
  | jpacifico/bitnet-dpo-merged-modelstock7 | **51,62** |
80
 
81
 
 
 
 
 
 
 
 
 
82
  ## Last checkpoint
83
  ### Merge Method
84
 
@@ -111,6 +119,15 @@ tokenizer_source: jpacifico/bitnet-dpo-merged-modelstock-retrain
111
 
112
  ```
113
 
 
 
 
 
 
 
 
 
 
114
  - **Developed by:** Jonathan Pacifico, 2025
115
  - **Model type:** LLM
116
  - **Language(s) (NLP):** French, English
 
79
  | jpacifico/bitnet-dpo-merged-modelstock7 | **51,62** |
80
 
81
 
82
+
83
+ ## Usage
84
+
85
+ You can run this model using my [Colab notebook](https://github.com/jpacifico/Chocolatine-LLM/blob/main/Chocolatine_14B_inference_test_colab.ipynb)
86
+
87
+ You can also run this model using the following code:
88
+
89
+
90
  ## Last checkpoint
91
  ### Merge Method
92
 
 
119
 
120
  ```
121
 
122
+ ## Limitations
123
+
124
+ Not tuned for coding or formal math; prefer specialized variants if those are critical.
125
+ No explicit chain-of-thought training; improvements come from bilingual DPO + merging.
126
+
127
+ **Disclamer**
128
+ This model is intended for research and development purposes only and should not be used in commercial or real-world applications without further testing. While the Microsoft Research team has applied SFT and DPO to align the BitNet base model, it may still produce unexpected, biased, or inaccurate outputs. Please use responsibly.
129
+
130
+
131
  - **Developed by:** Jonathan Pacifico, 2025
132
  - **Model type:** LLM
133
  - **Language(s) (NLP):** French, English