jpacifico
/

Aramis-2B-BitNet-bf16

Text Generation

Model card Files Files and versions

jpacifico commited on Aug 17

Commit

135a742

·

verified ·

1 Parent(s): 478f1d7

Update README.md

Files changed (1) hide show

README.md +17 -0

README.md CHANGED Viewed

@@ -79,6 +79,14 @@ Evaluations were performed using [LM Eval Harness](https://github.com/EleutherAI
 | jpacifico/bitnet-dpo-merged-modelstock7            | **51,62**              |
 ## Last checkpoint
 ### Merge Method
@@ -111,6 +119,15 @@ tokenizer_source: jpacifico/bitnet-dpo-merged-modelstock-retrain
 ```
 - **Developed by:** Jonathan Pacifico, 2025
 - **Model type:** LLM
 - **Language(s) (NLP):** French, English

 | jpacifico/bitnet-dpo-merged-modelstock7            | **51,62**              |
+## Usage
+You can run this model using my [Colab notebook](https://github.com/jpacifico/Chocolatine-LLM/blob/main/Chocolatine_14B_inference_test_colab.ipynb)
+You can also run this model using the following code:
 ## Last checkpoint
 ### Merge Method
 ```
+## Limitations
+Not tuned for coding or formal math; prefer specialized variants if those are critical.
+No explicit chain-of-thought training; improvements come from bilingual DPO + merging.
+**Disclamer**
+This model is intended for research and development purposes only and should not be used in commercial or real-world applications without further testing. While the Microsoft Research team has applied SFT and DPO to align the BitNet base model, it may still produce unexpected, biased, or inaccurate outputs. Please use responsibly.
 - **Developed by:** Jonathan Pacifico, 2025
 - **Model type:** LLM
 - **Language(s) (NLP):** French, English