ruslandev/llama-3-8b-gpt-4o-ru1.0-gguf
ruslandev committed on Jun 30, 2024

Commit 7bd7c82 (verified) · 1 Parent(s): 7f6c8da

Update README.md

Files changed (1)

README.md +14 -1

@@ -21,10 +21,23 @@ I tried to achieve higher data quality by prompting GPT-4o, the latest OpenAI's
 The model shows promising results on the MT-Bench evaluation benchmark, surpassing GPT-3.5-turbo and being on par with [Suzume](https://huggingface.co/lightblue/suzume-llama-3-8B-multilingual) in Russian language scores,
 even though the latter is trained on an 8x larger and more diverse dataset.
+## How to use
+The easiest way to run this model on your own computer is to load its GGUF version ([ruslandev/llama-3-8b-gpt-4o-ru1.0-gguf](https://huggingface.co/ruslandev/llama-3-8b-gpt-4o-ru1.0-gguf)) with a program such as [llama.cpp](https://github.com/ggerganov/llama.cpp).
+If you want to use this model directly with the Hugging Face Transformers stack, I recommend my framework [gptchain](https://github.com/RuslanPeresy/gptchain).
+```
+git clone https://github.com/RuslanPeresy/gptchain.git
+cd gptchain
+pip install -r requirements-train.txt
+python gptchain.py chat -m ruslandev/llama-3-8b-gpt-4o-ru1.0 \
+	--chatml true \
+	-q '[{"from": "human", "value": "Из чего состоит нейронная сеть?"}]'
+```
 ## Evaluation scores
 I achieved the following scores on Ru/En MT-Bench:
 |            |meta-llama/Meta-Llama-3-8B-Instruct | ruslandev/llama-3-8b-gpt-4o-ru1.0 | lightblue/suzume-llama-3-8B-multilingual | Nexusflow/Starling-LM-7B-beta | gpt-3.5-turbo |
 |:----------:|:----------------------------------:|:---------------------------------:|:----------------------------------------:|:-----------------------------:|:-------------:|
 | Russian 🇷🇺 | NaN                                | 8.12                              | 8.19                                     | 8.06                          | 7.94          |
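
The `-q` argument in the added "How to use" section is a JSON list of conversation turns, which is easy to misquote when typed by hand in a shell. The following is a minimal sketch of building that command from Python. The helper name `build_chat_command` is hypothetical and not part of gptchain; only the flags shown in the README (`chat`, `-m`, `--chatml true`, `-q`) and its `from`/`value` turn format are taken from the diff.

```python
import json
import shlex


def build_chat_command(model_id, user_message):
    """Return an argv list for the gptchain chat command (hypothetical helper)."""
    # The -q payload mirrors the README example: a JSON list of turns,
    # each turn holding a "from" role and a "value" text.
    conversation = [{"from": "human", "value": user_message}]
    # ensure_ascii=False keeps Cyrillic text readable instead of \uXXXX escapes.
    query = json.dumps(conversation, ensure_ascii=False)
    return [
        "python", "gptchain.py", "chat",
        "-m", model_id,
        "--chatml", "true",
        "-q", query,
    ]


argv = build_chat_command(
    "ruslandev/llama-3-8b-gpt-4o-ru1.0",
    "Из чего состоит нейронная сеть?",
)
# shlex.join quotes each argument so the line can be pasted into a POSIX shell.
print(shlex.join(argv))
```

Assuming the repository has been cloned and its requirements installed as in the README, the same `argv` list could be passed to `subprocess.run(argv, check=True)` from inside the `gptchain` directory, which sidesteps shell quoting entirely.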