Files changed (1) hide show
  1. README.md +53 -34
README.md CHANGED
@@ -1,34 +1,53 @@
1
- ---
2
- license: apache-2.0
3
- inference: false
4
- base_model: Qwen/Qwen2.5-3B-Instruct
5
- base_model_relation: quantized
6
- tags: [green, llmware-chat, p3, ov, emerald]
7
- ---
8
-
9
- # qwen2.5-3b-instruct-ov
10
-
11
- **qwen2.5-3b-instruct-ov** is an OpenVino int4 quantized version of [Qwen2.5-3B-Instruct](https://www.huggingface.co/Qwen/Qwen2.5-3B-Instruct), providing a very fast, very small inference implementation, optimized for AI PCs using Intel GPU, CPU and NPU.
12
-
13
- This is from the latest release series from Qwen.
14
-
15
- ### Model Description
16
-
17
- - **Developed by:** Qwen
18
- - **Quantized by:** llmware
19
- - **Model type:** qwen2.5
20
- - **Parameters:** 3 billion
21
- - **Model Parent:** Qwen/Qwen2.5-3B-Instruct
22
- - **Language(s) (NLP):** English
23
- - **License:** Apache 2.0
24
- - **Uses:** Chat, general-purpose LLM
25
- - **Quantization:** int4
26
-
27
-
28
- ## Model Card Contact
29
-
30
- [llmware on github](https://www.github.com/llmware-ai/llmware)
31
-
32
- [llmware on hf](https://www.huggingface.co/llmware)
33
-
34
- [llmware website](https://www.llmware.ai)
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ inference: false
4
+ base_model: Qwen/Qwen2.5-3B-Instruct
5
+ base_model_relation: quantized
6
+ tags:
7
+ - green
8
+ - llmware-chat
9
+ - p3
10
+ - ov
11
+ - emerald
12
+ language:
13
+ - zho
14
+ - eng
15
+ - fra
16
+ - spa
17
+ - por
18
+ - deu
19
+ - ita
20
+ - rus
21
+ - jpn
22
+ - kor
23
+ - vie
24
+ - tha
25
+ - ara
26
+ ---
27
+
28
+ # qwen2.5-3b-instruct-ov
29
+
30
+ **qwen2.5-3b-instruct-ov** is an OpenVino int4 quantized version of [Qwen2.5-3B-Instruct](https://www.huggingface.co/Qwen/Qwen2.5-3B-Instruct), providing a very fast, very small inference implementation, optimized for AI PCs using Intel GPU, CPU and NPU.
31
+
32
+ This is from the latest release series from Qwen.
33
+
34
+ ### Model Description
35
+
36
+ - **Developed by:** Qwen
37
+ - **Quantized by:** llmware
38
+ - **Model type:** qwen2.5
39
+ - **Parameters:** 3 billion
40
+ - **Model Parent:** Qwen/Qwen2.5-3B-Instruct
41
+ - **Language(s) (NLP):** English
42
+ - **License:** Apache 2.0
43
+ - **Uses:** Chat, general-purpose LLM
44
+ - **Quantization:** int4
45
+
46
+
47
+ ## Model Card Contact
48
+
49
+ [llmware on github](https://www.github.com/llmware-ai/llmware)
50
+
51
+ [llmware on hf](https://www.huggingface.co/llmware)
52
+
53
+ [llmware website](https://www.llmware.ai)