BLGS committed on
Commit
7bff4e9
·
verified ·
1 Parent(s): c529f7a

Update README.md

  1. README.md +7 -0
README.md CHANGED
@@ -4,6 +4,13 @@ language:
4
  - en
5
  pipeline_tag: text-generation
6
  ---
 
 
 
 
 
 
 
7
  ## Introduction
8
 
9
  SmallThinker is a family of **on-device native** Mixture-of-Experts (MoE) language models specially designed for local deployment,
 
4
  - en
5
  pipeline_tag: text-generation
6
  ---
7
+
8
+ ## SmallThinker-4BA0.6B-Instruct-GGUF
9
+
10
+ - GGUF models with the `.gguf` suffix can be used with the [*llama.cpp* framework](https://github.com/wdl339/llama.cpp/tree/smallthinker).
11
+
12
+ - GGUF models with the `.powerinfer.gguf` suffix are integrated with fused sparse FFN operators and sparse LM head operators. These models are only compatible with the [*powerinfer* framework](https://github.com/SJTU-IPADS/PowerInfer/tree/main/smallthinker).
13
+
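The suffix rule above can be sketched as a small lookup. This is only an illustration: the helper name `runtime_for` and the file names in the usage note are hypothetical and do not come from the repository. The one detail worth noting is that `.powerinfer.gguf` also ends with `.gguf`, so the longer suffix has to be checked first.

```python
def runtime_for(filename: str) -> str:
    """Return the framework a GGUF file targets, judged by its suffix.

    Hypothetical helper for illustration; not part of llama.cpp or PowerInfer.
    """
    name = filename.lower()
    # Check the longer suffix first: ".powerinfer.gguf" also ends with ".gguf".
    if name.endswith(".powerinfer.gguf"):
        return "powerinfer"
    if name.endswith(".gguf"):
        return "llama.cpp"
    raise ValueError(f"not a GGUF file: {filename!r}")
```

For example, `runtime_for("model.powerinfer.gguf")` selects the powerinfer framework, while `runtime_for("model.gguf")` selects llama.cpp (both file names are made up).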
14
  ## Introduction
15
 
16
  SmallThinker is a family of **on-device native** Mixture-of-Experts (MoE) language models specially designed for local deployment,