DavidAU commited on
Commit
5b3eac4
·
verified ·
1 Parent(s): e6eec2c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -0
README.md CHANGED
@@ -45,6 +45,7 @@ Quick testing shows optimized quants can:
45
  - Come up with a better answer/stronger reasoning.
46
  - Use less tokens to "reason" ... up to 50% less.
47
  - Faster and smaller quant size (VS "MAX" with output tensor and embed at BF16).
 
48
 
49
  Cost of the Augment:
50
  - Quants are are slightly larger.
 
45
  - Come up with a better answer/stronger reasoning.
46
  - Use less tokens to "reason" ... up to 50% less.
47
  - Faster and smaller quant size (VS "MAX" with output tensor and embed at BF16).
48
+ - Solution quality is also higher.
49
 
50
  Cost of the Augment:
51
  - Quants are are slightly larger.