Compare against transformer

#23

by supercharge19 - opened 8 days ago

Discussion

supercharge19

8 days ago

Hi, there. How does it compare with transformer in speed and accuracy/quality?

BerenMillidge

Zyphra org 7 days ago

The hybrid model is about 10-20% faster depending on the context length used and requires less activation memory for KV cache. In terms of quality, they are about the same but the different models have subtly different strengths and weaknesses.

supercharge19

6 days ago

thank u

supercharge19 changed discussion status to closed 6 days ago

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment