16 or 32 bit?
#19
by ChrisGoringe - opened
The model card says "The native weights of this model were exported in bfloat16 precision"
But the config.json file says "torch_dtype": "float32", and the fact that a 9B parameter model is a 36GB download suggests the same: 9B parameters at 4 bytes per float32 weight is roughly 36GB.
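For anyone who wants to confirm what's actually stored on disk, here's a minimal sketch that reads just the header of a safetensors shard, so no tensor data is loaded. It assumes the repo ships .safetensors files; the shard name below is a placeholder.

```python
import json
import struct

def safetensors_dtypes(path: str) -> dict[str, str]:
    """Report the on-disk dtype of every tensor in a .safetensors file.

    The safetensors format begins with an 8-byte little-endian header
    length, followed by a JSON header recording each tensor's dtype,
    shape, and offsets, so the weights themselves never need loading.
    """
    with open(path, "rb") as f:
        (header_len,) = struct.unpack("<Q", f.read(8))
        header = json.loads(f.read(header_len))
    return {
        name: info["dtype"]
        for name, info in header.items()
        if name != "__metadata__"  # skip the optional metadata block
    }

# Placeholder shard name -- substitute a real file from the download.
print(safetensors_dtypes("model-00001-of-00004.safetensors"))
```

If this prints "F32" for every tensor, the shards really are float32 regardless of what the model card says.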
If it's really only bfloat16, could we have an 18GB download?
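In the meantime, one workaround is to downcast on load and re-save a bfloat16 copy locally. A sketch, assuming a transformers-compatible checkpoint; the repo id is a placeholder, and note this still pulls the full 36GB float32 files once:

```python
import torch
from transformers import AutoModelForCausalLM

# Placeholder repo id -- substitute the actual model.
model = AutoModelForCausalLM.from_pretrained(
    "org/model-9b",
    torch_dtype=torch.bfloat16,  # downcast the float32 weights as they load
)

# Re-save locally: ~2 bytes per parameter, so roughly 18GB for 9B params.
model.save_pretrained("model-9b-bf16")
```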
They updated the model.