Has anyone tested how much VRAM this takes to run locally?

#3 by weisiren - opened


In fp16:

  • 4 billion parameters -> 8 GB of VRAM for the weights alone (2 bytes per parameter)
  • activations and KV cache at the max context window of 8192 tokens, single sequence -> roughly 2.5–4 GB

So 10–12 GB of VRAM should be enough to run inference comfortably in fp16 (a rough sketch of the arithmetic is below).
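As a sanity check, here is a back-of-envelope estimate in Python. The layer count, KV-head count, and head dimension are placeholder values, not Mellum's published config, so plug in the numbers from the model card if you want an exact figure:

```python
# Back-of-envelope fp16 memory estimate. The layer count, KV-head count and
# head dimension below are placeholders, not Mellum's published config.
BYTES_FP16 = 2

def weights_gb(n_params: float) -> float:
    # fp16 stores each parameter in 2 bytes
    return n_params * BYTES_FP16 / 1e9

def kv_cache_gb(n_layers: int, n_kv_heads: int, head_dim: int, seq_len: int) -> float:
    # 2x for keys and values, one sequence, fp16
    return 2 * n_layers * n_kv_heads * head_dim * seq_len * BYTES_FP16 / 1e9

w = weights_gb(4e9)                                                       # ~8.0 GB
kv = kv_cache_gb(n_layers=24, n_kv_heads=32, head_dim=128, seq_len=8192)  # ~3.2 GB
print(f"weights: {w:.1f} GB + kv cache: {kv:.1f} GB = {w + kv:.1f} GB")
```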

If you want to reduce memory consumption, you can use our 8-bit GGUF checkpoints: https://huggingface.co/JetBrains/Mellum-4b-base-gguf
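For example, a minimal sketch with llama-cpp-python; the `.gguf` filename below is a guess, so check the repo's file list for the actual quant name:

```python
from llama_cpp import Llama

llm = Llama(
    model_path="mellum-4b-base.Q8_0.gguf",  # assumed filename, verify in the repo
    n_ctx=8192,        # max context window
    n_gpu_layers=-1,   # offload all layers to the GPU if they fit
)

out = llm("def fibonacci(n):", max_tokens=64)
print(out["choices"][0]["text"])
```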
