FP8 weights

#41
by getfit - opened

Could you push an FP8 release? It looks like llmcompressor does not support the arch yet.

Has anyone gotten this to convert?

Meta Llama org

@getfit : Thanks for your question! We used the llmcompressor recipe to create the FP8 checkpoint for Maverick here: https://huggingface.co/meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8.
We'll confirm with the team on an ETA for adding FP8 and INT4 for Scout. cc: @wukaixingxp @Hamid-Nazeri

@yecharlotteqi Are there any updates on this?

For anyone looking, I just found this model:

https://huggingface.co/RedHatAI/Llama-4-Scout-17B-16E-Instruct-FP8-dynamic

It should work with vLLM, but I haven't tested it yet.
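If anyone wants to try it, serving that FP8-dynamic checkpoint with vLLM should look roughly like this. This is an untested sketch: the `--tensor-parallel-size` and `--max-model-len` values are assumptions and depend on your GPUs and use case.

```shell
# Install a recent vLLM; Llama 4 and FP8 support require a new release
pip install -U vllm

# Serve the FP8-dynamic checkpoint. Adjust --tensor-parallel-size to
# match your GPU count (8 here is an assumption, not a requirement).
vllm serve RedHatAI/Llama-4-Scout-17B-16E-Instruct-FP8-dynamic \
    --tensor-parallel-size 8 \
    --max-model-len 8192
```

This starts an OpenAI-compatible API server on the default port, so existing OpenAI client code can be pointed at it for a quick smoke test.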
