Add the code
README.md CHANGED
@@ -49,4 +49,8 @@ The model was tested with vLLM and should work with any toolkit supporting Mistr
Make sure to use the system prompt provided in the `SYSTEM_PROMPT.txt` file (from Mistral's docs) and the sampling params `temp=0.7, top_p=0.95`.

+The code used for creating this model can be found here: https://colab.research.google.com/drive/1UuMo4VSgVoD4GfLrFgHUJvCv0cdALR7m?usp=sharing.
+It requires ~150 GB of RAM (VRAM is not needed for this) since it loads three 24B models in BF16.
+4-bit bitsandbytes quantization could be used to reduce the memory requirements to roughly 1/4.
+
There still may be configuration errors in this model which might reduce performance. Let me know if you encounter any weird behavior!
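For reference, here is a minimal sketch of the recommended inference setup with vLLM. Only the `SYSTEM_PROMPT.txt` file and the sampling params `temp=0.7, top_p=0.95` come from the README; the model path and the user message are placeholder assumptions.

```python
from vllm import LLM, SamplingParams

# System prompt shipped with the model (from Mistral's docs, per the README)
with open("SYSTEM_PROMPT.txt") as f:
    system_prompt = f.read()

llm = LLM(model="./merged-model")  # placeholder path to this model

# Sampling parameters recommended in the README
params = SamplingParams(temperature=0.7, top_p=0.95)

messages = [
    {"role": "system", "content": system_prompt},
    {"role": "user", "content": "Hello!"},  # placeholder user turn
]

# llm.chat() applies the model's chat template before generating
outputs = llm.chat(messages, params)
print(outputs[0].outputs[0].text)
```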
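The RAM estimate in the diff checks out: a 24B-parameter model in BF16 takes about 24B × 2 bytes ≈ 48 GB, so three of them need ~144 GB, in line with the ~150 GB figure. Below is a hedged sketch of the two loading modes; the model id is a placeholder, and the 4-bit path is an assumption about how one would apply the quantization suggestion (note that bitsandbytes 4-bit normally targets CUDA devices).

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

MODEL_ID = "org/some-24b-model"  # placeholder, not a real id from the README

# BF16 on CPU: ~2 bytes/param -> ~48 GB for a 24B model, ~144 GB for three
model_bf16 = AutoModelForCausalLM.from_pretrained(
    MODEL_ID,
    torch_dtype=torch.bfloat16,
    device_map="cpu",
)

# 4-bit bitsandbytes variant, roughly 1/4 of the BF16 footprint
# (assumption: run on a CUDA device, which bitsandbytes normally requires)
model_4bit = AutoModelForCausalLM.from_pretrained(
    MODEL_ID,
    quantization_config=BitsAndBytesConfig(
        load_in_4bit=True,
        bnb_4bit_compute_dtype=torch.bfloat16,
    ),
    device_map="auto",
)
```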