MonsterMMORPG 
posted an update about 17 hours ago
Most powerful vision model CogVLM 2 now works great on Windows with the new pre-compiled Triton wheels - 19 examples - Tested locally with 4-bit quantization - The second example is really wild - Can be used for image captioning or any other vision task

The app and the installers: https://www.patreon.com/posts/120193330

Check the screenshots below to see how to use it

Currently the app works great and runs very fast with 4-bit quantization
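For anyone who wants to try 4-bit loading outside the app, here is a minimal sketch using Transformers with bitsandbytes. This is not the app's own code, just an assumption of how a 4-bit load of the Hugging Face repo could look. `quant_settings` and `load_model` are hypothetical helper names, bfloat16 is an assumed compute dtype, and the model needs `trust_remote_code=True` because it ships custom code.

```python
MODEL_ID = "THUDM/cogvlm2-llama3-chat-19B"


def quant_settings(bits: int = 4) -> dict:
    """Return bitsandbytes flags as a plain dict (hypothetical helper)."""
    if bits == 4:
        return {"load_in_4bit": True}
    if bits == 8:
        return {"load_in_8bit": True}
    raise ValueError("bits must be 4 or 8")


def load_model(model_id: str = MODEL_ID, bits: int = 4):
    """Load tokenizer and quantized model (assumes a CUDA GPU and bitsandbytes)."""
    # Heavy imports stay local so quant_settings() works without the GPU stack.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

    tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        torch_dtype=torch.bfloat16,  # assumed dtype for the non-quantized parts
        trust_remote_code=True,
        quantization_config=BitsAndBytesConfig(**quant_settings(bits)),
        low_cpu_mem_usage=True,
    ).eval()
    return tokenizer, model
```

Calling `load_model()` downloads the weights on first use, so expect a long wait and a lot of disk space the first time.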

I am looking into lowering VRAM usage even further, for example by adding CPU offloading, if possible

Previously we were lacking Triton on Windows, but it now works perfectly

My installer installs into a completely isolated and clean Python 3.10 venv

You can see the entire app and installer source code

If you get a Triton error, make sure to delete your Triton cache after installing the app, like below

C:\Users\Furkan\.triton
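If you prefer to do this from Python instead of deleting the folder by hand, here is a small sketch. It assumes the cache lives in a `.triton` folder inside your user home directory, as in the path above. `clear_triton_cache` is a hypothetical helper name, and it removes the whole folder, so Triton will recompile its kernels on the next run.

```python
import shutil
from pathlib import Path
from typing import Optional


def clear_triton_cache(home: Optional[Path] = None) -> bool:
    """Delete the .triton folder under the given home directory.

    Returns True if a folder was removed, False if there was nothing to delete.
    """
    base = home if home is not None else Path.home()
    cache_dir = base / ".triton"  # assumed cache location
    if cache_dir.is_dir():
        shutil.rmtree(cache_dir)
        return True
    return False


# Example (uncomment to clear the cache for the current user):
# print("Cache removed:", clear_triton_cache())
```

Close the app before running it so no files in the folder are in use.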

Hugging Face repo with sample code : THUDM/cogvlm2-llama3-chat-19B

GitHub repo : https://github.com/THUDM/CogVLM2

Triton Windows : https://github.com/woct0rdho/triton-windows/releases