Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
ProCreations 
posted an update about 7 hours ago
Post
167
Question about Intellite Chat to you guys!
(If you don’t know, Intellite Chat is my up-and-coming 100m parameter AI model focused on high-quality chat.)

What quantization variants would you want to see for Intellite Chat? It’ll come with FP32, FP16, and BF16, but any others you want? Maybe FP8 or even BitNet’s 1.58-bit quantization? Let me know!

fp8 is not required this is only 100m parameter(less than gpt2)so this can run even on mobile