Article: Performant local mixture-of-experts CPU inference with GPU acceleration in llama.cpp • 1 day ago • 6
UnstableLlama/Qwen3-30B-A3B-Instruct-2507_Pruned_REAP-15B-A3B-exl3 4B • Updated about 1 month ago • 7