Snowflake/snowflake-arctic-embed-l-v2.0 Sentence Similarity • 0.6B • Updated Apr 25 • 148k • • 182
mistralai/Mistral-Small-3.1-24B-Instruct-2503 Image-Text-to-Text • 24B • Updated May 9 • 198k • • 1.29k
bartowski/google_gemma-3-27b-it-qat-GGUF Image-Text-to-Text • 27B • Updated Apr 22 • 6.02k • 32
bartowski/google_gemma-3-12b-it-qat-GGUF Image-Text-to-Text • 12B • Updated Apr 18 • 3.38k • 22
google/gemma-3-27b-it-qat-q4_0-unquantized Image-Text-to-Text • 27B • Updated Apr 15 • 6.35k • 33
nvidia/Llama-3_1-Nemotron-Ultra-253B-v1 Text Generation • 253B • Updated May 8 • 88.5k • • 315
view article Article Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference By mfuntowicz and 1 other • Jan 16 • 75