Inference Providers
Active filters: gpu
Jay0515/onnxruntime-gpu-aarch64-cuda13-sm121
Other
• Updated • 3
magiccodingman/Apriel-1.5-15b-Thinker-unsloth-MagicQuant-Hybrid-GGUF
Text Generation
• 14B • Updated • 174
• 2
Vishal74/Seq2SeqModel_LSTM
Updated
Tech-Meld/gpus-everywhere
Text-to-Image
• Updated • 8
• • 1
vhab10/llama_3.1_8b_Q4_K_M-gguf
Text Generation
• 8B • Updated • 7
Text Generation
• 4B • Updated • 26
mradermacher/Loxa-4B-GGUF
4B • Updated • 105
mradermacher/Loxa-4B-i1-GGUF
4B • Updated • 122
Text Generation
• 4B • Updated • 19
mradermacher/CodeLoxa-4B-GGUF
4B • Updated • 56
• 1
mradermacher/CodeLoxa-4B-i1-GGUF
4B • Updated • 187
Text Generation
• 2B • Updated • 4
• 1
mradermacher/Loxa-1.6B-GGUF
2B • Updated • 46
mradermacher/Loxa-1.6B-i1-GGUF
2B • Updated • 174
frameai/Loxa-1.6B-uncensored
Text Generation
• 2B • Updated • 6
• 1
mradermacher/Loxa-1.6B-uncensored-GGUF
2B • Updated • 57
• 2
mradermacher/Loxa-1.6B-uncensored-i1-GGUF
2B • Updated • 156
ConfidentialMind/gte-multilingual-reranker-base-onnx-op14-opt-gpu-int8
Sentence Similarity
• Updated • 10
• 1
ConfidentialMind/gte-multilingual-reranker-base-onnx-op14-opt-gpu
Sentence Similarity
• Updated • 9
ConfidentialMind/gte-multilingual-reranker-base-onnx-op19-opt-gpu
Sentence Similarity
• Updated • 7
Robotics
• Updated sbeierle/fame-pytorch-kit
Updated
excribe/classifer_sgd_longformer_4099
Text Classification
• 0.1B • Updated • 7
Text Generation
• Updated AhmedAyman/k2-think-cuda-1505
Text Generation
• Updated • 4
Eltamuan/Gravitas-Torch-2.8-Blackwell-Edition
Updated
magiccodingman/Qwen3-4B-Instruct-2507-MXFP4-Hybrid-GGUF
Text Generation
• 4B • Updated • 196
magiccodingman/Qwen3-4B-Thinking-2507-MXFP4-Hybrid-GGUF
Text Generation
• 4B • Updated • 27
• 1
magiccodingman/Qwen3-4B-Thinking-2507-Unsloth-MXFP4-Hybrid-GGUF
Text Generation
• 4B • Updated • 67
• 1