Fast inference for Blackwell GPUs
AI & ML interests
None defined yet.
Recent Activity
View all activity
-
ig1/Qwen2.5-VL-7B-Instruct-NVFP4
Image-Text-to-Text • 5B • Updated • 31 -
ig1/Qwen2.5-VL-7B-Instruct-FP8-Dynamic
Image-Text-to-Text • 8B • Updated • 33 -
ig1/Qwen2.5-VL-32B-Instruct-FP8-Dynamic
Image-Text-to-Text • 33B • Updated • 67 -
ig1/Qwen2.5-VL-72B-Instruct-FP8-Dynamic
Image-Text-to-Text • 73B • Updated • 41
Fast inference for Blackwell GPUs
-
ig1/Qwen2.5-VL-7B-Instruct-NVFP4
Image-Text-to-Text • 5B • Updated • 31 -
ig1/Qwen2.5-VL-7B-Instruct-FP8-Dynamic
Image-Text-to-Text • 8B • Updated • 33 -
ig1/Qwen2.5-VL-32B-Instruct-FP8-Dynamic
Image-Text-to-Text • 33B • Updated • 67 -
ig1/Qwen2.5-VL-72B-Instruct-FP8-Dynamic
Image-Text-to-Text • 73B • Updated • 41
models
13
ig1/Qwen3-30B-A3B-Instruct-2507-NVFP4
17B
•
Updated
•
13
ig1/Qwen3-30B-A3B-NVFP4
17B
•
Updated
•
30
ig1/Qwen3-VL-30B-A3B-Instruct-NVFP4
Image-Text-to-Text
•
18B
•
Updated
•
99
•
1
ig1/Qwen3-Coder-30B-A3B-Instruct-NVFP4
Text Generation
•
17B
•
Updated
•
188
•
1
ig1/Qwen2.5-VL-7B-Instruct-NVFP4
Image-Text-to-Text
•
5B
•
Updated
•
31
ig1/Qwen2.5-VL-7B-Instruct-FP8-Dynamic
Image-Text-to-Text
•
8B
•
Updated
•
33
ig1/Qwen3-Next-80B-A3B-Instruct-NVFP4
Text Generation
•
Updated
•
159
ig1/Qwen2.5-VL-32B-Instruct-FP8-Dynamic
Image-Text-to-Text
•
33B
•
Updated
•
67
ig1/Qwen2.5-VL-72B-Instruct-FP8-Dynamic
Image-Text-to-Text
•
73B
•
Updated
•
41
ig1/r1-1776-AWQ
Updated
•
1
datasets
0
None public yet