Active filters: modelopt
chankhavu/Nemotron-Cascade-2-30B-A3B-NVFP4
Text Generation
• 16B • Updated • 7.68k
• 13
bg-digitalservices/Gemma-4-26B-A4B-it-NVFP4
Text Generation
• 15B • Updated • 284k
• 32
bahadirakdemir/gemma-4-31B-it-text-fp8
Text Generation
• 31B • Updated • 1.23k
• 2
ykarout/Qwen3.6-35B-A3B-NVFP4
Image-Text-to-Text
• 36B • Updated • 12.8k
• 1
mmangkad/Qwen3.6-27B-NVFP4
Text Generation
• 20B • Updated • 47.8k
• 6
sakamakismile/Huihui-Qwen3.6-27B-abliterated-NVFP4-TEXT-MTP
Text Generation
• 17B • Updated • 9.75k
• 3
sakamakismile/Carnice-V2-27b-NVFP4-TEXT-MTP
Text Generation
• 17B • Updated • 11.7k
• 10
necroyancer/gemma-4-31B-it-NVFP4-turbo-vision
Image-Text-to-Text
• 33B • Updated • 2.27k
• 2
lukealonso/MiMo-V2.5-NVFP4
179B • Updated • 29.9k
• 18
AEON-7/Nemotron-3-Nano-Omni-AEON-Ultimate-Uncensored-NVFP4
Any-to-Any
• 20B • Updated • 4.23k
• 6
llmfan46/Qwen3.6-35B-A3B-uncensored-heretic-NVFP4-Experts-Only
Image-Text-to-Text
• Updated • 907
• 2
gsting/Huihui-Qwen3-Coder-Next-abliterated-FP8
80B • Updated • 18
• 1
Kaleto/Anubis-Pro-105B-NVFP4
Text Generation
• 60B • Updated • 332
• 1
switzerchees/ZAYA1-8B-NVFP4
Text Generation
• 5B • Updated • 251
• 1
AEON-7/Gemma-4-31B-it-DECKARD-HERETIC-Uncensored-NVFP4
Text Generation
• 18B • Updated • 4.48k
• 10
nvidia/Llama-4-Scout-17B-16E-Instruct-NVFP4
56B • Updated • 179k
• 32
nvidia/Llama-4-Maverick-17B-128E-Instruct-FP8
402B • Updated • 750
• 14
ishan24/test_modelopt_quant
jiangchengchengNLP/L3.3-MS-Nevoria-70b-FP8
Text Generation
• 71B • Updated • 4
NVFP4/Qwen3-30B-A3B-Instruct-2507-FP4
Text Generation
• 16B • Updated • 545
• 12
NVFP4/Qwen3-Coder-30B-A3B-Instruct-FP4
Text Generation
• 16B • Updated • 7.76k
• 27
gesong2077/Qwen3-32B-NVFP4
19B • Updated • 4
• 1
54B • Updated
nvidia/Phi-4-multimodal-instruct-NVFP4
4B • Updated • 9.34k
• 11
nvidia/Phi-4-multimodal-instruct-FP8
6B • Updated • 661
• 7
nvidia/Phi-4-reasoning-plus-NVFP4
8B • Updated • 970
• 9
Text Generation
• 5B • Updated • 53.8k
• 17
Text Generation
• 8B • Updated • 47.2k
• 11
nvidia/Qwen2.5-VL-7B-Instruct-FP8
Text Generation
• 8B • Updated • 1.26k
• 8
nvidia/Qwen2.5-VL-7B-Instruct-NVFP4
Text Generation
• 5B • Updated • 234k
• 15