Models

202

Full-text search

Active filters: vLLM

unsloth/Mistral-Small-4-119B-2603-GGUF

119B • Updated 5 days ago • 32.1k • 65

QuantTrio/Qwen3.6-35B-A3B-AWQ

Image-Text-to-Text • 36B • Updated 8 days ago • 97.3k • 13

QuantTrio/MiniMax-M2.7-AWQ

Text Generation • 229B • Updated 4 days ago • 22.5k • 5

QuantTrio/gemma-4-31B-it-AWQ

Image-Text-to-Text • 31B • Updated 8 days ago • 125k • 10

mistralai/Mistral-Small-4-119B-2603

119B • Updated 3 days ago • 61.5k • 361

QuantTrio/GLM-5.1-AWQ

Text Generation • 754B • Updated 3 days ago • 307 • 3

QuantTrio/Qwen3.5-27B-AWQ

Image-Text-to-Text • 28B • Updated Mar 2 • 403k • 43

selode-ai/Qwen-3.6-35B-A3B-VRAP-4-bit-AWQ-21.2GB

Image-Text-to-Text • 29B • Updated 3 days ago • 91 • 11

QuantTrio/Qwen3.6-27B-AWQ

Image-Text-to-Text • 28B • Updated 2 days ago • 7.18k • 2

QuantTrio/GLM-4.7-Flash-AWQ

Text Generation • 31B • Updated Jan 21 • 103k • 12

QuantTrio/MiniMax-M2.5-AWQ

Text Generation • 229B • Updated Feb 16 • 89.3k • 15

QuantTrio/Qwen3.5-35B-A3B-AWQ

Image-Text-to-Text • 36B • Updated Feb 26 • 257k • 18

QuantTrio/Qwen3.5-122B-A10B-AWQ

Image-Text-to-Text • 125B • Updated Feb 26 • 69.9k • 26

QuantTrio/GLM-5-AWQ

Text Generation • 586B • Updated Feb 28 • 4.13k • 6

unsloth/Mistral-Small-4-119B-2603

119B • Updated 6 days ago • 240 • 4

Xingyu-Zheng/Qwopus3.5-27B-v3.5-INT4-FOEM

Image-Text-to-Text • 27B • Updated 8 days ago • 562 • 1

QuantTrio/Qwen3.6-27B-AWQ-6Bit

Image-Text-to-Text • 28B • Updated 2 days ago • 1.04k • 1

model-scope/glm-4-9b-chat-GPTQ-Int4

Text Generation • 9B • Updated Jul 17, 2024 • 129 • 6

model-scope/glm-4-9b-chat-GPTQ-Int8

Text Generation • 9B • Updated Jul 23, 2024 • 4 • 2

tclf90/qwen2.5-72b-instruct-gptq-int4

Text Generation • 73B • Updated May 12, 2025 • 105 • 2

tclf90/qwen2.5-72b-instruct-gptq-int3

Text Generation • 69B • Updated May 12, 2025 • 77

prithivMLmods/Nu2-Lupi-Qwen-14B

Text Generation • 15B • Updated Mar 27, 2025 • 5 • 2

mradermacher/Nu2-Lupi-Qwen-14B-GGUF

15B • Updated Jul 11, 2025 • 143 • 1

mradermacher/Nu2-Lupi-Qwen-14B-i1-GGUF

15B • Updated Jul 11, 2025 • 270 • 1

JunHowie/Qwen3-0.6B-GPTQ-Int4

Text Generation • 0.6B • Updated Sep 3, 2025 • 160 • 1

JunHowie/Qwen3-0.6B-GPTQ-Int8

Text Generation • 0.6B • Updated Sep 3, 2025 • 6

JunHowie/Qwen3-1.7B-GPTQ-Int4

Text Generation • 2B • Updated Sep 3, 2025 • 123 • 1

JunHowie/Qwen3-1.7B-GPTQ-Int8

Text Generation • 2B • Updated Sep 3, 2025 • 10

JunHowie/Qwen3-32B-GPTQ-Int4

Text Generation • 33B • Updated Sep 5, 2025 • 28.2k • 4

JunHowie/Qwen3-32B-GPTQ-Int8

Text Generation • 33B • Updated Sep 5, 2025 • 228 • 4