-
-
-
-
-
-
Inference Providers
Active filters:
ollama
DavidAU/Maximizing-Model-Performance-All-Quants-Types-And-Full-Precision-by-Samplers_Parameters
Updated
•
157
Manojb/Qwen3-4B-toolcalling-gguf-codex
Text Generation
•
4B
•
Updated
•
1.76k
•
47
vito95311/Qwen3-Omni-30B-A3B-Thinking-GGUF-INT8FP16
Text Generation
•
16B
•
Updated
•
424
•
18
distil-labs/distil-qwen3-4b-text2sql-gguf-4bit
Text Generation
•
4B
•
Updated
•
113
•
2
amihai4by/logic-reasoner-v2
Text Generation
•
8B
•
Updated
•
54
•
2
Novaciano/Triangulum-1B-DPO_Roleplay_NSFW-GGUF
Text Generation
•
1B
•
Updated
•
314
•
6
mirazrafi/NSFW-RP-RolePlay-LoRA-ArliAI-Llama-3.1-8B
Text Generation
•
Updated
•
272
•
5
pacozaa/mistral-unsloth-chatml-first
4B
•
Updated
•
117
pacozaa/tinyllama-alpaca-lora
7B
•
Updated
•
49
pacozaa/TinyLlama-1.1B-intermediate-step-1431k-3T-GGUF
1B
•
Updated
•
112
pacozaa/mistral-sharegpt90k
Updated
pacozaa/mistral-sharegpt90k-merged_16bit
Text Generation
•
7B
•
Updated
•
48
TrabEsrever/dolphin-2.9-llama3-70b-GGUF
Updated
daekeun-ml/Phi-3-medium-4k-instruct-ko-poc-gguf-v0.1
Text Generation
•
14B
•
Updated
•
21
•
1
hierholzer/Llama-3.1-70B-Instruct-GGUF
Text Generation
•
71B
•
Updated
•
92
•
3
LucasInsight/Meta-Llama-3.1-8B-Instruct
8B
•
Updated
•
26
•
1
LucasInsight/Meta-Llama-3-8B-Instruct
8B
•
Updated
•
53
Shyamnath/Llama-3.2-3b-Uncensored-GGUF
Text Generation
•
4B
•
Updated
•
137
•
4
ghost-x/ghost-8b-beta-1608-gguf
Text Generation
•
8B
•
Updated
•
137
•
6
cahaj/Phi-3.5-mini-instruct-text2sql-GGUF
4B
•
Updated
•
26
Agnuxo/Tinytron-Qwen-0.5B-Instruct_CODE_Python_Spanish_English_16bit
0.5B
•
Updated
Agnuxo/Tinytron-Qwen-0.5B-TinyLlama-Instruct_CODE_Python-extra_small_quantization_GGUF_3bit
0.5B
•
Updated
Agnuxo/Tinytron-Qwen-0.5B-Instruct_CODE_Python-Spanish_English_GGUF_4bit
0.5B
•
Updated
Agnuxo/Tinytron-Qwen-0.5B-TinyLlama-Instruct_CODE_Python-Spanish_English_GGUF_q5_k
0.5B
•
Updated
Agnuxo/Tinytron-Qwen-0.5B-TinyLlama-Instruct_CODE_Python-Spanish_English_GGUF_q6_k
0.5B
•
Updated
Agnuxo/Tinytron-Qwen-0.5B-Instruct_CODE_Python-GGUF_Spanish_English_8bit
0.5B
•
Updated
Agnuxo/Tinytron-Qwen-0.5B-Instruct_CODE_Python_English_GGUF_16bit
0.5B
•
Updated
•
1
Agnuxo/Tinytron-Qwen-0.5B-TinyLlama-Instruct_CODE_Python-Spanish_English_GGUF_32bit
Updated
3B
•
Updated
•
16