This is a quantized GGUF version of @intel neural-chat 3.1 7B quantized to 4_0 and 8_0 bits.
(link to the original model : https://huggingface.co/Intel/neural-chat-7b-v3-1/)
- Downloads last month
- 37
Hardware compatibility
Log In to add your hardware
4-bit
8-bit
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support