4-bit GPTQ quantized version of cogito-v1-preview-qwen-32B for use with the Private LLM app.

Model tree for numen-tech/cogito-v1-preview-qwen-32B-GPTQ-Int4

Base model: Qwen/Qwen2.5-32B
Quantized (21): this model
