Qwodel
Getting Started

Welcome to Qwodel — production-grade model quantization for Python.

This section will get you from zero to a quantized model in minutes.

In this section

PageWhat you'll learn
InstallationHow to install Qwodel and the backend extras you need
Quick StartYour first quantization in 5 minutes

What is Qwodel?

Qwodel is a unified Python library for quantizing large language models (LLMs). It wraps three battle-tested quantization backends behind one clean API:

BackendHardwareBest for
GGUFCPUBroad compatibility, llama.cpp runtimes
AWQNVIDIA GPUMaximum GPU inference speed
CoreMLApple SiliconiOS, macOS, iPadOS deployment

Next: Installation →