Backends

Qwodel wraps three quantization backends. Each backend page covers its supported formats, initialization parameters, and runtime overrides.

In this section

Page	Backend	Hardware
GGUF	llama.cpp-compatible quantization	CPU
AWQ	Activation-Aware Weight Quantization	NVIDIA GPU
CoreML	Apple on-device inference	Apple Silicon / Intel Mac

Not sure which to pick? See Choosing a Backend →
