quantization
The process of reducing the numerical precision of model weights (e.g., from float32 to int8) to decrease memory footprint and improve inference speed, often with minimal accuracy loss.
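The float32-to-int8 reduction can be sketched as simple symmetric quantization: compute a scale that maps the largest-magnitude weight to the int8 range, round, and store the integers plus the scale. A minimal NumPy sketch (the helper names `quantize_int8` and `dequantize` are illustrative, not from any particular library):

```python
import numpy as np

def quantize_int8(weights: np.ndarray):
    # Scale maps the largest-magnitude float weight onto the int8 limit (127).
    scale = np.max(np.abs(weights)) / 127.0
    q = np.clip(np.round(weights / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q: np.ndarray, scale: float) -> np.ndarray:
    # Approximate reconstruction of the original float32 weights.
    return q.astype(np.float32) * scale

w = np.array([0.5, -1.2, 0.03, 2.4], dtype=np.float32)
q, scale = quantize_int8(w)
w_hat = dequantize(q, scale)
# Per-weight reconstruction error is bounded by about scale / 2,
# which is the "minimal accuracy loss" the definition refers to.
```

Storing `q` (1 byte per weight) instead of `w` (4 bytes) gives the 4x memory reduction; integer arithmetic on `q` is also typically faster at inference time.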