Make sure to install it to run the models (note aqlm works only with python>=3.10): pip install aqlm[gpu,cpu] The library provides efficient kernels for both GPU and CPU inference.