gentoo.mahdi.cz

  

dev-python:qllm


A general x-bit quantization engine for LLMs, [2-8] bits, awq/gptq/hqq [wheel]
https://github.com/wejoncy/QLLM

qllm-0.1.8 Apache-2.0
download ~amd64 ~x86 pypi