gentoo.mahdi.cz
dev-python/flexgen
Runs large language models such as OPT-175B/GPT-3 on a single GPU, with a focus on high-throughput, large-batch generation.
https://github.com/FMInference/FlexGen
flexgen-1.0      ~amd64 ~x86    pypi
flexgen-0.1.8    ~amd64 ~x86    pypi