gentoo.mahdi.cz

  

dev-python:flexgen


Runs large language models such as OPT-175B/GPT-3 on a single GPU, with a focus on high-throughput, large-batch generation.
https://github.com/FMInference/FlexGen

flexgen-1.0
download ~amd64 ~x86 pypi
flexgen-0.1.8
download ~amd64 ~x86 pypi