Search Portage & Overlays:
Newest
News
Repository news
GLSAs
Browse
USE Flags
Overlays
More...
dev-python
/flexgen
Running large language models like OPT-175B/GPT-3 on a single GPU. Focusing on high-throughput large-batch generation.
Screenshots
https://github.com/FMInference/FlexGen
flexgen-1.0
~amd64 ~x86
python_targets_python3_11 python_targets_python3_12 python_targets_python3_13 python_targets_python3_14
View
Download
Browse
License:
Overlay:
pypi
flexgen-0.1.8
~amd64 ~x86
python_targets_python3_11 python_targets_python3_12 python_targets_python3_13 python_targets_python3_14
View
Download
Browse
License:
Overlay:
pypi
ChangeLog
USE Flags
Dependencies
Reverse Deps
Related Bugs