gpo.zugaina.org

Search Portage & Overlays:

dev-python/flashinfer-cubin

Pre-compiled cubins for FlashInfer kernels

Screenshots

  • flashinfer-cubin-0.6.8_p1
    -* ~amd64
    python_targets_python3_11 python_targets_python3_12 python_targets_python3_13 python_targets_python3_14

    View      Download      Browse     License: all-rights-reserved   
    Overlay: stuff

ChangeLog

commit eb22ce35bf24a0caa261fe42967b3cb435739d7d
Author: Ivan S. Titov <iohann.s.titov@gmail.com>
Date: Thu May 7 15:33:12 2026 +0200

dev-python/flashinfer-cubin: new package, 0.6.8.post1

Tier 0 leaf for the vllm CUDA target packaging cycle. Pre-compiled
NVIDIA cubins for FlashInfer kernels — shipped only as a single
py3-none-any wheel on PyPI; no upstream source. Required as a runtime
sidecar by flashinfer-python.

Gentoo's PMS version syntax forbids ".postN", so PyPI's "0.6.8.post1"
is encoded as Gentoo PV "0.6.8_p1" with MY_PV translating back to
the PyPI form for the wheel filename.

The wheel is large (~280 MiB) — pre-compiled cubins for many CUDA
arches. Marked QA_PREBUILT and bindist-restricted.