gpo.zugaina.org

dev-python/flash-attn

Fast and memory-efficient exact attention (FlashAttention-2)

  • flash-attn-2.8.3_p1
    ~amd64
    python_single_target_python3_12 python_single_target_python3_13

    License: BSD
    Overlay: stuff

ChangeLog

commit 41c6cc60fe28b27cf770092112bdb8af84c811af
Author: Ivan S. Titov <iohann.s.titov@gmail.com>
Date: Thu Jun 18 20:45:35 2026 +0200

dev-python/flash-attn: drop 2.8.3

commit 0aafac8cb2029acf9eee4280f0081bde1534dd2b
Author: Ivan S. Titov <iohann.s.titov@gmail.com>
Date: Thu Jun 18 20:44:15 2026 +0200

dev-python/flash-attn: add 2.8.3_p1

commit e59a6b5d89df7a1e0f72333cedd2fb8e336c40fa
Author: Ivan S. Titov <iohann.s.titov@gmail.com>
Date: Wed Jun 17 10:09:16 2026 +0200

dev-python/flash-attn: new package, add 2.8.3

Provides flash_attn_varlen_func, required by the TRELLIS sparse-attention path.
The sdist omits its CUTLASS submodule, so a second SRC_URI stages CUTLASS at
the pinned commit (== 4.0.0) into csrc/cutlass during src_prepare. MAX_JOBS is
held at 2: the backward hdim128/256 kernels OOM this host at higher job counts.
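
The staging step and job cap described above could look roughly like the sketch below. It is not the overlay's actual ebuild: the function name stage_cutlass, its two arguments, and the directory layout are assumptions made for illustration, and the pinned CUTLASS commit is deliberately left out because the commit message does not state it. MAX_JOBS is the environment variable named in the commit message.

```shell
#!/bin/sh
# Hypothetical helper mirroring what src_prepare is described as doing.
# Assumed for illustration: the name stage_cutlass and both arguments.

# Move an unpacked CUTLASS tree into <pkg_src>/csrc/cutlass.
stage_cutlass() {
    cutlass_src=$1   # directory holding the unpacked CUTLASS tarball
    pkg_src=$2       # directory holding the unpacked flash-attn sdist

    if [ ! -d "$cutlass_src" ]; then
        echo "missing CUTLASS tree: $cutlass_src" >&2
        return 1
    fi

    # The sdist omits the submodule, so drop any empty placeholder
    # before moving the real tree into place.
    rm -rf "$pkg_src/csrc/cutlass"
    mkdir -p "$pkg_src/csrc"
    mv "$cutlass_src" "$pkg_src/csrc/cutlass"
}

# Cap parallel compile jobs, as the commit message describes.
export MAX_JOBS=2
```

In a real ebuild the equivalent logic would sit inside src_prepare and operate on the work directory populated from both SRC_URI entries; the standalone function form is used here only so the step can be tried outside Portage.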