Gentoo Portage Overlays - sci-ml/pyannote-audio

sci-ml/pyannote-audio

State-of-the-art speaker diarization toolkit (PyTorch)

Screenshots

pyannote-audio-4.0.4

~amd64

cli python_single_target_python3_12 python_single_target_python3_13 python_single_target_python3_14

View

Download

Browse License: MIT

Overlay: stuff

ChangeLog USE Flags Dependencies Reverse Deps Related Bugs

ChangeLog

commit 09a3601e682611210790bfead087f5016a7ba07e
Author: Ivan S. Titov <iohann.s.titov@gmail.com>
Date: Mon May 18 19:43:44 2026 +0200

sci-ml/pyannote-audio: use canonical PyPI slug (pyannote-audio)

pypi.org/project/pyannote.audio/ permanently redirects to
/pyannote-audio/. Switch HOMEPAGE + metadata.xml pypi remote-id
to the hyphen form. PYPI_PN stays "pyannote.audio" (the sdist
filename still uses the dot form).

commit 4138521faf073de9deffcaa3e515c1acdf265ae3
Author: Ivan S. Titov <iohann.s.titov@gmail.com>
Date: Sat May 16 10:52:40 2026 +0200

sci-ml/pyannote-audio: new package, add 4.0.4

State-of-the-art speaker diarization, voice activity detection,
overlapped speech detection, and speaker embedding toolkit.

Sits on top of the freshly-landed lightning/torchcodec/torchmetrics/
pyannote-{core,database,metrics,pipeline}/pyannoteai-sdk stack
(16 sibling deps in this commit series).

The Python package alone is enough to import; running the
pretrained pyannote/speaker-diarization-3.1 pipeline at runtime
additionally requires a HuggingFace token and accepting the model
terms on huggingface.co/pyannote — that's a per-user setup step,
not something the ebuild can do.

SINGLE_IMPL via pytorch. Tests restricted (papermill + downloaded
HF model bundle); local build-check verified with USE=-cuda.