sci-ml/pyannote-audio
State-of-the-art speaker diarization toolkit (PyTorch)
ChangeLog
commit 09a3601e682611210790bfead087f5016a7ba07e
Author: Ivan S. Titov <iohann.s.titov@gmail.com>
Date: Mon May 18 19:43:44 2026 +0200
sci-ml/pyannote-audio: use canonical PyPI slug (pyannote-audio)
pypi.org/project/pyannote.audio/ permanently redirects to
/pyannote-audio/. Switch HOMEPAGE + metadata.xml pypi remote-id
to the hyphen form. PYPI_PN stays "pyannote.audio" (the sdist
filename still uses the dot form).
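
The redirect described above matches the project-name normalization rule in PEP 503, under which runs of ".", "-" and "_" are treated as equivalent and collapsed to a single hyphen. A minimal sketch of that rule follows. The helper name normalize_pypi_name is hypothetical, and the snippet only illustrates why the dot and hyphen forms resolve to the same project; it is not code from the ebuild or from the pypi eclass.

```python
import re


def normalize_pypi_name(name: str) -> str:
    """Normalize a project name as described in PEP 503 (hypothetical helper)."""
    # Collapse every run of '-', '_' or '.' into one hyphen, then lowercase.
    return re.sub(r"[-_.]+", "-", name).lower()


# The dot form kept in PYPI_PN and the hyphen form used in HOMEPAGE
# normalize to the same slug.
dot_form = normalize_pypi_name("pyannote.audio")
hyphen_form = normalize_pypi_name("pyannote-audio")
```

Both calls yield the same normalized slug, which is why the metadata can use the hyphen form while the sdist filename keeps the dot form.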
commit 4138521faf073de9deffcaa3e515c1acdf265ae3
Author: Ivan S. Titov <iohann.s.titov@gmail.com>
Date: Sat May 16 10:52:40 2026 +0200
sci-ml/pyannote-audio: new package, add 4.0.4
State-of-the-art speaker diarization, voice activity detection,
overlapped speech detection, and speaker embedding toolkit.
Sits on top of the freshly-landed lightning/torchcodec/torchmetrics/
pyannote-{core,database,metrics,pipeline}/pyannoteai-sdk stack
(16 sibling deps in this commit series).
The Python package alone is enough to import; running the
pretrained pyannote/speaker-diarization-3.1 pipeline at runtime
additionally requires a HuggingFace token and accepting the model
terms on huggingface.co/pyannote — that's a per-user setup step,
not something the ebuild can do.
SINGLE_IMPL via pytorch. Tests restricted (papermill + downloaded
HF model bundle); local build-check verified with USE=-cuda.

