pyannote.audio Neural Speaker Diarization Toolkit
pyannote.audio is an open-source Python toolkit for speaker diarization built on PyTorch. It provides state-of-the-art pretrained models and pipelines for speech activity detection, speaker segmentation, overlapped speech detection, and speaker embedding.
What it does
pyannote.audio Neural Speaker Diarization Toolkit
pyannote.audio is an open-source Python toolkit for speaker diarization built on PyTorch. It provides state-of-the-art pretrained models and pipelines for speech activity detection, speaker segmentation, overlapped speech detection, and speaker embedding.
Installation
Use the upstream install or setup path that matches your environment:
- pip install -e .[dev,testing]
Requirements and caveats from upstream:
- pyannote.audio is an open-source toolkit written in Python for speaker diarization. Based on PyTorch machine learning framework, it comes with state-of-the-art [pretrained models and pipelines](...
- :snake: Python-first API
- python
Basic usage or getting-started notes:
-
With the optional telemetry feature in pyannote.audio, you can choose to send anonymous usage metrics to help the pyannote team improve the library.
-
Extracted from upstream docs: https://raw.githubusercontent.com/pyannote/pyannote-audio/HEAD/README.md
Source
Capabilities
Install
Quality
deterministic score 0.45 from registry signals: · indexed on github topic:agent-skills · 8 github stars · SKILL.md body (1,197 chars)