Automatic Speech Recognition
pyannote.audio
pyannote
pyannote-audio-pipeline
audio
voice
speech
speaker
speaker-diarization
speaker-change-detection
voice-activity-detection
overlapped-speech-detection
Instructions to use hicustomer/pyannote-speaker-diarization with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- pyannote.audio
How to use hicustomer/pyannote-speaker-diarization with pyannote.audio:
from pyannote.audio import Pipeline pipeline = Pipeline.from_pretrained("hicustomer/pyannote-speaker-diarization") # inference on the whole file pipeline("file.wav") # inference on an excerpt from pyannote.core import Segment excerpt = Segment(start=2.0, end=5.0) from pyannote.audio import Audio waveform, sample_rate = Audio().crop("file.wav", excerpt) pipeline({"waveform": waveform, "sample_rate": sample_rate}) - Notebooks
- Google Colab
- Kaggle
| # generated by rye | |
| # use `rye lock` or `rye sync` to update this lockfile | |
| # | |
| # last locked with the following flags: | |
| # pre: false | |
| # features: [] | |
| # all-features: false | |
| -e file:. | |
| absl-py==1.4.0 | |
| aiohttp==3.8.5 | |
| aiosignal==1.3.1 | |
| alembic==1.11.2 | |
| antlr4-python3-runtime==4.9.3 | |
| asteroid-filterbanks==0.4.0 | |
| async-timeout==4.0.2 | |
| attrs==23.1.0 | |
| audioread==3.0.0 | |
| backports-cached-property==1.0.2 | |
| cachetools==5.3.1 | |
| certifi==2023.7.22 | |
| cffi==1.15.1 | |
| charset-normalizer==3.2.0 | |
| click==8.1.6 | |
| cmaes==0.10.0 | |
| colorama==0.4.6 | |
| colorlog==6.7.0 | |
| contourpy==1.1.0 | |
| cycler==0.11.0 | |
| decorator==5.1.1 | |
| docopt==0.6.2 | |
| einops==0.3.2 | |
| filelock==3.12.2 | |
| fonttools==4.42.0 | |
| frozenlist==1.4.0 | |
| fsspec==2023.6.0 | |
| google-auth==2.22.0 | |
| google-auth-oauthlib==1.0.0 | |
| greenlet==2.0.2 | |
| grpcio==1.56.2 | |
| hmmlearn==0.2.8 | |
| huggingface-hub==0.16.4 | |
| hyperpyyaml==1.2.1 | |
| idna==3.4 | |
| importlib-metadata==6.8.0 | |
| importlib-resources==6.0.1 | |
| joblib==1.3.1 | |
| julius==0.2.7 | |
| kiwisolver==1.4.4 | |
| librosa==0.9.2 | |
| llvmlite==0.40.1 | |
| mako==1.2.4 | |
| markdown==3.4.4 | |
| markdown-it-py==3.0.0 | |
| markupsafe==2.1.3 | |
| matplotlib==3.7.2 | |
| mdurl==0.1.2 | |
| mpmath==1.3.0 | |
| multidict==6.0.4 | |
| networkx==2.8.8 | |
| numba==0.57.1 | |
| numpy==1.24.4 | |
| oauthlib==3.2.2 | |
| omegaconf==2.3.0 | |
| optuna==3.3.0 | |
| packaging==23.1 | |
| pandas==2.0.3 | |
| pillow==10.0.0 | |
| platformdirs==3.10.0 | |
| pooch==1.7.0 | |
| primepy==1.3 | |
| protobuf==3.20.1 | |
| pyannote-audio @ git+https://github.com/pyannote/pyannote-audio.git@2.1.1 | |
| pyannote-core==4.5 | |
| pyannote-database==4.1.3 | |
| pyannote-metrics==3.2.1 | |
| pyannote-pipeline==2.3 | |
| pyasn1==0.5.0 | |
| pyasn1-modules==0.3.0 | |
| pycparser==2.21 | |
| pydeprecate==0.3.2 | |
| pygments==2.16.1 | |
| pyparsing==3.0.9 | |
| python-dateutil==2.8.2 | |
| pytorch-lightning==1.6.5 | |
| pytorch-metric-learning==1.7.3 | |
| pytz==2023.3 | |
| pyyaml==6.0.1 | |
| requests==2.31.0 | |
| requests-oauthlib==1.3.1 | |
| resampy==0.4.2 | |
| rich==13.5.2 | |
| rsa==4.9 | |
| ruamel-yaml==0.17.28 | |
| ruamel-yaml-clib==0.2.7 | |
| scikit-learn==1.3.0 | |
| scipy==1.11.1 | |
| semver==2.13.0 | |
| sentencepiece==0.1.99 | |
| shellingham==1.5.0.post1 | |
| simplejson==3.19.1 | |
| singledispatchmethod==1.0 | |
| six==1.16.0 | |
| sortedcontainers==2.4.0 | |
| soundfile==0.10.3.post1 | |
| speechbrain==0.5.15 | |
| sqlalchemy==2.0.19 | |
| sympy==1.12 | |
| tabulate==0.9.0 | |
| tensorboard==2.13.0 | |
| tensorboard-data-server==0.7.1 | |
| threadpoolctl==3.2.0 | |
| torch==1.11.0 | |
| torch-audiomentations==0.11.0 | |
| torch-pitch-shift==1.2.4 | |
| torchaudio==0.11.0 | |
| torchmetrics==0.11.4 | |
| torchvision==0.12.0 | |
| tqdm==4.65.0 | |
| typer==0.9.0 | |
| typing-extensions==4.7.1 | |
| tzdata==2023.3 | |
| urllib3==1.26.16 | |
| werkzeug==2.3.6 | |
| wheel==0.41.1 | |
| yarl==1.9.2 | |
| zipp==3.16.2 | |
| # The following packages are considered to be unsafe in a requirements file: | |
| setuptools==68.0.0 | |