Automatic Speech Recognition
pyannote.audio
pyannote
pyannote-audio-pipeline
audio
voice
speech
speaker
speaker-diarization
speaker-change-detection
voice-activity-detection
overlapped-speech-detection
Instructions to use aTrain-core/speaker-detection with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- pyannote.audio
How to use aTrain-core/speaker-detection with pyannote.audio:
from pyannote.audio import Pipeline pipeline = Pipeline.from_pretrained("aTrain-core/speaker-detection") # inference on the whole file pipeline("file.wav") # inference on an excerpt from pyannote.core import Segment excerpt = Segment(start=2.0, end=5.0) from pyannote.audio import Audio waveform, sample_rate = Audio().crop("file.wav", excerpt) pipeline({"waveform": waveform, "sample_rate": sample_rate}) - Notebooks
- Google Colab
- Kaggle
- Xet hash:
- e5f31c78169dc3916a8aac05d8daa83705d7659515c5b34780edf09c3e5b9594
- Size of remote file:
- 134 kB
- SHA256:
- 9b77bcd840692710dd3496f62ecfeed8d8e5f002fd991b785079b244eab7d255
·
Xet efficiently stores Large Files inside Git, intelligently splitting files into unique chunks and accelerating uploads and downloads. More info.