Speaker diarization
Relies on pyannote.audio 2.0, which is currently in development: see the installation instructions.
from pyannote.audio import Pipeline
pipeline = Pipeline.from_pretrained("AMITKESARI2000/pyannote_SD1")
output = pipeline("audio.wav")
for turn, _, speaker in output.itertracks(yield_label=True):
    # speaker speaks between turn.start and turn.end
    ...
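As one possible use of the loop above, the sketch below totals the speaking time per speaker. It is an illustration, not part of this model: `speaking_time` and `example_turns` are hypothetical names, and the turns are made-up values standing in for the `(turn.start, turn.end, speaker)` triples that the loop would yield.

```python
from collections import defaultdict


def speaking_time(turns):
    """Sum the duration (in seconds) of each speaker's turns.

    `turns` is an iterable of (start, end, speaker) triples.
    """
    totals = defaultdict(float)
    for start, end, speaker in turns:
        totals[speaker] += end - start
    return dict(totals)


# Hypothetical turns, for illustration only (not real model output).
example_turns = [
    (0.0, 2.5, "SPEAKER_00"),
    (2.5, 4.0, "SPEAKER_01"),
    (4.0, 6.0, "SPEAKER_00"),
]

totals = speaking_time(example_turns)
for speaker, seconds in sorted(totals.items()):
    print(f"{speaker}: {seconds:.1f} s")
```

With a real pipeline output, the same function could presumably be fed `((turn.start, turn.end, speaker) for turn, _, speaker in output.itertracks(yield_label=True))`.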
Benchmark
| Dataset | Diarization error rate | 
|---|---|
| AMI only_words evaluation set | 21.4% |
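For readers unfamiliar with the metric, diarization error rate is commonly defined as the sum of false alarm, missed detection, and speaker confusion durations, divided by the total duration of reference speech. The sketch below assumes that standard definition; the function name and the input durations are hypothetical and do not reproduce the benchmark figure above.

```python
def diarization_error_rate(false_alarm, missed_detection, confusion, total):
    """Return DER as a fraction of total reference speech duration.

    All arguments are durations in seconds. `total` is the total
    duration of speech in the reference annotation.
    """
    if total <= 0:
        raise ValueError("total reference speech duration must be positive")
    return (false_alarm + missed_detection + confusion) / total


# Hypothetical durations in seconds, for illustration only.
der = diarization_error_rate(
    false_alarm=10.0, missed_detection=20.0, confusion=30.0, total=400.0
)
print(f"DER = {der:.1%}")
```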