Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feat/joint diarization and embedding with prepared data #1583

Draft
wants to merge 103 commits into
base: develop
Choose a base branch
from

Conversation

clement-pages
Copy link
Collaborator

No description provided.

chai3 and others added 30 commits June 8, 2023 08:42
BREAKING(model): get rid of (flaky) `Model.introspection`
- fixes the dimension error between files id and probabilties arrays
- changes the way of how chunks for the embedding task are sampled
- creates two functions to draw chunks, one for each subtask

Tests are required to ensure that there are no bugs
For now this is a copy past from methods in segmentation task.
as computing this loss probably does not make sense in powerset
mode because first class (empty set of labels) does exactly this
as this instance attribute was not used
as these loop could break gradient flow and to optimize
the code
for now do the trick only for the diarization subtask
@hbredin
Copy link
Member

hbredin commented Jul 8, 2024

I just pushed a (possibly buggy) pipeline that seems to work with a joint model

from pyannote.audio.pipelines.speaker_diarization import SpeakerDiarizationV2
import torch

device = torch.device('cuda')
pipeline = SpeakerDiarizationV2('/path/to/joint.ckpt', batch_size=1, step=0.2).to(device)

# parameters obviously need to be optimized
pipeline.instantiate({'clustering': {'threshold': 0.75, 'method': 'centroid', 'min_cluster_size': 1}})

diarization = pipeline('/path/to/audio.wav')

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants