-
-
Notifications
You must be signed in to change notification settings - Fork 780
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Feat/joint diarization and embedding with prepared data #1583
base: develop
Are you sure you want to change the base?
Feat/joint diarization and embedding with prepared data #1583
Commits on Jun 8, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 0551070 - Browse repository at this point
Copy the full SHA 0551070View commit details
Commits on Jun 12, 2023
-
feat(task): add support for multi-task models (pyannote#1374)
BREAKING(model): get rid of (flaky) `Model.introspection`
Configuration menu - View commit details
-
Copy full SHA for 30ddb0b - Browse repository at this point
Copy the full SHA 30ddb0bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 4eb7190 - Browse repository at this point
Copy the full SHA 4eb7190View commit details
Commits on Jun 15, 2023
-
Configuration menu - View commit details
-
Copy full SHA for dcdfc15 - Browse repository at this point
Copy the full SHA dcdfc15View commit details
Commits on Jun 17, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 87f49f9 - Browse repository at this point
Copy the full SHA 87f49f9View commit details -
Merge branch 'develop' of github.com:clement-pages/pyannote-audio int…
…o feat/joint-diarization-and-embedding
Configuration menu - View commit details
-
Copy full SHA for 6025a80 - Browse repository at this point
Copy the full SHA 6025a80View commit details
Commits on Jun 19, 2023
-
update
train__iter__helper
method of the joint task- fixes the dimension error between files id and probabilties arrays - changes the way of how chunks for the embedding task are sampled - creates two functions to draw chunks, one for each subtask Tests are required to ensure that there are no bugs
Configuration menu - View commit details
-
Copy full SHA for 58599c9 - Browse repository at this point
Copy the full SHA 58599c9View commit details -
Configuration menu - View commit details
-
Copy full SHA for 04de82f - Browse repository at this point
Copy the full SHA 04de82fView commit details -
For now this is a copy past from methods in segmentation task.
Configuration menu - View commit details
-
Copy full SHA for d8cb598 - Browse repository at this point
Copy the full SHA d8cb598View commit details -
Configuration menu - View commit details
-
Copy full SHA for d2d6e14 - Browse repository at this point
Copy the full SHA d2d6e14View commit details -
as computing this loss probably does not make sense in powerset mode because first class (empty set of labels) does exactly this
Configuration menu - View commit details
-
Copy full SHA for e58943b - Browse repository at this point
Copy the full SHA e58943bView commit details -
Configuration menu - View commit details
-
Copy full SHA for bc989cd - Browse repository at this point
Copy the full SHA bc989cdView commit details -
Configuration menu - View commit details
-
Copy full SHA for b4d0a78 - Browse repository at this point
Copy the full SHA b4d0a78View commit details
Commits on Jun 20, 2023
-
1
Configuration menu - View commit details
-
Copy full SHA for 78718b1 - Browse repository at this point
Copy the full SHA 78718b1View commit details -
Configuration menu - View commit details
-
Copy full SHA for dfdd8f3 - Browse repository at this point
Copy the full SHA dfdd8f3View commit details -
4
Configuration menu - View commit details
-
Copy full SHA for 1888360 - Browse repository at this point
Copy the full SHA 1888360View commit details
Commits on Jun 21, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 6216d1f - Browse repository at this point
Copy the full SHA 6216d1fView commit details -
Configuration menu - View commit details
-
Copy full SHA for b42cc33 - Browse repository at this point
Copy the full SHA b42cc33View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3d295dd - Browse repository at this point
Copy the full SHA 3d295ddView commit details
Commits on Jun 22, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 3363be6 - Browse repository at this point
Copy the full SHA 3363be6View commit details
Commits on Jun 23, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 99a7762 - Browse repository at this point
Copy the full SHA 99a7762View commit details -
remove
diarization_database_files
attributeas this instance attribute was not used
Configuration menu - View commit details
-
Copy full SHA for f2a4e34 - Browse repository at this point
Copy the full SHA f2a4e34View commit details -
feat(pipeline): add
return_embeddings
option to `SpeakerDiarization……` pipeline Co-authored-by: Hervé BREDIN <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 017c910 - Browse repository at this point
Copy the full SHA 017c910View commit details
Commits on Jun 27, 2023
-
Configuration menu - View commit details
-
Copy full SHA for cf0e3b3 - Browse repository at this point
Copy the full SHA cf0e3b3View commit details -
Configuration menu - View commit details
-
Copy full SHA for f48b74f - Browse repository at this point
Copy the full SHA f48b74fView commit details
Commits on Jun 28, 2023
-
Configuration menu - View commit details
-
Copy full SHA for f393546 - Browse repository at this point
Copy the full SHA f393546View commit details -
remove for loops in embedding loss computation
as these loop could break gradient flow and to optimize the code
Configuration menu - View commit details
-
Copy full SHA for 5718593 - Browse repository at this point
Copy the full SHA 5718593View commit details
Commits on Jul 3, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 8036572 - Browse repository at this point
Copy the full SHA 8036572View commit details
Commits on Jul 4, 2023
-
Configuration menu - View commit details
-
Copy full SHA for aa36d7b - Browse repository at this point
Copy the full SHA aa36d7bView commit details -
for now do the trick only for the diarization subtask
Configuration menu - View commit details
-
Copy full SHA for 6617c9c - Browse repository at this point
Copy the full SHA 6617c9cView commit details
Commits on Jul 5, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 60d5543 - Browse repository at this point
Copy the full SHA 60d5543View commit details
Commits on Jul 6, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 2834d3e - Browse repository at this point
Copy the full SHA 2834d3eView commit details
Commits on Jul 9, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 35be745 - Browse repository at this point
Copy the full SHA 35be745View commit details
Commits on Jul 11, 2023
-
fix size issue in
collate_y
when building embedding refThere was an issue when the number of speakers in a chunk was greater than the maximum number per chunk set for the task.
Configuration menu - View commit details
-
Copy full SHA for 5628b48 - Browse repository at this point
Copy the full SHA 5628b48View commit details
Commits on Jul 12, 2023
-
Configuration menu - View commit details
-
Copy full SHA for c4988f4 - Browse repository at this point
Copy the full SHA c4988f4View commit details -
Configuration menu - View commit details
-
Copy full SHA for 75467f0 - Browse repository at this point
Copy the full SHA 75467f0View commit details -
Configuration menu - View commit details
-
Copy full SHA for 78b5b04 - Browse repository at this point
Copy the full SHA 78b5b04View commit details -
remove redefinitions of
collate_X
andcollate_meta
these two methods were identical to the methods inherited from the `SegmentationTaskMixin` class
Configuration menu - View commit details
-
Copy full SHA for bdf3567 - Browse repository at this point
Copy the full SHA bdf3567View commit details -
add missing
dia_loss
assignmentand fix issue with the loss type during training
Configuration menu - View commit details
-
Copy full SHA for aae90a0 - Browse repository at this point
Copy the full SHA aae90a0View commit details
Commits on Jul 18, 2023
-
Configuration menu - View commit details
-
Copy full SHA for d3b3efc - Browse repository at this point
Copy the full SHA d3b3efcView commit details -
Configuration menu - View commit details
-
Copy full SHA for 4289ea9 - Browse repository at this point
Copy the full SHA 4289ea9View commit details
Commits on Jul 19, 2023
-
add V2 of
SpeakerEndToEndDiarization
this version replace `StatsPool` by a concatenation of the last outputs of TDNN (for the embedding part) and LSTM (for the diarization part) and a LSTM layer
7Configuration menu - View commit details
-
Copy full SHA for e9f40a3 - Browse repository at this point
Copy the full SHA e9f40a3View commit details
Commits on Jul 26, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 3f7cb8a - Browse repository at this point
Copy the full SHA 3f7cb8aView commit details -
update LSTM encoder in SPEED V2
Now, this LSTM is bidirectionnal and has a hidden size of 1500, so the outputs shape of this encoder is (b, s, 1500*2). This will allow comparing with `StatsPool` version of the SPEED model
Configuration menu - View commit details
-
Copy full SHA for 0f1577d - Browse repository at this point
Copy the full SHA 0f1577dView commit details
Commits on Oct 13, 2023
-
add
prepare_data
method inTask
classThe goal of this method is to generate the data needed by the task and save it on disk for future uses, for example by the `setup` method. The objective is to avoid systematically recreating data on each process at the beginning of a training
clement-pages committedOct 13, 2023 Configuration menu - View commit details
-
Copy full SHA for 933a660 - Browse repository at this point
Copy the full SHA 933a660View commit details
Commits on Oct 26, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 5257145 - Browse repository at this point
Copy the full SHA 5257145View commit details
Commits on Nov 2, 2023
-
modify organisation of
pyannote
segmentation tasksNow all the segmentations tasks in `pyannote` inherit the `SegmentationTask` (previously `SegmentationTaskMixin`), which inherits the `Task` class. This commit also adds a `prepared_data` attribute to the `Task` class. That attribute is a dict which contains all the prepared data by the `prepare_data` method.
clement-pages committedNov 2, 2023 Configuration menu - View commit details
-
Copy full SHA for 8829574 - Browse repository at this point
Copy the full SHA 8829574View commit details -
Merge branch 'feat/data_preparation' of github.com:clement-pages/pyan…
…note-audio into feat/data_preparation
clement-pages committedNov 2, 2023 Configuration menu - View commit details
-
Copy full SHA for fa63c8a - Browse repository at this point
Copy the full SHA fa63c8aView commit details
Commits on Nov 7, 2023
-
One for the test of the `MultiLabelSegmentation` task, and the other for the test of the `SupervisedRepresentationLearningWithArcFace` task.
clement-pages committedNov 7, 2023 Configuration menu - View commit details
-
Copy full SHA for be6f7ec - Browse repository at this point
Copy the full SHA be6f7ecView commit details -
assign data directly to task in main process, in
prepare_data
This eliminates the need to reload pickle data in setup when in the main process
clement-pages committedNov 7, 2023 Configuration menu - View commit details
-
Copy full SHA for f447bb6 - Browse repository at this point
Copy the full SHA f447bb6View commit details -
Configuration menu - View commit details
-
Copy full SHA for 930deda - Browse repository at this point
Copy the full SHA 930dedaView commit details
Commits on Nov 8, 2023
-
handle call to
Task.prepare_data
andTask.setup
under different s……cenarios
clement-pages committedNov 8, 2023 Configuration menu - View commit details
-
Copy full SHA for 05ccc30 - Browse repository at this point
Copy the full SHA 05ccc30View commit details -
Merge branch 'feat/data_preparation' of github.com:clement-pages/pyan…
…note-audio into feat/data_preparation
clement-pages committedNov 8, 2023 Configuration menu - View commit details
-
Copy full SHA for 44a01fe - Browse repository at this point
Copy the full SHA 44a01feView commit details
Commits on Nov 9, 2023
-
add training tests using task caches
clement-pages committedNov 9, 2023 Configuration menu - View commit details
-
Copy full SHA for 4b8e8a2 - Browse repository at this point
Copy the full SHA 4b8e8a2View commit details -
update
cache_path
type and docstringsclement-pages committedNov 9, 2023 Configuration menu - View commit details
-
Copy full SHA for 45918bd - Browse repository at this point
Copy the full SHA 45918bdView commit details -
fix
classes
variable used before assigmentThis issue occured when a list of classes was specified during `MultiLabelSegmentation` instanciation.
clement-pages committedNov 9, 2023 Configuration menu - View commit details
-
Copy full SHA for 980414e - Browse repository at this point
Copy the full SHA 980414eView commit details
Commits on Nov 14, 2023
-
Configuration menu - View commit details
-
Copy full SHA for a9ea07f - Browse repository at this point
Copy the full SHA a9ea07fView commit details
Commits on Nov 15, 2023
-
Merge branch 'feat/joint-diarization-and-embedding' into feat/joint-d…
…iarization-and-embedding-with-prepared-data
clement-pages committedNov 15, 2023 Configuration menu - View commit details
-
Copy full SHA for 51a36f9 - Browse repository at this point
Copy the full SHA 51a36f9View commit details -
fix: fix residual merge problems
clement-pages committedNov 15, 2023 Configuration menu - View commit details
-
Copy full SHA for c1fbb81 - Browse repository at this point
Copy the full SHA c1fbb81View commit details
Commits on Nov 20, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 797a8a4 - Browse repository at this point
Copy the full SHA 797a8a4View commit details -
Merge branch 'pyannote:develop' into feat/joint-diarization-and-embed…
…ding-with-prepared-data
Configuration menu - View commit details
-
Copy full SHA for a17c2d0 - Browse repository at this point
Copy the full SHA a17c2d0View commit details
Commits on Nov 21, 2023
-
clement-pages committed
Nov 21, 2023 Configuration menu - View commit details
-
Copy full SHA for 987e702 - Browse repository at this point
Copy the full SHA 987e702View commit details
Commits on Nov 27, 2023
-
improve: use
numpy
method for w/r task cache insteadpickle
(#1)* use npz archive instead pickle to save task data * improve code readability * improve(task): update numpy array dtypes In order to use types whose size better machtes the contents of the arrays * remove `end` entry from `annotated_regions` numpy array This entry was redundant with the start and duration entries, since `end` = `start` + `duration`. * fix: allow data preparation to be finished when task has no validation * improve: clear data lists after assignation to `self.prepared_data` This is to avoid data redundancy in the `prepare_data` method --------- Co-authored-by: clement-pages <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 042dc43 - Browse repository at this point
Copy the full SHA 042dc43View commit details -
Configuration menu - View commit details
-
Copy full SHA for 5358986 - Browse repository at this point
Copy the full SHA 5358986View commit details
Commits on Nov 29, 2023
-
improve: remove complete redefinition of
setup
in joint taskNow the joint task uses `prepare_data` and `setup` from core `Task` and `SpeakerDiarization` task.
clement-pages committedNov 29, 2023 Configuration menu - View commit details
-
Copy full SHA for 0011870 - Browse repository at this point
Copy the full SHA 0011870View commit details -
Merge branch 'feat/joint-diarization-and-embedding-with-prepared-data…
…' of github.com:clement-pages/pyannote-audio into feat/joint-diarization-and-embedding-with-prepared-data
clement-pages committedNov 29, 2023 Configuration menu - View commit details
-
Copy full SHA for 68763dc - Browse repository at this point
Copy the full SHA 68763dcView commit details -
Merge branch 'feat/data-preparation' into feat/joint-diarization-and-…
…embedding-with-prepared-data
clement-pages committedNov 29, 2023 Configuration menu - View commit details
-
Copy full SHA for 7d78548 - Browse repository at this point
Copy the full SHA 7d78548View commit details -
improve: remove duplicated attributes in `JointSpeakerDiarizationAndE…
…mbedding`
clement-pages committedNov 29, 2023 Configuration menu - View commit details
-
Copy full SHA for 6e6b62d - Browse repository at this point
Copy the full SHA 6e6b62dView commit details -
update: replace old
Task
attributes with prepared_data in joint taskclement-pages committedNov 29, 2023 Configuration menu - View commit details
-
Copy full SHA for e60873c - Browse repository at this point
Copy the full SHA e60873cView commit details -
improve: handle multi-speaker embeddings in
example_output
clement-pages committedNov 29, 2023 Configuration menu - View commit details
-
Copy full SHA for 40cc903 - Browse repository at this point
Copy the full SHA 40cc903View commit details
Commits on Nov 30, 2023
-
feat: add new end-to-end model for joint speaker diarization and embe…
…ddins This new model is based on a `WeSpeakerResnet34` for the speaker embeddings extraction part, and on `PyanNet` for (local) segmentation.
clement-pages committedNov 30, 2023 Configuration menu - View commit details
-
Copy full SHA for 30ae9fb - Browse repository at this point
Copy the full SHA 30ae9fbView commit details -
fix: fix empty dict issue for
metadata_unique_values
in `prepared_d……ata`
clement-pages committedNov 30, 2023 Configuration menu - View commit details
-
Copy full SHA for 72f9916 - Browse repository at this point
Copy the full SHA 72f9916View commit details -
improve: add dynamic typing for np array in
prepare_data
clement-pages committedNov 30, 2023 Configuration menu - View commit details
-
Copy full SHA for ecd2cb4 - Browse repository at this point
Copy the full SHA ecd2cb4View commit details -
Merge branch 'feat/data-preparation' into feat/joint-diarization-and-…
…embedding-with-prepared-data
clement-pages committedNov 30, 2023 Configuration menu - View commit details
-
Copy full SHA for 5e1abad - Browse repository at this point
Copy the full SHA 5e1abadView commit details
Commits on Dec 4, 2023
-
improve: check matching bewteen task current protocol and cached prot…
…ocol
clement-pages committedDec 4, 2023 Configuration menu - View commit details
-
Copy full SHA for fb6d540 - Browse repository at this point
Copy the full SHA fb6d540View commit details -
remove: remove unused argument
stage
inTask.setup
clement-pages committedDec 4, 2023 Configuration menu - View commit details
-
Copy full SHA for 3810308 - Browse repository at this point
Copy the full SHA 3810308View commit details -
Merge branch 'feat/data-preparation' into feat/joint-diarization-and-…
…embedding-with-prepared-data
clement-pages committedDec 4, 2023 Configuration menu - View commit details
-
Copy full SHA for f916db5 - Browse repository at this point
Copy the full SHA f916db5View commit details
Commits on Dec 8, 2023
-
update: change name of attribute
database_ratio
todia_task_rate
clement-pages committedDec 8, 2023 Configuration menu - View commit details
-
Copy full SHA for e7da160 - Browse repository at this point
Copy the full SHA e7da160View commit details -
wip: attempt to fix issues encountered during training
clement-pages committedDec 8, 2023 Configuration menu - View commit details
-
Copy full SHA for 77ac89f - Browse repository at this point
Copy the full SHA 77ac89fView commit details -
update: use all the
pyannet
pretrained modelclement-pages committedDec 8, 2023 Configuration menu - View commit details
-
Copy full SHA for ea6d06d - Browse repository at this point
Copy the full SHA ea6d06dView commit details -
fix: fix diarization loss calculation condition in
training_step
clement-pages committedDec 8, 2023 Configuration menu - View commit details
-
Copy full SHA for 185798d - Browse repository at this point
Copy the full SHA 185798dView commit details
Commits on May 14, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 3fef4f5 - Browse repository at this point
Copy the full SHA 3fef4f5View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9d13697 - Browse repository at this point
Copy the full SHA 9d13697View commit details -
update the way batches are generated in the joint task
Now, the first `num_dia_samples` samples in a batch are dedicated to the diarization substak, and the remaining sample are for the embedding subtask
Configuration menu - View commit details
-
Copy full SHA for 6c67fc6 - Browse repository at this point
Copy the full SHA 6c67fc6View commit details -
Configuration menu - View commit details
-
Copy full SHA for 519db89 - Browse repository at this point
Copy the full SHA 519db89View commit details -
Configuration menu - View commit details
-
Copy full SHA for 106bfc5 - Browse repository at this point
Copy the full SHA 106bfc5View commit details
Commits on May 15, 2024
-
Configuration menu - View commit details
-
Copy full SHA for d3326b1 - Browse repository at this point
Copy the full SHA d3326b1View commit details
Commits on May 27, 2024
-
fix(task): fiw wrong call to
receptive_field
inprepare_chunk
clement-pages committedMay 27, 2024 Configuration menu - View commit details
-
Copy full SHA for a36420d - Browse repository at this point
Copy the full SHA a36420dView commit details -
Merge branch 'develop' into feat/joint-diarization-and-embedding-with…
…-prepared-data
clement-pages committedMay 27, 2024 Configuration menu - View commit details
-
Copy full SHA for 101f1d3 - Browse repository at this point
Copy the full SHA 101f1d3View commit details
Commits on May 28, 2024
-
update(joint task): filter out inactive speaker embeddings from loss …
…computation
clement-pages committedMay 28, 2024 Configuration menu - View commit details
-
Copy full SHA for 62fad78 - Browse repository at this point
Copy the full SHA 62fad78View commit details
Commits on Jun 21, 2024
-
allow to only compute mean or std in
StatsPool
clement-pages committedJun 21, 2024 Configuration menu - View commit details
-
Copy full SHA for 8349818 - Browse repository at this point
Copy the full SHA 8349818View commit details -
update diarization + embeddings joint task
clement-pages committedJun 21, 2024 Configuration menu - View commit details
-
Copy full SHA for 0858227 - Browse repository at this point
Copy the full SHA 0858227View commit details -
clement-pages committed
Jun 21, 2024 Configuration menu - View commit details
-
Copy full SHA for ad9e435 - Browse repository at this point
Copy the full SHA ad9e435View commit details -
Configuration menu - View commit details
-
Copy full SHA for aeb147f - Browse repository at this point
Copy the full SHA aeb147fView commit details
Commits on Jul 1, 2024
-
Merge branch 'pyannote:develop' into feat/joint-diarization-and-embed…
…ding-with-prepared-data
Configuration menu - View commit details
-
Copy full SHA for f484033 - Browse repository at this point
Copy the full SHA f484033View commit details
Commits on Jul 8, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 8608a1c - Browse repository at this point
Copy the full SHA 8608a1cView commit details
Commits on Oct 18, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 1132cfc - Browse repository at this point
Copy the full SHA 1132cfcView commit details -
Merge branch 'develop' into feat/joint-diarization-and-embedding-with…
…-prepared-data
clement-pages committedOct 18, 2024 Configuration menu - View commit details
-
Copy full SHA for 446c17c - Browse repository at this point
Copy the full SHA 446c17cView commit details -
Merge branch 'feat/joint-diarization-and-embedding-with-prepared-data…
…' of https://github.com/clement-pages/pyannote-audio into feat/joint-diarization-and-embedding-with-prepared-data
clement-pages committedOct 18, 2024 Configuration menu - View commit details
-
Copy full SHA for e6a00b9 - Browse repository at this point
Copy the full SHA e6a00b9View commit details
Commits on Oct 25, 2024
-
clement-pages committed
Oct 25, 2024 Configuration menu - View commit details
-
Copy full SHA for b91df8c - Browse repository at this point
Copy the full SHA b91df8cView commit details -
clean validation pipeline code
clement-pages committedOct 25, 2024 Configuration menu - View commit details
-
Copy full SHA for 5e54108 - Browse repository at this point
Copy the full SHA 5e54108View commit details
Commits on Oct 28, 2024
-
handle overlaped segmentation chunks corner case
clement-pages committedOct 28, 2024 Configuration menu - View commit details
-
Copy full SHA for 9b8e509 - Browse repository at this point
Copy the full SHA 9b8e509View commit details -
clement-pages committed
Oct 28, 2024 Configuration menu - View commit details
-
Copy full SHA for 7708935 - Browse repository at this point
Copy the full SHA 7708935View commit details