Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

labels for the test set #7

Open
zhiyunfan opened this issue Jul 1, 2021 · 11 comments
Open

labels for the test set #7

zhiyunfan opened this issue Jul 1, 2021 · 11 comments

Comments

@zhiyunfan
Copy link

The labels for the test set will be released to public after the VoxCeleb Speaker Recognition Challenge in October 2020. How can we download the labels for the test set?
Looking forward to your reply.

@scalfs
Copy link

scalfs commented Jul 12, 2021

I'm interested in the labels for the test set as well

@JaesungHuh
Copy link
Collaborator

The test set labels are now released.

@hbredin
Copy link

hbredin commented Jul 26, 2021

Thanks @JaesungHuh for sharing the test set labels.

For comparison with official results reported here, can you please confirm that these were computed on the subset of 232 files for which labels are available and not on the whole set of 312 files shared initially? This is important for the speaker diarization community to make sure we are not comparing apples and oranges.

cc @fnlandini @desh2608

@JaesungHuh
Copy link
Collaborator

@hbredin Thanks for the question. Yes, the released Voxconverse test set are subset of 232 files from the whole set of 312 files shared initially. We did another few rounds of check to make labels more accurate and removed some files which annotators couldn't be 100% sure of their annotation. Please use this version from now on.

@hbredin
Copy link

hbredin commented Jul 26, 2021

Thanks for clarifying.

What should we call this version in publications: VoxConverse 2021 ? VoxConverse v0.0.2?

@JaesungHuh JaesungHuh reopened this Jul 26, 2021
@JaesungHuh
Copy link
Collaborator

I'll re-open this issue for other people to see. I have to discuss co-authors about this, but I think either is fine. Will let you know if the term fixed.

@JaesungHuh
Copy link
Collaborator

We've recently released ver 0.3, fixing some of the errors in the test set labels. Please call "VoxConverse 0.3" when you use this dataset.

@hbredin
Copy link

hbredin commented Jul 22, 2022

Thanks for the heads-up @JaesungHuh.

Switching reference labels from 0.2 to 0.3 did "improve" my baseline by a whooping 2.8% (relative) in terms of speaker confusion rate. That is not negligible.

@JaesungHuh
Copy link
Collaborator

JaesungHuh commented Jul 22, 2022

Yes. We found these errors during the preparation for this year's VoxSRC workshop. I'll re-open this issue to let everyone know about this. I apologize for any inconvenience.

@JaesungHuh JaesungHuh reopened this Jul 22, 2022
@ahmadikalkhorani
Copy link

Where can I find the link to the video files?

@folalafish
Copy link

Where should I download the video file that corresponds to the audio file?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

6 participants