Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Multiple captions for the same language #5

Open
jobreu opened this issue Feb 25, 2021 · 4 comments
Open

Multiple captions for the same language #5

jobreu opened this issue Feb 25, 2021 · 4 comments

Comments

@jobreu
Copy link

jobreu commented Feb 25, 2021

First of all, thank you for this great package! We just used this in a workshop on working with YouTube data and found it extremely helpful (especially since the get_captions function from the tuber package does not seem to work anymore.
While using the package in the workshop, we were wondering whether it may be possible to add an option for choosing between different caption tracks for the same language. For some videos, there are automatically generated (ASR) and manually generated caption tracks for the same language. If that is not (easily) possible, could you maybe say which track the function picks if there is more than one caption track for the same language? The most recently created/edited one or does it select manually generated captions if they exist and ASR otherwise?

@rangaro
Copy link

rangaro commented Sep 2, 2021

Is there any update on this issue? We want to mention this package in a book chapter, but as of now, we would also have to mention this limitation.

@jooyoungseo
Copy link
Owner

Sorry for my delayed response. I will investigate this issue this weekend, and will get back to you all! Thank you very much for your patience.

@jooyoungseo
Copy link
Owner

It would be greatly appreciated if either of you could provide me with a sample video URL for better reproducibility.

@rangaro
Copy link

rangaro commented Sep 3, 2021

Thanks for looking into this issue!

ID = 3TNkWTRNNYE

This is one of my own videos where YouTube automatically created subtitles (as it always does; those get the stamp "ASR"). Afterwards, I manually edited the subs and thus created another set of subtitles.

You can use the package "tuber" to get a list of the subtitle tracks: list_caption_tracks(video_id = "3TNkWTRNNYE").

If you wish, I can create an additional test set of subtitles for this video.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants