Add support for directly running on segmentation on video files. #46

rolson24 · 2024-07-30T22:57:43Z

This PR adds support for running segmentation directly on video files instead of individual image files. It uses torchvision's built-in VideoReader object and only adds the dependancy of PyAV which is are the python bindings for ffmpeg. Alternatively, users could compile torchvision from source with the video_reader backend if they didn't want to install PyAV. I think this could really improve the easy of building demos for SAM-2 if this gets added because then the entire video doesn't have to be extracted first and then read into RAM.

I will do some more rigorous testing to make sure it doesn't affect the expected behavior, but it seems to be working for now.

facebook-github-bot · 2024-07-30T22:57:48Z

Hi @rolson24!

Thank you for your pull request and welcome to our community.

Action Required

In order to merge any pull request (code, docs, etc.), we require contributors to sign our Contributor License Agreement, and we don't seem to have one on file for you.

Process

In order for us to review and merge your suggested changes, please sign at https://code.facebook.com/cla. If you are contributing on behalf of someone else (eg your employer), the individual CLA may not be sufficient and your employer may need to sign the corporate CLA.

Once the CLA is signed, our tooling will perform checks and validations. Afterwards, the pull request will be tagged with CLA signed. The tagging process may take up to 1 hour after signing. Please give it that time before contacting us about it.

If you have received this in error or have any questions, please contact us at [email protected]. Thanks!

facebook-github-bot · 2024-07-30T23:23:52Z

Thank you for signing our Contributor License Agreement. We can now accept your code for this (and any) Meta Open Source project. Thanks!

jordan-barrett-jm · 2024-08-01T00:32:22Z

Hey @rolson24 this is super interesting! I've actually been attempting to run video segmentation using longer videos (5+ minutes) and I've been running into memory allocation errors. Have you tested longer videos using this method?

rolson24 · 2024-08-01T00:58:49Z

@jordan-barrett-jm
I have not fully tested it yet, but I'm going to right now. I will let you know if it works. Also I think the Ultralytics team is working hard to integrate SAM-2.0 into their library with online video segmentation, but its not quite ready yet.

Also I have a colab notebook that demonstrates this change that is based on the Roboflow one here

jordan-barrett-jm · 2024-08-01T01:18:31Z

Thanks! One solution I've found in the interim is mini batching the images

MattLiutt · 2024-08-01T01:48:02Z

Thanks for the great work! Just curious, it seems like we cannot still add new point during inference right? What I mean is sort of real-time tracking.

rolson24 · 2024-08-01T02:31:34Z

I think you could add a new point by just using the add new point function in the for loop when you want to add a new prompt. I have not tested that theory though. Also if you are still running out of GPU memory, try to use the ‘offload_state_to_cpu’ parameter when you initialize the state so the states get stored on the system ram.

…

On Wed, Jul 31, 2024 at 9:58 PM MattLiutt ***@***.***> wrote: Thanks for the great work! Just curious, it seems like we cannot still add new point during inference right? What I mean is sort of real-time tracking. — Reply to this email directly, view it on GitHub <#46 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AX2EJPFSXGYVERO4KVUEGKLZPGHWPAVCNFSM6AAAAABLXJKLQ2VHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDENRRG44DQNRZGM> . You are receiving this because you were mentioned.Message ID: ***@***.***>

MattLiutt · 2024-08-01T09:27:35Z

I've tested and checked the code, once the inference started, it prohibited from adding new points.

Bhumika28661773 · 2024-08-01T13:40:33Z

@rolson24 how do i run and test it and how can i get to know the label assigned to each object in the video?

… segmentation on video files.

rolson24 and others added 5 commits July 30, 2024 18:29

Add support for mp4 video files with torchvision VideoReader

231111d

fix video resolution

dbdc3e0

fix image resize

5319d77

test fixing seeking issue

575fec2

remove print statements

104c368

facebook-github-bot added the cla signed label Jul 30, 2024

rolson24 mentioned this pull request Jul 31, 2024

How can we perform segmentation in real-time? #60

Open

rolson24 added 2 commits July 31, 2024 21:03

add support for m4v

19e09e9

Update sam2_video_predictor.py

517b71a

rolson24 added 3 commits July 31, 2024 21:25

add support for m4v

e0a4dfa

remove m4v

78f7416

remove m4v

4538bfe

heyoeyo mentioned this pull request Aug 26, 2024

SAM2 for segmenting a 2 hour video? #264

Open

dcnieho added a commit to dcnieho/segment-anything-2 that referenced this pull request Sep 27, 2024

Pull Request facebookresearch#46: Add support for directly running on…

612061f

… segmentation on video files.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for directly running on segmentation on video files. #46

Add support for directly running on segmentation on video files. #46

rolson24 commented Jul 30, 2024

facebook-github-bot commented Jul 30, 2024

facebook-github-bot commented Jul 30, 2024

jordan-barrett-jm commented Aug 1, 2024

rolson24 commented Aug 1, 2024

jordan-barrett-jm commented Aug 1, 2024

MattLiutt commented Aug 1, 2024

rolson24 commented Aug 1, 2024 via email

MattLiutt commented Aug 1, 2024

Bhumika28661773 commented Aug 1, 2024

Add support for directly running on segmentation on video files. #46

Are you sure you want to change the base?

Add support for directly running on segmentation on video files. #46

Conversation

rolson24 commented Jul 30, 2024

facebook-github-bot commented Jul 30, 2024

Action Required

Process

facebook-github-bot commented Jul 30, 2024

jordan-barrett-jm commented Aug 1, 2024

rolson24 commented Aug 1, 2024

jordan-barrett-jm commented Aug 1, 2024

MattLiutt commented Aug 1, 2024

rolson24 commented Aug 1, 2024 via email

MattLiutt commented Aug 1, 2024

Bhumika28661773 commented Aug 1, 2024