How can we perform segmentation in real-time? #60

CURRY-AND-RICE · 2024-07-31T07:44:51Z

Currently, I think we can only input video via separated frames stored in a directory.
However, for online applications, we should be able to input frames sequentially as they come in.
Are there any existing solutions to facilitate this?
Additionally, are there plans to add such functionality in the future?

Thank you for amazing work!

rolson24 · 2024-07-31T13:39:09Z

I opened a PR that can run directly on a video file without extracting and loading all of the frames into memory at once, but it doesn't support a video stream. I would most likely require a large refactor of this repository's codebase to support a video stream, but I know huggingface are working to add the model to transformers, which may be able to support running on a stream.

CURRY-AND-RICE · 2024-08-01T03:11:55Z

Thank you for notifying me of such important information!
I found an issue on hugginface for adding SAMv2 which is currently in progress.
I will continue to explore ways to achieve stream inference and will keep this issue open.

Joao-Pimenta · 2024-08-11T21:37:20Z

@CURRY-AND-RICE Did you find a good implementation?

CURRY-AND-RICE · 2024-08-13T15:48:22Z

@Joao-Pimenta
I've been unable to find an implementation that matches my needs.
Maybe this will help. #90

heyoeyo · 2024-08-13T21:33:20Z

Are there any existing solutions to facilitate this?

I have a basic example script that runs off videos (should work with webcams even), though it's not finalized and may be missing some features compared to the original video prediction implementation.

Edit: There's also now a UI version, which can also work on webcam:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How can we perform segmentation in real-time? #60

How can we perform segmentation in real-time? #60

CURRY-AND-RICE commented Jul 31, 2024

rolson24 commented Jul 31, 2024 •

edited

Loading

CURRY-AND-RICE commented Aug 1, 2024

Joao-Pimenta commented Aug 11, 2024

CURRY-AND-RICE commented Aug 13, 2024

heyoeyo commented Aug 13, 2024 •

edited

Loading

How can we perform segmentation in real-time? #60

How can we perform segmentation in real-time? #60

Comments

CURRY-AND-RICE commented Jul 31, 2024

rolson24 commented Jul 31, 2024 • edited Loading

CURRY-AND-RICE commented Aug 1, 2024

Joao-Pimenta commented Aug 11, 2024

CURRY-AND-RICE commented Aug 13, 2024

heyoeyo commented Aug 13, 2024 • edited Loading

rolson24 commented Jul 31, 2024 •

edited

Loading

heyoeyo commented Aug 13, 2024 •

edited

Loading