Why can't use SAM encoder to get extracted feature? #5

Open
ruizhaoz opened this issue Aug 10, 2023 · 2 comments

Comments

@ruizhaoz

Have you tried using the SAM encoder directly to extract features, instead of using another pretrained model?

@yangliu96
Collaborator

yangliu96 commented Aug 11, 2023

Features extracted with SAM achieve only around 20 mIoU on fold 0 of COCO-20i. The SAM encoder has weak semantics and performs poorly in complex scenes. Here are two reasons for this:

  1. Poor feature matching: SAM's features fail to match multiple instances with similar semantics in complex scenes.
  2. Poor semantic guidance: SAM cannot provide effective semantic guidance for ILM (Instance-Level Matching) to select high-quality mask proposals.
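
To make the feature-matching point concrete, here is a minimal sketch of one common way to match reference features against target patches with cosine similarity. It is not this repository's implementation: `match_patches`, the toy 2-D vectors, and the 0.5 threshold are all assumptions for illustration. If the encoder's features are not semantically discriminative, the scores for foreground and background patches end up close together and the resulting prior is unreliable.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm > 0 else 0.0

def match_patches(ref_feats, ref_mask, tgt_feats):
    """For each target patch, return its best cosine similarity to any
    reference patch inside the mask (higher = more likely foreground)."""
    fg = [f for f, m in zip(ref_feats, ref_mask) if m]
    if not fg:
        raise ValueError("reference mask selects no patches")
    return [max(cosine(t, f) for f in fg) for t in tgt_feats]

# Toy 2-D "features" (hypothetical values, not real encoder output).
ref_feats = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]]
ref_mask = [1, 1, 0]                    # first two reference patches are the object
tgt_feats = [[1.0, 0.05], [0.0, 1.0]]   # one object-like patch, one background-like

scores = match_patches(ref_feats, ref_mask, tgt_feats)
prior = [s > 0.5 for s in scores]       # 0.5 is an arbitrary illustrative threshold
print(scores, prior)
```

With well-separated features like these, the first target patch is marked as foreground and the second is not.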

@fjchange

DINOv2 is very good at instance retrieval and dense matching. SAM's backbone is pretrained with MAE, whose features are not as discriminative.
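
One rough way to check how discriminative a set of features is: compare the mean cosine similarity of same-class pairs against that of different-class pairs. The sketch below is only an illustration under stated assumptions. `separation_gap` is a hypothetical helper, and the `sharp` and `blurry` vectors are made-up toy values, not measurements from DINOv2 or SAM.

```python
import math
from itertools import combinations

def cosine(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm > 0 else 0.0

def separation_gap(feats, labels):
    """Mean same-class similarity minus mean different-class similarity.
    Assumes at least one same-class pair and one different-class pair."""
    same, diff = [], []
    for (fa, la), (fb, lb) in combinations(zip(feats, labels), 2):
        (same if la == lb else diff).append(cosine(fa, fb))
    return sum(same) / len(same) - sum(diff) / len(diff)

labels = ["cat", "cat", "dog", "dog"]
# Hypothetical toy features: classes clearly separated vs. nearly overlapping.
sharp = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
blurry = [[1.0, 0.8], [0.9, 0.9], [0.8, 1.0], [0.9, 1.0]]

print(separation_gap(sharp, labels))   # large gap: classes are easy to tell apart
print(separation_gap(blurry, labels))  # gap near zero: classes look alike
```

A larger gap means features of the same class cluster together and away from other classes, which is what instance retrieval and dense matching rely on.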

3 participants