Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

PCA as post-processing #94

Open
AlekseySh opened this issue Aug 14, 2022 · 0 comments
Open

PCA as post-processing #94

AlekseySh opened this issue Aug 14, 2022 · 0 comments
Labels

Comments

@AlekseySh
Copy link
Contributor

AlekseySh commented Aug 14, 2022

  1. Make an example on mock dataset and put it in the Docs here. It should look like this:
embeddings = inference(model, val, batch_size=4, num_workers=0)
rr = RetrievalResults.from_embeddings(embeddings, val, n_items=3)

embeddings_upd = PCA()(embeddings)
rr_upd = RetrievalResults.from_embeddings(embeddings_upd, val, n_items=3)

# after that we compare results visually and/or by metrics
  1. Check if it boosts metrics on one of our benchmarks. Check how much the retrieval time is improved.

  2. Write a shor report below the example on the same Doc's page.

PS. OML already has a PCA implementation, so, no need to bring an extra requirement.

@AlekseySh AlekseySh removed the good first issue Good for newcomers label Nov 20, 2022
@AlekseySh AlekseySh added the good first issue Good for newcomers label May 31, 2023
@AlekseySh AlekseySh changed the title PCA as a part of PostProcessor PCA as post-processing Jun 7, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
Status: To do
Development

No branches or pull requests

1 participant