Update cross-encoder model prefix #6686

Merged
merged 2 commits into from
Mar 15, 2024

kolchfa-aws
Collaborator

Closes #6658

Checklist

  • By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license and subject to the Developers Certificate of Origin.
    For more information on following Developer Certificate of Origin and signing off your commits, please check here.

@kolchfa-aws
Collaborator Author

@HenryL27 Could you please review this PR?

Contributor

@HenryL27 HenryL27 left a comment

Thanks! One nit, especially in case we someday add a model that doesn't start with `huggingface/cross-encoders`.

@@ -66,12 +66,12 @@ The following table provides a list of sparse encoding models and artifact links

Cross-encoder models support query reranking.

-The following table provides a list of cross-encoder models and artifact links you can use to download them.
+The following table provides a list of cross-encoder models and artifact links you can use to download them. Note that you must prefix the model name with `huggingface/cross-encoders`, as shown in the **Model name** column.
Contributor

> Note that you must prefix the model name with `huggingface/cross-encoders`, as shown in the **Model name** column.

idk if this sentence is necessary since it's documented in the table. As a user I copy/paste the model name out of the table and stick it in the pretrained model _register body, so as long as the thing in the table is correct, I'm happy.

Collaborator Author

We have the same sentence for transformer models. Mainly because when you look at the model config, there is a name field that does not include the full prefix (for example, cross-encoder/ms-marco-MiniLM-L-6-v2). So users might be confused why the name in the table is not the same as the name in the model config.
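
To make the mismatch concrete, here is a minimal sketch of how the short name from the model config relates to the prefixed name in the table. The helper names are hypothetical, and the mapping rule (keep only the last path segment and prepend the documented prefix) is an assumption drawn from the single example in this thread, not something the plugin is confirmed to do.

```python
# Prefix that the documentation table uses for cross-encoder models.
TABLE_PREFIX = "huggingface/cross-encoders"


def table_model_name(config_name: str) -> str:
    """Map a model-config name to the prefixed name shown in the table.

    Assumption: only the final path segment (the model ID) carries over.
    """
    model_id = config_name.rsplit("/", 1)[-1]
    return f"{TABLE_PREFIX}/{model_id}"


def register_name_field(config_name: str) -> dict:
    """Build only the `name` part of a register request body (hypothetical helper)."""
    return {"name": table_model_name(config_name)}


if __name__ == "__main__":
    print(register_name_field("cross-encoder/ms-marco-MiniLM-L-6-v2"))
```

A user who copies the name straight from the table never needs this mapping. It only matters when starting from the `name` field in the model config.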

@kolchfa-aws kolchfa-aws merged commit 7cac2a7 into main Mar 15, 2024
4 checks passed
@kolchfa-aws kolchfa-aws added the backport 2.12 PR: Backport label for 2.12 label Mar 15, 2024
opensearch-trigger-bot bot pushed a commit that referenced this pull request Mar 15, 2024
Signed-off-by: Fanit Kolchina <[email protected]>
(cherry picked from commit 7cac2a7)
Signed-off-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
kolchfa-aws pushed a commit that referenced this pull request Mar 15, 2024
(cherry picked from commit 7cac2a7)

Signed-off-by: Fanit Kolchina <[email protected]>
Signed-off-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
@Naarcha-AWS Naarcha-AWS deleted the cross-encoder branch March 28, 2024 23:26
Development

Successfully merging this pull request may close these issues.

[DOC] Update cross-encoder model names
3 participants