Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

IR Metrics Without a Cutoff #72

Open
Hamedloghmani opened this issue May 16, 2023 · 2 comments
Open

IR Metrics Without a Cutoff #72

Hamedloghmani opened this issue May 16, 2023 · 2 comments
Assignees
Labels
literature-review Summary of the paper related to the work

Comments

@Hamedloghmani
Copy link
Member

Regarding the usage of IR utility metrics without a cutoff, the summary of answers that I have found is as follows:

When information retrieval (IR) utility metrics are used without a cutoff, it typically means that the evaluation considers the entire ranked list of retrieved documents, without truncating or limiting the evaluation to a specific number of documents.
In IR evaluation, a cutoff is often used to determine the depth or position in the ranked list up to which the evaluation will be performed. For example, a cutoff at position 10 means that only the top 10 documents in the ranked list will be considered for evaluation.
However, when utility metrics are used without a cutoff, the evaluation takes into account the relevance or utility of documents across the entire ranked list, giving equal weight to all positions. This means that the metric considers the quality of the ranking across all the retrieved documents, rather than focusing on a specific subset.
Using utility metrics without a cutoff provides a more comprehensive assessment of the ranking algorithm or system by considering the relevance or utility of documents throughout the entire result set. It helps evaluate the overall effectiveness of the ranking in providing high-quality and relevant results, without emphasizing a particular cutoff point or subset of documents.

@Hamedloghmani Hamedloghmani added the literature-review Summary of the paper related to the work label May 16, 2023
@Hamedloghmani Hamedloghmani self-assigned this May 16, 2023
@hosseinfani
Copy link
Member

@Hamedloghmani
thanks for the update.
this was out initial thought. Are you sure when there is no cutoff, the while list is considered? like recall becomes 1 always?

Can you put some citations or links to support your findings?

@Hamedloghmani
Copy link
Member Author

@hosseinfani
Thank you for your feedback. Sure, some of the resources I have used are as follows:

  • In FA*IR: A Fair Top-k Ranking Algorithm, Section 5.3, they formulated the NDCG without cutoff from 1 to k and indicated k is the size of the query.
  • In libraries that provide these metrics such as scikit-learn etc. not having cutoff leads to considering the whole query
  • Manning, C. D., Raghavan, P., & Schütze, H. (2008). Introduction to Information Retrieval. Cambridge University Press.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
literature-review Summary of the paper related to the work
Projects
None yet
Development

No branches or pull requests

2 participants