Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Documentation requestion: SearchRankParameters #5

Open
chris-martin opened this issue Dec 13, 2020 · 1 comment
Open

Documentation requestion: SearchRankParameters #5

chris-martin opened this issue Dec 13, 2020 · 1 comment

Comments

@chris-martin
Copy link
Contributor

chris-martin commented Dec 13, 2020

I've found this package pretty easy to work with so far, with the exception of SearchRankParameters. In particular, I don't understand what paramK1 and paramB mean. I see them mentioned in The Probabilistic Relevance Framework: BM25 and Beyond section 3.4.5, Document Length, and I see section 3.5, Uses of BM25, has a few relevant bits:

setting b = 1 will perform full document-length normalisation, while b = 0 will switch normalisation off

A significant number of such experiments have been done, and suggest that in general values such as 0.5 < b < 0.8 and 1.2 < k1 < 2 are reasonably good in many circumstances

But I get from this no sense of what k1 means, and little guidance as to why I would want more or less document-length normalization for a particular field.

I don't think elaborate explanations in the documentation are necessary; a few hints would likely go a very long way.

@chris-martin
Copy link
Contributor Author

@dcoutts If you might be able to say how you selected these parameter values for the demo program, any recollections would be most welcome.

paramK1 :: Float
paramK1 = 1.5
paramB :: PkgDocField -> Float
paramB NameField = 0.9
paramB SynopsisField = 0.5
paramB DescriptionField = 0.5

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant