Add debug information for knn queries #2289
Comments
@navneet1v I looked at https://opensearch.org/docs/latest/api-reference/explain/. From what I understand, it explains how the score was calculated. That can be extremely useful, but the ask here is more about showing users the code execution flow through logs (mostly debug level), along with timing/profiling information that won't be in the query breakdown. Putting a lot of information in the breakdown can be overwhelming, to the point of creating ambiguity for anyone who does not know the internals. Let me know if I am missing something about explain.
So if we end up doing 2. then this can be an extension of #2286. Otherwise it is not, and can be treated independently.
I think in that case I would recommend having one GitHub issue rather than two. What we want, at the end of the day, is a mechanism to profile the vector query, which is exactly what the first GH issue, #1985, was talking about. I think we should collate all these ideas in one place so that when we implement the feature we have all the discussions together. I leave it up to you how you want to do this.
Is your feature request related to a problem?
With complex query execution paths for KNN queries, it's difficult to figure out the query execution flow, which makes the query hard to debug. There are limited debug logs in KNNQuery; I would like more useful debug logs so that I can understand the query execution and debug it.
I want meaningful logs that help me understand:
This should be added without impacting latencies.
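One common way to keep debug instrumentation off the hot path is to check the log level before doing any timing or message formatting. The sketch below illustrates that pattern only. It is written in Python for brevity and is not code from the k-NN plugin; `timed_stage`, `CollectingHandler`, the stage name, and the logger name are all hypothetical.

```python
import logging
import time
from typing import Callable, List, TypeVar

T = TypeVar("T")


def timed_stage(logger: logging.Logger, stage: str, fn: Callable[[], T]) -> T:
    """Run fn and log its elapsed time, but only when DEBUG is enabled."""
    if not logger.isEnabledFor(logging.DEBUG):
        # Fast path: no clock reads and no string formatting.
        return fn()
    start = time.perf_counter_ns()
    try:
        return fn()
    finally:
        elapsed_us = (time.perf_counter_ns() - start) / 1_000
        # Lazy %-style arguments: formatted only when a record is emitted.
        logger.debug("stage=%s took_us=%.1f", stage, elapsed_us)


class CollectingHandler(logging.Handler):
    """Hypothetical helper that keeps formatted messages in memory."""

    def __init__(self) -> None:
        super().__init__()
        self.messages: List[str] = []

    def emit(self, record: logging.LogRecord) -> None:
        self.messages.append(record.getMessage())


if __name__ == "__main__":
    log = logging.getLogger("knn.query.sketch")  # hypothetical logger name
    handler = CollectingHandler()
    log.addHandler(handler)
    log.setLevel(logging.DEBUG)
    timed_stage(log, "exact_search", lambda: sum(range(1_000)))
    print(handler.messages)
```

With the logger above DEBUG, the wrapper costs one level check per stage. Whether that overhead is acceptable for a real query path would still need to be confirmed by benchmarking.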
What solution would you like?
There are a couple of existing solutions that can be leveraged for this:
Both solutions can be leveraged. With either one, benchmarking should be done to make sure latencies aren't impacted.