knn search #351

fendoukobe opened this issue Jun 29, 2021 · 12 comments
@fendoukobe

""" GET my-knn-index-1/_search
{
"size": 2,
"query": {
"knn": {
"my_vector2": {
"vector": [2, 3, 5, 6],
"k": 2
}
}
}
}
k is the number of neighbors the search of each graph will return. You must also include the size option. This option indicates how many results the query actually returns. The plugin returns k amount of results for each shard (and each segment) and size amount of results for the entire query. The plugin supports a maximum k value of 10,000."""

Hi, I want to know what the k in this passage means. Also, what value should I set for k in development: 10, 100, or something else?

@neo-anderson
Copy link

It depends on your use case. By setting k and size to 10, for example, you get the closest 10 results for your query (aka the top 10 neighbors of the vector in your query).
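For instance, reusing the index and field names from the snippet above, a top-10 query would look something like this (a sketch, not tested against a live cluster):

GET my-knn-index-1/_search
{
  "size": 10,
  "query": {
    "knn": {
      "my_vector2": {
        "vector": [2, 3, 5, 6],
        "k": 10
      }
    }
  }
}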

@fendoukobe
Author

Thanks, I get it.

The k-NN index has millions of docs, and k-NN search is fast,
but when the count reaches tens of millions it becomes very slow, more than 20 seconds.
How do I optimize it?
I want to warm up the index.
How do I calculate the memory required?

@fendoukobe
Author

And how do I cancel the index warmup?
Is the validity period related to the setting "knn.cache.item.expiry.minutes": "10m"?
Thanks.
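For reference, the Open Distro k-NN docs describe a warmup API that loads an index's graphs into the cache ahead of time; a minimal sketch, reusing the index name from the first comment:

GET /_opendistro/_knn/warmup/my-knn-index-1?pretty

There does not appear to be an explicit cancel operation; with knn.cache.item.expiry.enabled set to true, warmed-up graphs should simply be evicted after sitting idle longer than knn.cache.item.expiry.minutes.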

@fendoukobe
Author

Hi,
I see that k-NN has a branch for faiss support.
Does this branch query faster and use less memory?
How do I use it in my Elasticsearch cluster?

@neo-anderson

> The k-NN index has millions of docs, and k-NN search is fast,
> but when the count reaches tens of millions it becomes very slow, more than 20 seconds.

Check whether your query is a brute-force script query or approximate k-NN: https://opendistro.github.io/for-elasticsearch-docs/docs/knn/
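The two look quite different in the request body. Approximate k-NN uses the knn query type from the first comment, while the brute-force approach uses a script_score query with the knn_score script; a sketch of the latter based on the linked docs (the space_type value here is an assumption):

GET my-knn-index-1/_search
{
  "size": 2,
  "query": {
    "script_score": {
      "query": { "match_all": {} },
      "script": {
        "source": "knn_score",
        "lang": "knn",
        "params": {
          "field": "my_vector2",
          "query_value": [2, 3, 5, 6],
          "space_type": "l2"
        }
      }
    }
  }
}

The script variant scores every document matched by the inner query, so it gets slow on large indices; the knn query searches the HNSW graphs and stays fast at scale.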

@fendoukobe
Author

Approximate k-NN Search

@jmazanec15
Member

Hi @fendoukobe

Here is how we calculate memory: https://opendistro.github.io/for-elasticsearch-docs/docs/knn/performance-tuning/#estimating-memory-usage.
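As a rough worked example of that formula (the linked docs estimate about 1.1 * (4 * dimension + 8 * M) bytes per vector for the HNSW graphs; M = 16 is assumed here as a typical value):

1.1 * (4 * 1024 + 8 * 16) ≈ 4,646 bytes per 1024-dimensional vector
4,646 bytes * 20,000,000 vectors ≈ 93 GB of native memory

So tens of millions of high-dimensional vectors can need on the order of 100 GB of graph memory across the cluster, beyond the JVM heap.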

After the slow query, could you paste the knn stats?
GET /_opendistro/_knn/stats?pretty

With regards to faiss support, we are actively working on it here. I am working on an RFC and will post soon. We want to support faiss's product quantization in order to reduce memory consumption. The branch on this repo is a development branch and should not be used in production. It only includes faiss's HNSW implementation, which should not have significant performance differences compared to nmslib.

@fendoukobe
Author

> Hi @fendoukobe
>
> Here is how we calculate memory: https://opendistro.github.io/for-elasticsearch-docs/docs/knn/performance-tuning/#estimating-memory-usage
>
> After the slow query, could you paste the knn stats?
> GET /_opendistro/_knn/stats?pretty
>
> With regards to faiss support, we are actively working on it here. I am working on an RFC and will post soon. We want to support faiss's product quantization in order to reduce memory consumption. The branch on this repo is a development branch and should not be used in production. It only includes faiss's HNSW implementation, which should not have significant performance differences compared to nmslib.

Sorry, I cannot provide the data right now, because the production environment is somewhere else.
But I can confirm that the memory footprint ratio is no more than 20%.
Thank you.

@jmazanec15
Member

@fendoukobe I see. What is the dimension of your vectors? Also, how many nodes are you running, and what type of machines are you using?

@fendoukobe
Author

The dimension is 1024.
There are three nodes.
Each node has 512 GB of memory and two physical CPUs with 16 cores per CPU.
Each server is split into two virtual nodes that share memory and CPU.
Hot nodes use solid-state drives; warm nodes use mechanical drives.

My configuration is as follows

PUT /_cluster/settings
{
  "persistent": {
    "knn.cache.item.expiry.enabled": true,
    "knn.cache.item.expiry.minutes": "10m",
    "knn.memory.circuit_breaker.limit": "60%",
    "knn.circuit_breaker.unset.percentage": 90,
    "knn.algo_param.index_thread_qty": 32
  }
}

@jmazanec15
Member

One potential way to speed up is to not return the vector field in your query and only return the document id (if your use case lets you). This can be done by adding the query parameter ?_source_excludes=my_vector2.
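A sketch of the earlier query with the vector field excluded from the response (note that older Elasticsearch versions spell the parameter _source_exclude rather than _source_excludes):

GET my-knn-index-1/_search?_source_excludes=my_vector2
{
  "size": 2,
  "query": {
    "knn": {
      "my_vector2": {
        "vector": [2, 3, 5, 6],
        "k": 2
      }
    }
  }
}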

Can you provide the query you are using in the case of high latency?

@zxbing

zxbing commented Apr 15, 2022

> One potential way to speed up is to not return the vector field in your query and only return the document id (if your use case lets you). This can be done by adding the query parameter ?_source_excludes=my_vector2.
>
> Can you provide the query you are using in the case of high latency?

Good idea.
