Possible to add sparse elements? #39

rob-p · 2022-03-25T16:55:56Z

Thanks for the very nice library! I'm interested in using hora for doing nearest neighbor finding in single-cell genomics. The data of interest consist of very high dimensional points (D = 30,000), but for most points, most dimensions have value 0. Therefore, I'd like to avoid (it's not really feasible) to densify the elements before indexing them. Is there some way to provide a custom implementation of the relevant distance metrics for the indexed type such that I don't have to actually insert a dense representation of the points into the index?

kacperlukawski · 2022-07-22T13:09:39Z

The project seems not to be maintained anymore, but since we're doing something similar at Qdrant (https://github.com/qdrant/qdrant), I think I may answer that question. Those tools are rather designed to support neural embeddings and they typically won't be sparse.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Possible to add sparse elements? #39

Possible to add sparse elements? #39

rob-p commented Mar 25, 2022

kacperlukawski commented Jul 22, 2022

Possible to add sparse elements? #39

Possible to add sparse elements? #39

Comments

rob-p commented Mar 25, 2022

kacperlukawski commented Jul 22, 2022