This repository has been archived by the owner on Dec 14, 2023. It is now read-only.

duplicate document in solr index #721

Open
hroberts opened this issue Jul 27, 2020 · 0 comments

Comments

@hroberts
Contributor

The Solr index seems to have duplicate documents in it. Those duplicates are filtered out when document lists are returned, but they are still included when results are merely counted. For the time being, we are using the hll() json.facet function to estimate the number of unique stories_ids within results, but at some point we should figure out how the duplicates are getting into the index even though we are importing everything with overwrite=true.
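For anyone reproducing the workaround, here is a minimal sketch of how the count request might be built and read. It assumes the id field is named `stories_id`, that the facet is labeled `unique_stories`, and that the response follows the usual JSON Facet layout; the helper names and the sample response numbers are made up for illustration and are not taken from our code or from real data.

```python
import json
from urllib.parse import urlencode


def build_count_params(query, id_field="stories_id"):
    """Build Solr query params that ask for a count only (rows=0) plus an
    hll() estimate of distinct ids. The field name is an assumption."""
    facet = {"unique_stories": f"hll({id_field})"}
    return {
        "q": query,
        "rows": 0,  # no documents needed, only counts
        "wt": "json",
        "json.facet": json.dumps(facet),
    }


def parse_counts(solr_response):
    """Return (raw numFound, estimated unique ids) from a parsed response."""
    raw = solr_response["response"]["numFound"]
    estimate = solr_response.get("facets", {}).get("unique_stories", 0)
    return raw, estimate


params = build_count_params("title:election")
query_string = urlencode(params)  # append to the collection's /select URL

# Hypothetical response with invented numbers, only to show the shape.
sample = {
    "response": {"numFound": 105, "docs": []},
    "facets": {"count": 105, "unique_stories": 100},
}
raw, estimate = parse_counts(sample)
duplicate_estimate = raw - estimate  # rough size of the duplicate problem
```

Comparing `numFound` with the `hll()` estimate for the same query gives a rough measure of how many duplicates a result set contains. Since `hll()` is approximate, a small difference does not prove duplicates exist; `unique()` gives exact counts on smaller result sets at higher cost.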
