Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix the doc field in scraper_v3 #40

Open
tarunima opened this issue Dec 16, 2021 · 1 comment
Open

Fix the doc field in scraper_v3 #40

tarunima opened this issue Dec 16, 2021 · 1 comment
Labels
good first issue Good for newcomers

Comments

@tarunima
Copy link
Contributor

Due to a bug in article parser in https://github.com/tattle-made/factchecking-sites-scraper/tree/master/scraper_v3, the doc id for multiple media items is the same. A new doc id needs to be assigned to media items that were scraped through scraper_v3.

@tarunima tarunima added the good first issue Good for newcomers label Dec 16, 2021
@tarunima
Copy link
Contributor Author

tarunima commented Feb 4, 2022

Also need to go through older data scraped using v3 and change doc_id

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
good first issue Good for newcomers
Projects
None yet
Development

No branches or pull requests

1 participant