Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix timeout issue during the sim1_postprocess_s1_e1_filter_input phase #434

Open
marekhorst opened this issue Apr 21, 2022 · 0 comments
Open

Comments

@marekhorst
Copy link
Member

Originally reported in: openaire/iis#1326

Documents similarity algorithm fails after running it on a non-deduplicated OpenAIRE Graph counting 300M of publications (deduped graph included 200M).

After in depth inspection covered by the openaire/iis#1326 (comment) it turned out we need to modify documents similarity sources by increasing allowed timeout value which should be defined in sim1-postprocess-s1-e1-filter-sims.pig PIG script.

marekhorst added a commit that referenced this issue Apr 21, 2022
…er_input phase

Setting mapreduce.task.timeout value to 7200000 in sim1-postprocess-s1-e1-filter-sims.pig.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant