Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Big File compare slowly #42

Open
xidianwlc opened this issue Mar 19, 2020 · 1 comment
Open

Big File compare slowly #42

xidianwlc opened this issue Mar 19, 2020 · 1 comment

Comments

@xidianwlc
Copy link

when a file is 1TB then csvdiff was slow and cpu used too much

you can use binary diff algorighm

if xxHash(src multilines) == xxHash(dst multilines) then
continue
else
compare line by line

@aswinkarthik
Copy link
Owner

Will you be able to share some stats about this:

  1. How big was base and delta file?
  2. Did it finish? If so, how much time did it take?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants