Generated file contents inconsistent with the original ones #81

Open
SEICS opened this issue Apr 22, 2022 · 2 comments

SEICS commented Apr 22, 2022

Hi,

Thanks for the great work! The original branch had a lot of problems to solve when building on macOS Monterey :).

As the title says, the generated files have inconsistent contents compared to the original ones. For example, customer.tbl generated with this branch's dbgen has only 14999981 rows, while the original customer.tbl has 15000000 rows:


My customer.tbl:

Screenshot 2022-04-22 at 06 03 31

The original customer.tbl:

Screenshot 2022-04-22 at 06 04 36


It seems that the generation of the .tbl files does not complete as expected, even though the command line reports that it is done, as shown below:


Screenshot 2022-04-22 at 06 16 13


Similar behaviour also occurs for date.tbl and supplier.tbl. I haven't finished generating lineorder.tbl because it is a big file, so I don't know whether it is affected too. Only part.tbl matches the original part.tbl.

In addition, some data values in date.tbl do not align with that in the original date.tbl:


My date.tbl:

Screenshot 2022-04-22 at 06 14 37

The original date.tbl:

Screenshot 2022-04-22 at 06 14 53


I don't know whether this problem is specific to my setup or affects everyone. Thanks for helping!
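
The two observations above (a differing row count and differing values) could also be checked in text form rather than with screenshots. The following is a minimal Python sketch, assuming both .tbl files are plain line-oriented text; the function names are hypothetical and not part of dbgen.

```python
from itertools import zip_longest


def count_rows(path):
    """Count the rows (lines) of a file by streaming it in binary mode."""
    with open(path, "rb") as f:
        return sum(1 for _ in f)


def first_mismatch(path_a, path_b):
    """Return (line_number, row_a, row_b) for the first differing row, or None.

    If one file is shorter, the missing row is reported as None.
    """
    with open(path_a, "rb") as fa, open(path_b, "rb") as fb:
        for lineno, (row_a, row_b) in enumerate(zip_longest(fa, fb), start=1):
            if row_a != row_b:
                return lineno, row_a, row_b
    return None


def compare_tables(path_a, path_b):
    """Summarise the row counts and the first differing row of two .tbl files."""
    return {
        "rows_a": count_rows(path_a),
        "rows_b": count_rows(path_b),
        "first_mismatch": first_mismatch(path_a, path_b),
    }
```

For example, `compare_tables("customer.tbl", "original/customer.tbl")` (both paths are placeholders) would report the two row counts and the first line at which the files diverge, which shows whether rows are missing at the end or differ earlier in the file.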

@eyalroz
Owner

eyalroz commented Apr 22, 2022

> In addition, some data values in date.tbl do not align with that in the original date.tbl

Please file a separate bug about this and I'll comment there.

@eyalroz
Owner

eyalroz commented Apr 22, 2022

> compared to the original ones.

Original in what sense? The TPC-H dbgen? The ssb dbgen as written by the paper's author? An earlier version here on GitHub?

Also, customer.tbl has 30,000 rows, not 1.5 Million.

eyalroz added the bug label Apr 22, 2022