fix: Pandas parser does fail to parse integer or boolean only dataframes #1683

pierrecamilleri · 2024-09-13T13:24:24Z

Fixes #1678 (reading data from pandas returns null values).

Converting the row Series yielded by iterrows() to a dict converts np.int64 values to Python's native int type, which fixes the bug (the same applies to booleans).
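For reviewers who want to see the type difference in isolation, here is a minimal sketch. The frames and the helper name first_row_types are made up for illustration and are not taken from the parser code or the PR's tests; it only compares the types obtained by indexing the row Series with those obtained after to_dict().

```python
import numpy as np
import pandas as pd

# Hypothetical integer-only and boolean-only frames (not from the PR's tests).
int_df = pd.DataFrame({"id": [1, 2], "count": [10, 20]})
bool_df = pd.DataFrame({"flag": [True, False]})


def first_row_types(df):
    """Return (types via Series indexing, types via to_dict()) for the first row."""
    _, row = next(df.iterrows())
    from_series = {name: type(row[name]) for name in df.columns}
    from_dict = {name: type(value) for name, value in row.to_dict().items()}
    return from_series, from_dict


int_series_types, int_dict_types = first_row_types(int_df)
bool_series_types, bool_dict_types = first_row_types(bool_df)

print(int_series_types, int_dict_types)
print(bool_series_types, bool_dict_types)
```

With a homogeneous integer or boolean frame, indexing the row Series should give NumPy scalar types, while the to_dict() values should be plain int and bool.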

Adding non-regression tests

I was also concerned about the lines that follow, especially: if value is np.nan: value = None
- It was untested, so I added a test. It looks like to_dict() does not change how np.nan is converted (see side note), so I left this code unchanged.
- Primary keys are returned as int or tuple[int], so there is no np.int64 there.
- Timestamp types are kept unchanged, so the if isinstance(value, pd.Timestamp): check still applies.

Side note

np.nan behaves rather strangely with df.iterrows(): in a numeric column it is converted to float("nan"), whereas in a string column it is kept as np.nan. Calling to_dict() on the row Series does not change these types.
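A small sketch for inspecting this locally. The frame and the helper name inspect_missing are hypothetical; the exact types and identity results may depend on the pandas and NumPy versions in use, so no expected output is shown.

```python
import numpy as np
import pandas as pd

# Hypothetical frame: a numeric column and a string column, each with one missing value.
df = pd.DataFrame({"amount": [1.5, np.nan], "label": ["a", np.nan]})


def inspect_missing(df):
    """Collect (column, type name, `value is np.nan`) for each missing cell."""
    report = []
    for _, row in df.iterrows():
        for name, value in row.to_dict().items():
            if pd.isna(value):
                report.append((name, type(value).__name__, value is np.nan))
    return report


report = inspect_missing(df)
for column, type_name, is_np_nan in report:
    print(column, type_name, is_np_nan)
```

The third element of each tuple shows whether an identity check such as if value is np.nan would match that cell.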


pierrecamilleri · 2024-09-13T16:18:18Z

@roll @pdelboca ready for review!

pdelboca

Looks good! I like changes followed by tests 👍🏼

pierrecamilleri added 4 commits September 13, 2024 14:23

🔴 pandas df with integers or bool only

046f63e

🔵 tests

2bf5bfe

🟢 convert pd.Series to python dict

dac439f

This ensures resulting type is a native python type

🟢 adding test with np.nan

a80333e

Passes right away

pdelboca approved these changes Sep 13, 2024

pierrecamilleri merged commit 0e60e0b into main Sep 16, 2024
9 checks passed

pierrecamilleri deleted the fix/1678-pandas-parser-bug branch September 16, 2024 08:11
