Switch from arrow2 to arrow-rs since arrow2 has been deprecated #68
Comments
Thanks for raising this. I looked into it a while back, and determined that it was going to be a nontrivial amount of work for no immediate gain. I guess we'll eventually have to switch, but I'm just not very motivated to slog through it at the moment. Maybe the first step would be to note the salient differences between the two libraries. Help would be welcome there.
I took an initial crack at it. Here are notable differences so far:
I haven't done much, but what I've done so far might help illustrate some of these: ef50bf2
@NickCondron Thanks for the head start! I have this working now in the arrow-rs2 branch. It relies on a hack to work around a bug in arrow-rs; I'm planning to wait until this PR lands so I can remove the hack. There are also a couple of performance regressions that need investigation:
My initial guess on 2 is that we might be unnecessarily storing some all-1 validity bitmaps, but I haven't figured out how to check that yet. No idea on 1 yet, and I don't know when I'll have the time for that kind of deep dive. Help would be very welcome!
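For anyone wanting to test the all-1 bitmap guess, here is a minimal sketch in plain Rust of what the check amounts to. It is not taken from either branch: `is_all_valid` is a hypothetical helper, and it works on a raw byte slice rather than on arrow-rs types. The only thing it assumes about Arrow is the format's bit order for validity bitmaps (least significant bit first within each byte, 1 meaning valid).

```rust
/// Hypothetical helper (not part of arrow-rs or this project).
/// Returns true if the first `len` bits of `bitmap` are all set, i.e. the
/// bitmap marks every slot as valid and therefore carries no information.
/// Bits are read LSB-first within each byte, as in the Arrow format.
fn is_all_valid(bitmap: &[u8], len: usize) -> bool {
    let full_bytes = len / 8;
    let rem_bits = len % 8;

    // A buffer too short to cover `len` bits cannot be all-valid.
    let needed = full_bytes + usize::from(rem_bits > 0);
    if bitmap.len() < needed {
        return false;
    }

    // Every fully used byte must have all eight bits set.
    if bitmap[..full_bytes].iter().any(|&b| b != 0xFF) {
        return false;
    }

    // Only the low `rem_bits` bits of the last byte count; padding is ignored.
    if rem_bits > 0 {
        let mask = (1u8 << rem_bits) - 1;
        if (bitmap[full_bytes] & mask) != mask {
            return false;
        }
    }
    true
}
```

If a column's bitmap passes this check, dropping the bitmap entirely would describe the same data, so counting how many stored columns pass it would confirm or rule out the guess.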
Here are my (similar) performance numbers for my x86 linux machine:
The surprising thing is the dramatic increase in the skip_frames tests, which shouldn't use anything arrow-related. If you can think of anything that could have caused this, let me know. I will continue to investigate.
I'm less concerned about |
jorgecarleitao/arrow2#1429