Replies: 1 comment 4 replies
-
Hi! It doesn't. At this point it's just an ability to read data from this format. In order to leverage its potential to be zero-copy, we'll have to replace the underlying data storage from lists to Apache Arrow vectors if I'm not mistaken. There's a big question how it should be done and what tradeoffs it implies, because Apache Arrow is off-heap memory management and will require end users to manage allocators to perform operations. edit. Actually, maybe zero-copy reading (not writing) can be done more easily, hm.. Like, if a special type of value columns backed by vectors is implemented, we can avoid allocations when reading dataframe. But every operation will still allocate lists when it needs to create a new column. How does it sound? |
Beta Was this translation helpful? Give feedback.
-
Hello,
I may be missing something, but after inspecting the code, it seems to me that the current implementation does not make use of Apache Arrow Zero Copy support when reading files. Am I mistaken? Can anyone comment?
Thank you
Beta Was this translation helpful? Give feedback.
All reactions