-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Too many file access using the Python interfacte #45
Comments
After a bit of digging in the code:
However, in contrary to the C-MPI, python wrapper opens the file on all processes for read. After a dataset is indexed, the flow is :
This might be why the I/O is overwhelmed with large number of processes. I am planning to try |
Yup, that would do it! |
One reason the python api did not use the c MPI api was that back then it
was hard to create a python surrogate for the c MPI communicator object.
Recent mpi4py offers a way ( I recall still not a public api but it is
intended to be supported) of creating a python surrogate from a c pointer,
so a rewrite based on that could be nice.
…On Fri, Aug 16, 2024 at 4:02 PM Simeon Bird ***@***.***> wrote:
Yup, that would do it!
—
Reply to this email directly, view it on GitHub
<#45 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AABBWTC6ISZWQBFVEWBCPMTZR2AJNAVCNFSM6AAAAABLRELZE2VHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDEOJUGQZDGOJVGQ>
.
You are receiving this because you were mentioned.Message ID:
***@***.***>
|
@rainwoodman
Hi Yu,
Admins on TACC have raised an issue with our code using the
bigfile
python interface. Apparently, it invokes too many queries to the filesystem, similar to your comment on README file:Not quite sure if this #43 fixes this though, since my issue is with reading not writing files.
Any starter clues, like which source file I need to dive into to fix this?
The text was updated successfully, but these errors were encountered: