-
Notifications
You must be signed in to change notification settings - Fork 177
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Permit multiple input files on the accelsearch command line #120
base: master
Are you sure you want to change the base?
Conversation
traversing $PATH and $LD_LIBRARY_PATH each time a new search starts
Thanks for the pull request! I like the idea. I'm not sure I like the implementation with "--", though. Especially as it breaks the automatic creation of all the accelsearch_clig files. Is it possible to change the way that it works so that it simply checks to see if there is more than one input file, like was done in #117 ? I haven't had a chance to fully review that PR yet, but I will almost certainly merge it (and it has exactly the same goal as yours). |
This also applies to PR #117 I just pushed up changes to the command line parsing for realfft, accelsearch, and zapbirds (which all likely need to be called many times on clusters) to allow up to 16384 input files. I did this using the "clig" software that auto-generates the command line interfaces. This should make using multiple files a lot better and more consistent. The way it would work is like the following: After the clig parser is called:
If you can please modify your PR so that it handles the new changes, I'll happily merge it. Thank you! |
One other comment: Is the fork() really necessary? If you have all the input filenames, why not simply iterate over them with a for loop within the original process? |
Thank you for mentioning #117 , which I should certainly have read earlier. Yes, I will make them consistent tomorrow since it's midnight here already. My apologies for the fork(2) and for breaking the generation of accelsearch_clig ... Since my job is at the data center and unrelated to compact object astrophysics, this patch was an emergency fix for someone who congested our storage cluster. Had got only a few minutes to go over the presto code, and was unsure if there is no static variable and a loop is sufficient without fork-ing. So my first attempt was to fork(2) to make it safe, ... and on Linux that doesn't add much cost anyway. |
No apologies necessary! I really appreciate you making the PR and making PRESTO better to use for others. Thank you so much! I'll verify about fork(), but I'm pretty sure that simply looping over calls to "main()" for each file should be completely fine. |
So I just checked and it isn't quite a simple as iterating over main() as create_accelobs() will need a slight modification as well since it gets sent the full parsed command line. I'm happy to help test any changes you make. And if I weren't swamped, I would happily make all the changes myself. If this is really a problem for you to make the changes, I can eventually make the changes myself. But it might take a while. |
I have corrected the clig thing and also got rid of a call to system(3). However, a closer look showed many calls to exit(3) at various levels of function calls. More complex/dangerous than I thought yesterday to loop without fork-ing. Would you like to re-consider the loop-and-fork approach? With command line parsing outside the loop, of course. Fork-ing may have a negative impact on performance and resource utilization if the program should support MS Windows (but should be ok for the Windows subsystem for Linux 2). |
…void" This conflicts with the clig command line parser / doc generator files, and also needs further optimization on command line parsing. This also missed a call to system(3). Leaving it there partly defeated the purpose of this commit, which is to reduce the number of shell executions and associated searched over $PATH and $LD_LIBRARY_PATH. This reverts commit f025633.
Just checking about the intent of the fork() idea... Is your intent to run all of the accelsearches in parallel with the fork calls? Because that was more than I was thinking. I was simply thinking about running them in serial, all within the original process. |
No. In serial, but exit(3), malloc without free, non-stateless code like global variables and static local variables cannot possibly mess up subsequent accelsearches, and the parent process is free to decide if it will proceed to the next one when the previous accelsearch fails (i.e., exited with error status). |
I see. OK. That helps. And I agree that that does seem potentially helpful. I'll take a look at your PR as well as the possibility to avoid fork() and still get the same benefits. |
Or probably later when I make another commit with correct clig changes, just to save your time? BTW, Writing code that loops for long (or forever) without randomly crashing may be non-trivial according to my experience on writing online data reduction programs for high frequency radio astronomy. So from my side, at least, fork-ing to isolate code with casual error checks is not more guilty than asking you to work over the weekend :-) |
Fair enough! And after you explained your reasoning for forking, I think I might be coming around to your way of thinking. |
Catch up with original repo
Updated to a fork-ing approach that conforms to the original command line format. This reduces ~50% of the metadata server load on a cluster. Will submit another PR later to get rid of the other ~50% load. |
pressure on the clustered filesystem's metadata server.
Hi Scott, Sorry for coming back late, but I have heavy technical work to do, and playing with pulsars is only my hobby. I have included a bit more changes with this pull request to reduce filesystem access further, eliminating the need to congest the storage cluster when doing acceleration searches on many short observations. Now this PR is complete, I guess. Please review. Best regards, |
Thanks, Azumi! This has been a crazy semester, but I completely plan to handle this PR (perhaps with small changes) over the next month or so. I really appreciate your contribution! |
Permit multiple input files on the command line in order to avoid traversing $PATH and $LD_LIBRARY_PATH each time a new accelsearch starts.
Those who reduce pulsar short observations on large computer clusters restart accelsearch every few milliseconds, traversing their $PATH and $LD_LIBRARY_PATH on the shared filesystem each time, causing massive slowdown of the whole cluster.