-
-
Notifications
You must be signed in to change notification settings - Fork 57
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ntmax not obeyed #340
Comments
@digitalwright I don't have the definitive answer here, but I have two suggestions. First, the core detection is automatic, and just asks a question of the machine about how many cores exist. So that doesn't in itself indicate CPU usage. Second, the reading of the alignment used ~50% of a CPU, also suggesting that IQ-TREE was obeying ntmax in this case. One thing I would try (while waiting for @bqminh or others who actually know the answer to reply - it's a busy time here at the moment) is to remove Can you try that and report back? Also if you can show the full command line, input files, and output files, that will be helpful for diagnosing the issue. |
Thanks @roblanf I will try to obtain a reproducible case and report back to you if I figure it out. It only happens with jobs running iqtree2. The grid manager seems to catch this issue stochastically, so I need to figure out exactly how they are flagging these instances. If you don't hear back from me soon, and don't think the issue stems from iqtree2, please feel free to close this issue. |
I have a reproduction of what sounds like the same issue: #312 (comment) I can definitely see something like 210% CPU in htop which should never happen if --max-threads was respected @digitalwright how are you invoking iqtree? As part of augur tree? we set -T auto by default there but you can override it by passing -T 1 or whatever you want. This is how I worked around the issue. |
@corneliusroemer that sounds like a good option to try. If I get a chance I'll try something here as well - different combinations of |
Just to share what some initial investigation revealed: I could reproduce my reproduction on current master, on 2.3.6 and 2.3.5. So it doesn't seem to be a very recent change. |
Thank you for looking into the issue, @roblanf and @corneliusroemer I can confirm that exceeding 1 CPU is reproducible, although our computer grid only catches the issue a fraction of the time. I am guessing this is because 1 CPU is exceeded for a duration shorter than the CPU polling frequency. I am invoking IQ-TREE2 on the command line with:
The two attached files are examples of culprits. Caryophyllales_at_cc5667-1.inclade1.ortho1.aln-cln.fas.gz |
Thanks, @digitalwright, @corneliusroemer, and @roblanf, for looking into the issue and providing reproducible examples. These examples and information will be very useful for fixing the issue. |
I am using IQ-TREE2 on a computer grid that monitors CPU usage. If a process exceeds the number of requested threads it will error, even if more than that number of threads are available. I am receiving errors for IQ-TREE2 (v2.3.6) exceeding the number of threads when using "-T AUTO -ntmax 1". I believe a step within IQ-TREE2 is not obeying ntmax. Looking at the output, it might be reading in the file:
The output above suggests all 8 CPUs may have been used to read the file even though ntmax was 1. Thanks in advance for taking a look at this issue.
The text was updated successfully, but these errors were encountered: