-
Notifications
You must be signed in to change notification settings - Fork 189
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Running iterative alg stuck in Ubuntu system #552
Comments
There are a couple of rare issues that may be causing this, but its been hard to debug because I can't reproduce it. One thing to try: in the following function, a new geoemtry is created from the input one.
Can you try changing the code locally so it doesn't do this modification of the geoemtry? Just the copy. |
Do you mean comment this line right? I tried and failed. But I tried some Krylov subspace algorithms like CGLS and LSQR it worked, That is weired. But the OSART-TV's performence is the best... |
@stefenmax not just that line, but the few after. |
Thanks for you help. But it still didn't works. Maybe I should run it using windows. And I found that the speed is faster than linux lol |
hum... I don't really know then why. I'll keep the issue open, if you do happen to pinpoint what exactly hangs (has to be some Ax() or Atb() call somewhere) do let me know. I do suspect its |
I found that I can run the ossart algogrithm in the example.py in my linux system. So I tried replace my geometry using the head phantom and found it hangg in the tigre.Ax. That is weired cause previously I could do the Ax and FDK for my own data. Here is the example code, I don't know if you can reproduce this.
|
So it hangs in the Ax in this code? |
Certainly with different GPUs behaviour is undefined, so that would be an issue. I'll try your specific geometry. But out of curiosity, if you change the nvoxel/ndetector a bit, does it still hang? |
Do you have any recommendation on how to change the nvoxel/ndetector? |
Just give it a different value, just to see if its the specific values causing the issue. |
Yes,after change it a bit. Still hang |
Apologies, I don't seem to be able to reproduce this in any way. If you can pinpoint where the error is, do let me know. |
I have the same issue on Ubuntu. The code hangs here on my machine (I haven't stepped through the CUDA yet):
If W = Ax(
# np.ones(geox.nVoxel, dtype=np.float32), geox, self.angles, "Siddon", gpuids=self.gpuids
np.ones(geox.nVoxel, dtype=np.float32), geox, self.angles, "interpolated", gpuids=self.gpuids
) |
This is where the code hangs inside TIGRE/Common/CUDA/Siddon_projection.cu Line 519 in b8e2e95
|
Thanks @timcogan ! Its strange that means that some of the previous stuff gets into some infinite loop. Its hard to debug because its parallel code that I can't stop, but this information helps a lot actually. |
What if you set the code to only use 1 GPU? does it still hang? |
Yes, it hangs when using only 1 GPU. |
Hi, I run your code smoothly on Windows, when I transfer to linux, after compile, it could run the forward and backprojection on my data. But every time When I run OSART-TV like below, it will stuck with no response. In windows it give me response within few seconds.
algs.ossart_tv(proj, self.geo, angles, niter=1, init = init)
Thanks for your help
Specifications
conda list
The text was updated successfully, but these errors were encountered: