-
Notifications
You must be signed in to change notification settings - Fork 74
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Driver Bug (Intel, NVIDIA): segfault when building too many programs at once #110
Comments
Our crack team of valgrinders is on the case... |
If you get a minute could you:
I'm unable to reproduce and I want to see if that segfaults on your system. |
Update: After finally getting the Rust-ocl version to segfault consistently on Intel drivers (but not AMD), I created a an equivalent C++ version, opencl-debug-build-cpp, which also segfaults under the same conditions. This is definitely an Intel driver issue (possibly NVIDIA too) and not specific to this library. I'll keep this issue open for now until I have time to file bug reports where appropriate. Since this is problem is restricted to unrealistic use cases such as valgrind and perhaps benchmarks, I don't consider it worth the expense of adding synchronization mechanisms as a work around at present. It's also unclear whether the size or complexity of the program source code or whether or not the programs are identical are factors. |
Thanks for the heads up, work, and keeping us posted. Appreciate it. |
For Intel CPUs this bug was fixed in https://github.com/intel/compute-runtime (replacement for Beignet) |
I had a construction like
due to me benchmarking a simple prototype function. This crashed when looped too often.
When ran outside valgrind, the program often worked, but within valgrind, I have consitent crashes.
For future reference, you can find this "bad" code on https://gitlab.com/rubdos/multicore-ocl-project, branch
segfault
.When digging deeper, some of the
ocl
unit tests also seem to be affected:these tests all crash when ran within valgrind, ending in a segmentation fault.
The text was updated successfully, but these errors were encountered: