Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Benchmark mad crash on Jetson Nano #18

Open
tpoisonooo opened this issue Mar 22, 2022 · 1 comment
Open

Benchmark mad crash on Jetson Nano #18

tpoisonooo opened this issue Mar 22, 2022 · 1 comment

Comments

@tpoisonooo
Copy link
Contributor

tpoisonooo commented Mar 22, 2022

mad_throughput crashed on Jetson Nano and throw VK_ERROR_DEVICE_LOST.

This is call chain:

mad_throughput_main.cc:189  --->   GetDeviceBufferViaStagingBuffer -->  vulkan_buffer_util.cc:67 ---->  QueueSubmitAndWait ---> crash

No nullptr or bad variable found.

I have tried to fix it by validation layer, but Jetson Nano does not support it ... 0 == layerCount

$ vulkaninfo
Instance Extensions:
====================
Instance Extensions	count = 16
	VK_KHR_device_group_creation        : extension revision  1
	VK_KHR_display                      : extension revision 23
	VK_KHR_external_fence_capabilities  : extension revision  1
	VK_KHR_external_memory_capabilities : extension revision  1
	VK_KHR_external_semaphore_capabilities: extension revision  1
	VK_KHR_get_display_properties2      : extension revision  1
	VK_KHR_get_physical_device_properties2: extension revision  2
	VK_KHR_get_surface_capabilities2    : extension revision  1
	VK_KHR_surface                      : extension revision 25
	VK_KHR_surface_protected_capabilities: extension revision  1
	VK_KHR_wayland_surface              : extension revision  6
	VK_KHR_xcb_surface                  : extension revision  6
	VK_KHR_xlib_surface                 : extension revision  6
	VK_EXT_debug_report                 : extension revision  9
	VK_EXT_debug_utils                  : extension revision  1
	VK_EXT_display_surface_counter      : extension revision  1
Layers: count = 0

this is my draft PR #17

@antiagainst
Copy link
Collaborator

VK_ERROR_DEVICE_LOST is an indication that the workload is taking too much time to complete on the GPU (given we have a weak GPU on Jetson Nano I think). You can try to reduce the amount of workload to see if it helps.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants