Double DrJIT Loop #1120

Answered by njroussel
Angom8 asked this question in Q&A
Mar 28, 2024 · 1 comments · 1 reply

Hi @Angom8

I think I understood what you're doing.

The high memory usage is somewhat expected, I believe. With this approach you effectively need to write pos to global memory, which requires N_rays * 3 * 4 bytes of storage (three 4-byte values per ray). In general, you want to avoid storing anything that scales with your number of rays (that's one of the goals of megakernels). You should be able to write this without any scatter to generate pos.
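
To make the N_rays * 3 * 4 figure concrete, here is a rough back-of-the-envelope sketch in plain Python. It assumes pos holds three 4-byte values per ray, as in the formula above; the function name and the ray counts are hypothetical and only for illustration, not taken from the original question.

```python
def pos_buffer_bytes(n_rays: int, components: int = 3, bytes_per_value: int = 4) -> int:
    """Bytes needed to store one `components`-wide vector per ray in global memory."""
    return n_rays * components * bytes_per_value


# Hypothetical ray counts, chosen only to show how the buffer scales linearly.
for n_rays in (1_000_000, 100_000_000, 1_000_000_000):
    size = pos_buffer_bytes(n_rays)
    print(f"{n_rays:>13,} rays -> {size / 2**30:.2f} GiB")
```

The point is that the buffer grows linearly with the ray count, which is why keeping pos inside the kernel instead of scattering it to memory should help.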

I'm confused about the passes here. Are you not updating the positions between them? If so, why are they run separately? I don't think this matters in any case, but I think I'm misunderstanding your explanation.

Fundamentally, I don't think there is any reason why you sh…

Answer selected by njroussel