Segfault CUDA 12.2 #4

sshleifer · 2024-03-19T20:31:53Z

Which versions of pytorch/triton/hardware do you run this on?

Traceback

tests/test_mlp.py Fatal Python error: Segmentation fault

Thread 0x00007f23c08b5640 (most recent call first):
  <no Python frame>

...

Thread 0x00007f23c28b9640 (most recent call first):
  <no Python frame>

My Env

I have CUDA 12.1, H100

triton==2.1.0+git17d633a64
torch==2.0.1+gite9ebda2

What I ran:

git clone git@github.com:shawntan/scattermoe.git
pip install -e .
CUDA_LAUNCH_BLOCKING=1 pytest tests/  --maxfail=1

findmyway · 2024-03-23T07:14:56Z

I use nvcr.io/nvidia/pytorch:23.10-py3

torch: 2.1.0a0+32f93b1
triton: 2.2.0
CUDA: 12.2
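
For anyone else comparing environments in this thread, here is a minimal sketch for collecting the installed package versions in the same `name==version` form used above. It is not part of scattermoe; the helper name `report_versions` is made up for illustration, and it only reads installed package metadata, so it works even when importing torch or triton itself crashes.

```python
from importlib import metadata


def report_versions(packages=("torch", "triton")):
    """Return {package: version string, or None if not installed}."""
    versions = {}
    for name in packages:
        try:
            # Reads the installed distribution's metadata; does not import the package.
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


if __name__ == "__main__":
    for name, version in report_versions().items():
        shown = version if version is not None else "not installed"
        print(f"{name}=={shown}")
```

The CUDA version that PyTorch was built against is not in the package metadata; if torch imports cleanly, `torch.version.cuda` reports it.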
