Segfault CUDA 12.2 #4

Open
sshleifer opened this issue Mar 19, 2024 · 1 comment


@sshleifer

Which versions of pytorch/triton/hardware do you run this on?

Traceback

tests/test_mlp.py Fatal Python error: Segmentation fault

Thread 0x00007f23c08b5640 (most recent call first):
  <no Python frame>

...

Thread 0x00007f23c28b9640 (most recent call first):
  <no Python frame>

My Env

I have CUDA 12.1, H100

triton==2.1.0+git17d633a64
torch==2.0.1+gite9ebda2

What I ran:

git clone git@github.com:shawntan/scattermoe.git
pip install -e .
CUDA_LAUNCH_BLOCKING=1 pytest tests/  --maxfail=1
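
Since the traceback shows no Python frame, one way to narrow this down might be to run each test file in its own subprocess and check whether the process was killed by a signal. The sketch below is only an illustration: the helper names `run_and_report` and `find_crashing_files` are hypothetical and not part of scattermoe, and it assumes a POSIX system, where `subprocess` reports death by signal as a negative return code.

```python
import pathlib
import signal
import subprocess
import sys


def run_and_report(cmd):
    """Run cmd and return (returncode, signal_name), signal_name is None on a normal exit."""
    proc = subprocess.run(cmd)
    if proc.returncode < 0:
        # On POSIX a negative return code means the child was killed by that signal.
        return proc.returncode, signal.Signals(-proc.returncode).name
    return proc.returncode, None


def find_crashing_files(test_dir="tests"):
    """Run pytest once per test file (hypothetical helper) and list files killed by a signal."""
    crashed = []
    for path in sorted(pathlib.Path(test_dir).glob("test_*.py")):
        cmd = [sys.executable, "-m", "pytest", str(path), "--maxfail=1"]
        _, sig = run_and_report(cmd)
        if sig is not None:
            crashed.append((str(path), sig))
    return crashed
```

Calling `find_crashing_files("tests")` from the repository root would then return pairs such as a file path and `SIGSEGV` for any file whose run was killed, which should help confirm whether `tests/test_mlp.py` is the only one affected.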
@findmyway

I use nvcr.io/nvidia/pytorch:23.10-py3

torch: 2.1.0a0+32f93b1
triton: 2.2.0
CUDA: 12.2
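
For anyone else comparing environments in this thread, the same fields can be collected with a short script. This is a minimal sketch, not something from the repo: `collect_env` is a hypothetical helper name, and it only reports what the installed package metadata and `torch` expose. PyTorch also ships `python -m torch.utils.collect_env`, which prints a more complete report.

```python
from importlib import metadata


def collect_env(packages=("torch", "triton")):
    """Return installed package versions plus CUDA/GPU info when torch can report it."""
    info = {}
    for name in packages:
        try:
            info[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            info[name] = "not installed"
    try:
        import torch

        # torch.version.cuda is None for CPU-only builds.
        info["cuda"] = torch.version.cuda or "none (CPU build)"
        if torch.cuda.is_available():
            info["gpu"] = torch.cuda.get_device_name(0)
        else:
            info["gpu"] = "no CUDA device"
    except Exception:
        # torch is missing or failed to import; report that instead of crashing.
        info["cuda"] = "unknown (torch not importable)"
        info["gpu"] = "unknown (torch not importable)"
    return info


if __name__ == "__main__":
    for key, value in collect_env().items():
        print(f"{key}: {value}")
```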
