-
Notifications
You must be signed in to change notification settings - Fork 23
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
RuntimeError: CUDA out of memory. #8
Comments
Hello @Saharkakavand. Thank you for opening an issue and trying out EmbedSeg! |
@MLbyML , thank your for your replay, |
sorry, I I have 5 images with these shapes in data directory |
Okay, so these look like confocal volume images since the size of the z dimension appears almost the same as x and y dimensions, is that correct? |
Hello @MLbyML , thank you for your reply.
Creating branched erfnet 3d with [6, 1] classes 0%| | 0/2 [00:25<?, ?it/s] RuntimeError Traceback (most recent call last) /beegfs/desy/user/kakavs/miniconda3/envs/fiber/lib/python3.7/site-packages/EmbedSeg/test.py in begin_evaluating(test_configs, verbose, mask_region, mask_intensity, avg_bg) /beegfs/desy/user/kakavs/miniconda3/envs/fiber/lib/python3.7/site-packages/EmbedSeg/test.py in test_3d(verbose, grid_x, grid_y, grid_z, pixel_x, pixel_y, pixel_z, one_hot, mask_region, mask_intensity, avg_bg) /beegfs/desy/user/kakavs/miniconda3/envs/fiber/lib/python3.7/site-packages/torch/nn/modules/module.py in call(self, *input, **kwargs) /beegfs/desy/user/kakavs/miniconda3/envs/fiber/lib/python3.7/site-packages/torch/nn/parallel/data_parallel.py in forward(self, *inputs, **kwargs) /beegfs/desy/user/kakavs/miniconda3/envs/fiber/lib/python3.7/site-packages/torch/nn/parallel/data_parallel.py in parallel_apply(self, replicas, inputs, kwargs) /beegfs/desy/user/kakavs/miniconda3/envs/fiber/lib/python3.7/site-packages/torch/nn/parallel/parallel_apply.py in parallel_apply(modules, inputs, kwargs_tup, devices) /beegfs/desy/user/kakavs/miniconda3/envs/fiber/lib/python3.7/site-packages/torch/nn/parallel/parallel_apply.py in _worker(i, module, input, kwargs, device) /beegfs/desy/user/kakavs/miniconda3/envs/fiber/lib/python3.7/site-packages/torch/nn/modules/module.py in call(self, *input, **kwargs) /beegfs/desy/user/kakavs/miniconda3/envs/fiber/lib/python3.7/site-packages/EmbedSeg/models/BranchedERFNet_3d.py in forward(self, input, only_encode) /beegfs/desy/user/kakavs/miniconda3/envs/fiber/lib/python3.7/site-packages/EmbedSeg/models/BranchedERFNet_3d.py in (.0) /beegfs/desy/user/kakavs/miniconda3/envs/fiber/lib/python3.7/site-packages/EmbedSeg/models/erfnet_3d.py in forward(self, input) /beegfs/desy/user/kakavs/miniconda3/envs/fiber/lib/python3.7/site-packages/torch/nn/modules/module.py in call(self, *input, **kwargs) /beegfs/desy/user/kakavs/miniconda3/envs/fiber/lib/python3.7/site-packages/EmbedSeg/models/erfnet_3d.py in forward(self, input) /beegfs/desy/user/kakavs/miniconda3/envs/fiber/lib/python3.7/site-packages/torch/nn/modules/module.py in call(self, *input, **kwargs) /beegfs/desy/user/kakavs/miniconda3/envs/fiber/lib/python3.7/site-packages/torch/nn/modules/conv.py in forward(self, input) RuntimeError: cuDNN error: CUDNN_STATUS_INTERNAL_ERROR |
Hello @Saharkakavand Thanks for giving this a go. |
The |
I have 4 images, and batch size is only 1. but when I start the
begin_training(train_dataset_dict, val_dataset_dict, model_dict, loss_dict, configs)
, I have RuntimeError: CUDA out of memory. Tried to allocate 128.00 MiB (GPU 0; 31.75 GiB total capacity; 30.71 GiB already allocated; 62.50 MiB free; 12.93 MiB cached). Please let me know how can I solve it.Thanks
The text was updated successfully, but these errors were encountered: