I get this error when I start training #22

Open
kuuga314 opened this issue Oct 23, 2022 · 1 comment

Comments

@kuuga314

FP16 Run: False
Dynamic Loss Scaling: True
Distributed Run: False
cuDNN Enabled: True
cuDNN Benchmark: False
Warm starting model from checkpoint 'tacotron2_statedict.pt'
0%
0/614 [00:13<?, ?it/s]

Starting Epoch: 0 Iteration: 0
0%
0/1 [00:13<?, ?it/s]

RuntimeError Traceback (most recent call last)
/usr/local/lib/python3.7/dist-packages/torch/utils/data/dataloader.py in _try_get_data(self, timeout)
1162 try:
-> 1163 data = self._data_queue.get(timeout=timeout)
1164 return (True, data)

13 frames
RuntimeError: DataLoader worker (pid 1469) is killed by signal: Killed.

The above exception was the direct cause of the following exception:

RuntimeError Traceback (most recent call last)
/usr/local/lib/python3.7/dist-packages/torch/utils/data/dataloader.py in _try_get_data(self, timeout)
1174 if len(failed_workers) > 0:
1175 pids_str = ', '.join(str(w.pid) for w in failed_workers)
-> 1176 raise RuntimeError('DataLoader worker (pid(s) {}) exited unexpectedly'.format(pids_str)) from e
1177 if isinstance(e, queue.Empty):
1178 return (False, None)

RuntimeError: DataLoader worker (pid(s) 1469) exited unexpectedly

@StarxSky
Contributor

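@kuuga314 Hello! This problem is caused by an unsuitable num_workers setting on your DataLoader, which leads to blocked threads (blocked data loading). Please try setting the DataLoader's num_workers to 0 or 1. I hope this helps!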
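
For anyone landing here with the same traceback, here is a minimal sketch of what that suggestion could look like in code. The thread does not show where this project builds its DataLoader, so `loader_kwargs` and `build_loader` are hypothetical helper names, not functions from the repository. As a side note, "killed by signal: Killed" often (though not always) means the worker process ran out of memory; `num_workers=0` loads data in the main process and so avoids starting extra worker processes.

```python
def loader_kwargs(batch_size, num_workers=0):
    """Collect keyword arguments for torch.utils.data.DataLoader.

    Hypothetical helper, not part of the project. num_workers=0 means
    batches are loaded in the main process, so no worker subprocess
    exists that could be killed.
    """
    if num_workers < 0:
        raise ValueError("num_workers must be >= 0")
    return {
        "batch_size": batch_size,
        "shuffle": True,
        "num_workers": num_workers,
    }


def build_loader(dataset, batch_size, num_workers=0):
    """Create a DataLoader with a conservative worker count.

    torch is imported lazily so the helper above can be used
    (and inspected) without PyTorch installed.
    """
    from torch.utils.data import DataLoader

    return DataLoader(dataset, **loader_kwargs(batch_size, num_workers))
```

If training works with `num_workers=0`, trying `num_workers=1` afterwards would show whether a single worker process still fits in the available memory.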
