Test "python3 train.py --model='WaveNet' ",get exception "Conv2DCustomBackpropInputOp only supports NHWC." #140
Comments
I have the same issue when I try to run this on CPU. @DayanJ According to your log, the device placement is CPU, and I guess the CPU version of the Op supports only NHWC order. If you're going to use your GPU, you should fix whatever prevents TensorFlow from placing your Op on the GPU, probably by uninstalling the CPU version of TensorFlow. If you're going to use the CPU, I guess you can fastfix this by reordering before this Op. Upd: tried the fastfix, it didn't work. I am a TensorFlow noob, don't believe me.
Hello @DayanJ, as suggested by @gloriouskilka, please make sure you only have the tensorflow-gpu version installed. This is most likely a bug that occurs when you try to run WaveNet on CPU.
Hi @gloriouskilka, I am DayanJ, this is my new account. I didn't install tf gpu before.
@Hayes515 Hi! My fastfix is a bad idea, just a proof of concept. You should switch to an Nvidia GPU if you have one, because otherwise you will be training your network on CPU until the end of days, I think. Usually people install both tensorflow and tensorflow-gpu, and sometimes the CPU version of tensorflow prevents the GPU from being used, so the main advice is: uninstall tensorflow, install only tensorflow-gpu.
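To check whether both packages are present before uninstalling anything, a minimal sketch along these lines may help. It assumes Python 3.8+ (for `importlib.metadata`); the helper name `installed_tf_distributions` is made up for illustration and is not part of TensorFlow or this repository. On older interpreters, `pip list` shows the same information.

```python
from importlib import metadata  # standard library, Python 3.8+


def installed_tf_distributions(names=("tensorflow", "tensorflow-gpu")):
    """Return {distribution name: version} for the names that are installed.

    Hypothetical helper for illustration only.
    """
    found = {}
    for name in names:
        try:
            found[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            # This distribution is not installed in the current environment.
            pass
    return found


found = installed_tf_distributions()
if len(found) > 1:
    # Both the CPU and GPU packages are installed side by side.
    print("Both packages are installed:", found)
else:
    print("Installed TensorFlow packages:", found)
```

If both names show up, that matches the situation described above, where the CPU package can end up shadowing the GPU one.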
@gloriouskilka Hi! You are right, I have switched to an Nvidia GPU, but it took me two days to finish. I installed some packages with Anaconda3 in a new environment, T3. This way is convenient. The condition of my GPU is below. Thank you!
@Hayes515 Yay! You're welcome! I guess we can close this issue, because it already contains all possible solutions with nice screenshots.
One last thing before closing this, @Hayes515: you may want to keep your 2nd GPU free, as it is holding the model graph for no particular reason. To do this, please add os.environ["CUDA_VISIBLE_DEVICES"] = "0" in the following location: Lines 36 to 37 in e244457
That will prevent the run from seeing your 2nd GPU. It seems your graphic display is handled by it, so there you go :) Naturally, if you want to make multiple runs in parallel you can follow my comment here. Feel free to close the issue if no other problems are related to it. Thanks for using our work ;)
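For reference, a minimal sketch of the suggestion above. The exact placement in the repository is the one linked in the comment; here it is only assumed that the variable is set before TensorFlow is imported, since CUDA typically reads it once at initialization.

```python
import os

# Expose only the first GPU (index 0) to this process.
# Assumption: this runs before `import tensorflow`, so that CUDA has not
# been initialized yet when the variable is set.
os.environ["CUDA_VISIBLE_DEVICES"] = "0"

# Any GPU-using import would follow from here, e.g.:
# import tensorflow as tf

print("Visible GPUs:", os.environ["CUDA_VISIBLE_DEVICES"])
```

Setting the value to "1" instead would expose only the second GPU, which is one way to run a second training job in parallel on a two-GPU machine.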
I needed to run the model on CPU for a testing purpose (because a machine with GPU is currently occupied by another variation of this model) so I would be glad if it could run on CPU. It looks like the "channel" part of the transposed convolution input is temporarily inserted here: Tacotron-2/wavenet_vocoder/models/wavenet.py Lines 467 to 475 in d13dbba
And here: Tacotron-2/wavenet_vocoder/models/wavenet.py Lines 549 to 554 in d13dbba
I guess that this issue, the restriction of the CPU implementation of I'm not entirely sure because this is based on the assumption that these are the only instances where
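To make the layout question concrete, here is a shape-only sketch of the two data formats involved. It is not the repository's code: the example shape and the helper `permute_shape` are assumptions for illustration. NCHW keeps the channel axis right after the batch axis, while NHWC, the only layout the error message says is supported, keeps it last.

```python
# Axis permutations between the two layouts. The same tuples are what a
# transpose call would take as its `perm` argument.
NCHW_TO_NHWC = (0, 2, 3, 1)
NHWC_TO_NCHW = (0, 3, 1, 2)


def permute_shape(shape, perm):
    """Return the shape a tensor of `shape` has after transposing by `perm`.

    Hypothetical helper; operates on plain tuples only.
    """
    if sorted(perm) != list(range(len(shape))):
        raise ValueError("perm must be a permutation of the axis indices")
    return tuple(shape[axis] for axis in perm)


# Assumed example: batch of 8, one inserted channel, 80 features, 100 steps.
nchw_shape = (8, 1, 80, 100)
nhwc_shape = permute_shape(nchw_shape, NCHW_TO_NHWC)
print(nchw_shape, "->", nhwc_shape)
```

Whether changing the position of the inserted channel axis is enough to make the model run on CPU is not confirmed anywhere in this thread.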
Hi guys, I'm trying to get WaveNet training working and I keep getting this problem. I can't find where it comes from and don't know how to fix it. Is there an update on this issue? I only have tensorflow-gpu installed. Tacotron training went through without any problems. Exiting due to exception: Conv2DCustomBackpropInputOp only supports NHWC. Caused by op 'WaveNet_model/optimizer_1/gradients/WaveNet_model/inference/final_convolution_2/final_convolution_2_1/final_convolution_2/conv1d/Conv2D_grad/Conv2DBackpropInput', defined at: ...which was originally created as op 'WaveNet_model/inference/final_convolution_2/final_convolution_2_1/final_convolution_2/conv1d/Conv2D', defined at:
I used LJSpeech-1.1 data to test
1. After running 'python3 wavenet_preprocess.py', I get these files.
2. I modified 'hparams.py' and set "train_with_GTA" to False.
3. After running "python3 train.py --model='WaveNet'", I got these errors.
My tensorflow version is 1.7.1 and I can't fix this error.