GPU not activating? #799

Open · anan1213095357 opened this issue Oct 23, 2024 · 8 comments
Labels: question (Further information is requested), stale (The topic has been ignored for a long time)
Comments

@anan1213095357

I have a 3060 12G. At runtime torch.cuda.is_available() returns True, but inference always feels very slow. So I changed chat.load(compile=False, device="cuda") to chat.load(compile=False, device="cpu"), and both run at the same speed. How can I fix this?
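
A quick way to rule out a broken CUDA setup before looking at ChatTTS itself is to time a raw matrix multiply on both devices. The sketch below is not from this thread; it uses only standard torch calls, and torch.cuda.synchronize() is needed because CUDA kernels launch asynchronously. If the cuda timing is not dramatically faster than cpu, the problem is the environment rather than ChatTTS.

import time
import torch

def bench(device: str, n: int = 4096, iters: int = 10) -> float:
    # Time `iters` square matmuls on the given device.
    x = torch.randn(n, n, device=device)
    x @ x  # warm-up: keeps CUDA context/kernel setup out of the timing
    if device == "cuda":
        torch.cuda.synchronize()
    t = time.time()
    for _ in range(iters):
        x @ x
    if device == "cuda":
        torch.cuda.synchronize()  # wait for the async kernels to finish
    return time.time() - t

print("cpu :", bench("cpu"))
if torch.cuda.is_available():
    print("cuda:", bench("cuda"))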

@fumiama
Member

fumiama commented Oct 30, 2024

Please post more detailed information: the startup and inference logs under both cuda and cpu.
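
As a starting point for capturing those startup logs, a sketch follows; it assumes ChatTTS.Chat accepts a standard logging.Logger, as in recent versions, which is worth verifying against your checkout.

import logging
import ChatTTS

logging.basicConfig(level=logging.DEBUG)
# Assumption: Chat accepts a standard logging.Logger; DEBUG level
# surfaces device selection and model-loading details that are
# useful in a report like this.
chat = ChatTTS.Chat(logging.getLogger("ChatTTS"))
chat.load(compile=False, device="cuda")  # then repeat with device="cpu"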

fumiama added the question (Further information is requested) label on Oct 30, 2024
github-actions bot added the stale (The topic has been ignored for a long time) label on Nov 30, 2024
@pinnnkman

Hi @fumiama, I have encountered this issue as well.
Python version: 3.11.9
Code:

import os
import time

import scipy.io.wavfile
import torch

print("PyTorch Version:", torch.__version__)
print("CUDA Available:", torch.cuda.is_available())
print("CUDA Version:", torch.version.cuda)
print("CUDA Compiled:", torch.cuda._is_compiled())
print("CUDNN Version:", torch.backends.cudnn.version())
print('Using GPU:', torch.cuda.get_device_name(0))
os.environ['TF_ENABLE_ONEDNN_OPTS'] = '0'  # silence oneDNN notices

import ChatTTS

t = time.time()
chat = ChatTTS.Chat()
# chat.load(source='local', compile=False, device=torch.device('cuda:0'))
chat.load(compile=False, device="cuda")
speaker_vector = '-4.741,0.419,-3.355,3.652,-1.682,-1.254,9.719,1.436,0.871,12.334,-0.175,-2.653,-3.132,0.525,1.573,-0.351,0.030,-3.154,0.935,-0.111,-6.306,-1.840,-0.818,9.773,-1.842,-3.433,-6.200,-4.311,1.162,1.023,11.552,2.769,-2.408,-1.494,-1.143,12.412,0.832,-1.203,5.425,-1.481,0.737,-1.487,6.381,5.821,0.599,6.186,5.379,-2.141,0.697,5.005,-4.944,0.840,-4.974,0.531,-0.679,2.237,4.360,0.438,2.029,1.647,-2.247,-1.716,6.338,1.922,0.731,-2.077,0.707,4.959,-1.969,5.641,2.392,-0.953,0.574,1.061,-9.335,0.658,-0.466,4.813,1.383,-0.907,5.417,-7.383,-3.272,-1.727,2.056,1.996,2.313,-0.492,3.373,0.844,-8.175,-0.558,0.735,-0.921,8.387,-7.800,0.775,1.629,-6.029,0.709,-2.767,-0.534,2.035,2.396,2.278,2.584,3.040,-6.845,7.649,-2.812,-1.958,8.794,2.551,3.977,0.076,-2.073,-4.160,0.806,3.798,-1.968,-4.690,5.702,-4.376,-2.396,1.368,-0.707,4.930,6.926,1.655,4.423,-1.482,-3.670,2.988,-3.296,0.767,3.306,1.623,-3.604,-2.182,-1.480,-2.661,-1.515,-2.546,3.455,-3.500,-3.163,-1.376,-12.772,1.931,4.422,6.434,-0.386,-0.704,-2.720,2.177,-0.666,12.417,4.228,0.823,-1.740,1.285,-2.173,-4.285,-6.220,2.479,3.135,-2.790,1.395,0.946,-0.052,9.148,-2.802,-5.604,-1.884,1.796,-0.391,-1.499,0.661,-2.691,0.680,0.848,3.765,0.092,7.978,3.023,2.450,-15.073,5.077,3.269,2.715,-0.862,2.187,13.048,-7.028,-1.602,-6.784,-3.143,-1.703,1.001,-2.883,0.818,-4.012,4.455,-1.545,-14.483,-1.008,-3.995,2.366,3.961,1.254,-0.458,-1.175,2.027,1.830,2.682,0.131,-1.839,-28.123,-1.482,16.475,2.328,-13.377,-0.980,9.557,0.870,-3.266,-3.214,3.577,2.059,1.676,-0.621,-6.370,-2.842,0.054,-0.059,-3.179,3.182,3.411,4.419,-1.688,-0.663,-5.189,-5.542,-1.146,2.676,2.224,-5.519,6.069,24.349,2.509,4.799,0.024,-2.849,-1.192,-16.989,1.845,6.337,-1.936,-0.585,1.691,-3.564,0.931,0.223,4.314,-2.609,0.544,-1.931,3.604,1.248,-0.852,2.991,-1.499,-3.836,1.774,-0.744,0.824,7.597,-1.538,-0.009,0.494,-2.253,-1.293,-0.475,-3.816,8.165,0.285,-3.348,3.599,-4.959,-1.498,-1.492,-0.867,0.421,-2.191,-1.627,6.027,3.667,-21.459,2.594,-2.997,5.076,0.197,-3.305,3.998,1.642,-6.221,3.177,-3.344,5.457,0.671,-2.765,-0.447,1.080,2.504,1.809,1.144,2.752,0.081,-3.700,0.215,-2.199,3.647,1.977,1.326,3.086,34.789,-1.017,-14.257,-3.121,-0.568,-0.316,11.455,0.625,-6.517,-0.244,-8.490,9.220,0.068,-2.253,-1.485,3.372,2.002,-3.357,3.394,1.879,16.467,-2.271,1.377,-0.611,-5.875,1.004,12.487,2.204,0.115,-4.908,-6.992,-1.821,0.211,0.540,1.239,-2.488,-0.411,2.132,2.130,0.984,-10.669,-7.456,0.624,-0.357,7.948,2.150,-2.052,3.772,-4.367,-11.910,-2.094,3.987,-1.565,0.618,1.152,1.308,-0.807,1.212,-4.476,0.024,-6.449,-0.236,5.085,1.265,-0.586,-2.313,3.642,-0.766,3.626,6.524,-1.686,-2.524,-0.985,-6.501,-2.558,0.487,-0.662,-1.734,0.275,-9.230,-3.785,3.031,1.264,15.340,2.094,1.997,0.408,9.130,0.578,-2.239,-1.493,11.034,2.201,6.757,3.432,-4.133,-3.668,2.099,-6.798,-0.102,2.348,6.910,17.910,-0.779,4.389,1.432,-0.649,5.115,-1.064,3.580,4.129,-4.289,-2.387,-0.327,-1.975,-0.892,5.327,-3.908,3.639,-8.247,-1.876,-10.866,2.139,-3.932,-0.031,-1.444,0.567,-5.543,-2.906,1.399,-0.107,-3.044,-4.660,-1.235,-1.011,9.577,2.294,6.615,-1.279,-2.159,-3.050,-6.493,-7.282,-8.546,5.393,2.050,10.068,3.494,8.810,2.820,3.063,0.603,1.965,2.896,-3.049,7.106,-0.224,-1.016,2.531,-0.902,1.436,-1.843,1.129,6.746,-2.184,0.801,-0.965,-7.555,-18.409,6.176,-3.706,2.261,4.158,-0.928,2.164,-3.248,-4.892,-0.008,-0.521,7.931,-10.693,4.320,-0.841,4.446,-1.591,-0.702,4.075,3.323,-3.406,-1.198,-5.518,-0.036,-2.247,-2.638,2.160,-9.644,-3.858,2.402,-2.640,1.683,-0.961,-3.076,0.226,5.106,0.712,0.669,2.539,-4.340,-0.892,0.732,0.775,-2.757,4.365,-2.368,5.368,0.342,
-0.655,0.240,0.775,3.686,-4.008,16.296,4.973,1.851,4.747,0.652,-2.117,6.470,2.189,-8.467,3.236,3.745,-1.332,3.583,-2.504,5.596,-2.440,0.995,-2.267,-3.322,3.490,1.156,1.716,0.669,-3.640,-1.709,5.055,6.265,-3.963,2.863,14.129,5.180,-3.590,0.393,0.234,-3.978,6.946,-0.521,1.925,-1.497,-0.283,0.895,-3.969,5.338,-1.808,-3.578,2.699,2.728,-0.895,-2.175,-2.717,2.574,4.571,1.131,2.187,3.620,-0.388,-3.685,0.979,2.731,-2.164,1.628,-1.006,-7.766,-11.033,-10.985,-2.413,-1.967,0.790,0.826,-1.623,-1.783,3.021,1.598,-0.931,-0.605,-1.684,1.408,-2.771,-2.354,5.564,-2.296,-4.774,-2.830,-5.149,2.731,-3.314,-1.002,3.522,3.235,-1.598,1.923,-2.755,-3.900,-3.519,-1.673,-2.049,-10.404,6.773,1.071,0.247,1.120,-0.794,2.187,-0.189,-5.591,4.361,1.772,1.067,1.895,-5.649,0.946,-2.834,-0.082,3.295,-7.659,-0.128,2.077,-1.638,0.301,-0.974,4.331,11.711,4.199,1.545,-3.236,-4.404,-1.333,0.623,1.414,-0.240,-0.816,-0.808,-1.382,0.632,-5.238,0.120,10.634,-2.026,1.702,-0.469,1.252,1.173,3.015,-8.798,1.633,-5.323,2.149,-6.481,11.635,3.072,5.642,5.252,4.702,-3.523,-0.594,4.150,1.392,0.554,-4.377,3.646,-0.884,1.468,0.779,2.372,-0.101,-5.702,0.539,-0.440,5.149,-0.011,-1.899,-1.349,-0.355,0.076,-0.100,-0.004,5.346,6.276,0.966,-3.138,-2.633,-3.124,3.606,-3.793,-3.332,2.359,-0.739,-3.301,-2.775,-0.491,3.283,-1.394,-1.883,1.203,1.097,2.233,2.170,-2.980,-15.800,-6.791,-0.175,-4.600,-3.840,-4.179,6.568,5.935,-0.431,4.623,4.601,-1.726,0.410,2.591,4.016,8.169,1.763,-3.058,-1.340,6.276,4.682,-0.089,1.301,-4.817'
 # 768-dimensional speaker embedding
speaker = torch.tensor([float(x) for x in speaker_vector.split(',')])

# Code-generation (inference) parameters
infer_code = chat.InferCodeParams()
infer_code.spk_emb = speaker
infer_code.temperature = 0.1
infer_code.top_K = 1
infer_code.top_P = 0.9
infer_code.prompt = "[speed_5]"

# Text-refinement parameters
textParams = chat.RefineTextParams()
textParams.temperature = 10
textParams.manual_seed = 2222
textParams.prompt = '[oral_2][laugh_0][break_6]'
print('load time', time.time() - t)

t = time.time()
texts = ['我找到了以下与三里屯相关的内容,']
# wavs = chat.infer(texts)
wavs = chat.infer(texts, use_decoder=False, do_text_normalization=False, skip_refine_text=True, params_infer_code=infer_code, params_refine_text=textParams)
scipy.io.wavfile.write(filename='output.wav', rate=24_000, data=wavs[0].T)  # WAV data, so use a .wav extension
print('run 1', time.time() - t)

t = time.time()
texts = ['我找到了以下与三里屯相关的内容,']
wavs = chat.infer(texts, use_decoder=False, do_text_normalization=False, skip_refine_text=True, params_infer_code=infer_code, params_refine_text=textParams)
scipy.io.wavfile.write(filename='output.wav', rate=24_000, data=wavs[0].T)
print('run 2', time.time() - t)

t = time.time()
texts = ['我找到了以下与三里屯相关的内容,']
wavs = chat.infer(texts, use_decoder=False, do_text_normalization=False, skip_refine_text=True, params_infer_code=infer_code, params_refine_text=textParams)
scipy.io.wavfile.write(filename='output.wav', rate=24_000, data=wavs[0].T)
print('run 3', time.time() - t)

t = time.time()
texts = ['我找到了以下与三里屯相关的内容,']
wavs = chat.infer(texts)  # default path: text refinement and decoder enabled
scipy.io.wavfile.write(filename='output.wav', rate=24_000, data=wavs[0].T)
print('run 4', time.time() - t)

Output:

PyTorch Version: 2.5.1+cu118
CUDA Available: True
CUDA Version: 11.8
CUDA Compiled: True
CUDNN Version: 90100
Using GPU: NVIDIA RTX A4000
load time 4.457225322723389
code:   0%|                                                                                                                                                      | 1/2048(max) [00:00,  5.09it/s]We detected that you are passing `past_key_values` as a tuple of tuples. This is deprecated and will be removed in v4.47. Please convert your cache or use an appropriate `Cache` class (https://huggingface.co/docs/transformers/kv_cache#legacy-cache-format)
code:   7%|██████████▍                                                                                                                                         | 145/2048(max) [00:03, 39.60it/s]
run 1 5.878069877624512
code:   7%|██████████▊                                                                                                                                         | 150/2048(max) [00:03, 42.24it/s]
run 2 3.6012368202209473
code:   7%|██████████▉                                                                                                                                         | 151/2048(max) [00:03, 45.57it/s]
run 3 3.3763091564178467
text:   5%|███████▍                                                                                                                                              | 19/384(max) [00:00, 39.66it/s]
code:   7%|██████████▏                                                                                                                                         | 141/2048(max) [00:03, 41.71it/s] 
run 4 3.8885786533355713

Could you please help me review this? Thanks in advance~

@fumiama
Member

fumiama commented Dec 5, 2024

Your ChatTTS version looks a bit old; please upgrade to the latest version first and try again.

@pinnnkman

@fumiama Hi, I am using code from the main branch. Should I switch branches, or install chattts via pip?

@fumiama
Member

fumiama commented Dec 5, 2024

No need. If it is the latest code, the GPU is already being used normally, and this is simply the speed you get. You can also try enabling vLLM; that feature is still experimental and has quite a few unsupported operations. See the README for details.
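
For reference, enabling it might look like the sketch below. use_vllm is the flag the README describes for the experimental backend, but since the feature is experimental the exact signature may change between versions.

import ChatTTS

chat = ChatTTS.Chat()
# Assumption: the experimental vLLM backend is toggled by the use_vllm
# flag described in the README; it still has unsupported operations,
# so keep the default backend as a fallback if this fails to load.
chat.load(compile=False, device="cuda", use_vllm=True)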

@pinnnkman

@fumiama Oh, interesting. I previously used PaddleSpeech, whose sub-second inference led me to assume something was wrong on my side. Thanks. Next I will run the CUDA build of torch with the device set to cpu to check inference speed, and then install the CPU-only torch build to confirm the issue. Many thanks!
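
(For that last check: the CPU-only build can be installed with pip install torch --index-url https://download.pytorch.org/whl/cpu, PyTorch's official CPU wheel index; torch.version.cuda then reports None, confirming no CUDA build is present.)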

@pinnnkman

Hi @fumiama, there seems to be little performance difference between using CUDA and not using it.
The first 9 runs below used

chat.load(compile=False, device="cpu")

and the second set used

chat.load(compile=False, device="cuda")

(test_ChatTTS) C:\Workspace\Repo_Git\ChatTTS>c:/Workspace/Repo_Git/ChatTTS/test_ChatTTS/Scripts/python.exe c:/Workspace/Repo_Git/ChatTTS/test.py
PyTorch Version: 2.5.1+cu118
CUDA Available: True
CUDA Version: 11.8
CUDA Compiled: True
CUDNN Version: 90100
load time 4.339531183242798
code:   0%|                                                                                                                                                      | 1/2048(max) [00:00,  5.63it/s]We detected that you are passing `past_key_values` as a tuple of tuples. This is deprecated and will be removed in v4.47. Please convert your cache or use an appropriate `Cache` class (https://huggingface.co/docs/transformers/kv_cache#legacy-cache-format)
code:   7%|██████████▋                                                                                                                                         | 148/2048(max) [00:03, 40.13it/s]
run 1 5.834841728210449
code:   7%|██████████▉                                                                                                                                         | 152/2048(max) [00:03, 43.28it/s]
run 2 3.5652668476104736
code:   7%|██████████▎                                                                                                                                         | 142/2048(max) [00:03, 46.00it/s]
run 3 3.1287615299224854
code:   7%|██████████▋                                                                                                                                         | 148/2048(max) [00:03, 43.91it/s]
run 4 3.390197515487671
code:   7%|██████████▊                                                                                                                                         | 150/2048(max) [00:03, 42.90it/s]
run 5 3.5418283939361572
code:   7%|██████████▊                                                                                                                                         | 149/2048(max) [00:03, 46.37it/s]
run 6 3.2616772651672363
code:   7%|██████████▊                                                                                                                                         | 150/2048(max) [00:03, 42.73it/s]
run 7 3.547151565551758
code:   7%|██████████▊                                                                                                                                         | 149/2048(max) [00:03, 40.61it/s]
run 8 3.699723482131958
code:   7%|██████████▋                                                                                                                                         | 148/2048(max) [00:03, 45.67it/s] 
run 9 3.2647745609283447

(test_ChatTTS) C:\Workspace\Repo_Git\ChatTTS>c:/Workspace/Repo_Git/ChatTTS/test_ChatTTS/Scripts/python.exe c:/Workspace/Repo_Git/ChatTTS/test.py
PyTorch Version: 2.5.1+cu118
CUDA Available: True
CUDA Version: 11.8
CUDA Compiled: True
CUDNN Version: 90100
load time 3.498683452606201
code:   0%|                                                                                                                                                      | 1/2048(max) [00:00,  5.13it/s]We detected that you are passing `past_key_values` as a tuple of tuples. This is deprecated and will be removed in v4.47. Please convert your cache or use an appropriate `Cache` class (https://huggingface.co/docs/transformers/kv_cache#legacy-cache-format)
code:   7%|██████████▉                                                                                                                                         | 151/2048(max) [00:05, 25.97it/s]
run 1 7.810804128646851
code:   8%|███████████▎                                                                                                                                        | 156/2048(max) [00:06, 25.45it/s]
run 2 6.220078945159912
code:   7%|██████████▎                                                                                                                                         | 143/2048(max) [00:05, 25.33it/s]
run 3 5.733705043792725
code:   7%|██████████▋                                                                                                                                         | 148/2048(max) [00:05, 25.08it/s]
run 4 6.0042078495025635
code:   7%|██████████▋                                                                                                                                         | 148/2048(max) [00:05, 25.61it/s]
run 5 5.844567537307739
code:   7%|██████████▍                                                                                                                                         | 144/2048(max) [00:05, 25.88it/s]
run 6 5.660144567489624
code:   7%|██████████▎                                                                                                                                         | 143/2048(max) [00:05, 24.84it/s]
run 7 5.823593616485596
code:   7%|██████████▏                                                                                                                                         | 141/2048(max) [00:06, 23.44it/s]
run 8 6.095613241195679
code:   8%|███████████▏                                                                                                                                        | 154/2048(max) [00:06, 25.22it/s]
run 9 6.1926469802856445

github-actions bot removed the stale (The topic has been ignored for a long time) label on Dec 6, 2024
@fumiama
Member

fumiama commented Dec 6, 2024

It looks like CUDA is even slower than CPU. Check in Task Manager whether the GPU is actually being saturated.
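
Besides Task Manager, utilization can be polled from the script itself. A minimal sketch, assuming the nvidia-ml-py (pynvml) package that torch.cuda.utilization() relies on is installed:

import threading
import time
import torch

def monitor_gpu(stop: threading.Event, interval: float = 0.5) -> None:
    # torch.cuda.utilization() wraps NVML (requires the pynvml package)
    # and reports the same percentage nvidia-smi shows.
    while not stop.is_set():
        print(f"GPU util: {torch.cuda.utilization()}%")
        time.sleep(interval)

stop = threading.Event()
threading.Thread(target=monitor_gpu, args=(stop,), daemon=True).start()
# ... run chat.infer(...) here: near 100% means the GPU is saturated,
# while near 0% suggests the work is effectively running on the CPU.
stop.set()

Note that on Windows the default Task Manager GPU graphs track the 3D and Copy engines; CUDA load only appears if one of the graphs is switched to "Cuda", so nvidia-smi is often the more reliable check.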

github-actions bot added the stale (The topic has been ignored for a long time) label on Jan 6, 2025