I'm encountering a RuntimeError when trying to run a script with the DeepSeek-VL2-tiny model for image and language processing. Specifically, the error occurs during the forward pass in the generate method of the vl_gpt.language object.
Error Message:
Traceback (most recent call last):
  File "/home/yiqiao/Desktop/DeepSeek-VL2/mydemo.py", line 57, in <module>
    outputs = vl_gpt.language.generate(
  File "/home/yiqiao/miniconda3/envs/vita/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 116, in decorate_context
    return func(*args, **kwargs)
  File "/home/yiqiao/miniconda3/envs/vita/lib/python3.10/site-packages/transformers/generation/utils.py", line 2252, in generate
    result = self._sample(
  File "/home/yiqiao/miniconda3/envs/vita/lib/python3.10/site-packages/transformers/generation/utils.py", line 3251, in _sample
    outputs = self(**model_inputs, return_dict=True)
  File "/home/yiqiao/miniconda3/envs/vita/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1736, in _wrapped_call_impl
    return self._call_impl(*args, **kwargs)
  File "/home/yiqiao/miniconda3/envs/vita/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1747, in _call_impl
    return forward_call(*args, **kwargs)
  File "/home/yiqiao/Desktop/DeepSeek-VL2/deepseek_vl/models/modeling_deepseek.py", line 1723, in forward
    outputs = self.model(
  File "/home/yiqiao/miniconda3/envs/vita/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1736, in _wrapped_call_impl
    return self._call_impl(*args, **kwargs)
  File "/home/yiqiao/miniconda3/envs/vita/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1747, in _call_impl
    return forward_call(*args, **kwargs)
  File "/home/yiqiao/Desktop/DeepSeek-VL2/deepseek_vl/models/modeling_deepseek.py", line 1592, in forward
    layer_outputs = decoder_layer(
  File "/home/yiqiao/miniconda3/envs/vita/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1736, in _wrapped_call_impl
    return self._call_impl(*args, **kwargs)
  File "/home/yiqiao/miniconda3/envs/vita/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1747, in _call_impl
    return forward_call(*args, **kwargs)
  File "/home/yiqiao/Desktop/DeepSeek-VL2/deepseek_vl/models/modeling_deepseek.py", line 1306, in forward
    hidden_states, self_attn_weights, present_key_value = self.self_attn(
  File "/home/yiqiao/miniconda3/envs/vita/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1736, in _wrapped_call_impl
    return self._call_impl(*args, **kwargs)
  File "/home/yiqiao/miniconda3/envs/vita/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1747, in _call_impl
    return forward_call(*args, **kwargs)
  File "/home/yiqiao/miniconda3/envs/vita/lib/python3.10/site-packages/transformers/models/llama/modeling_llama.py", line 309, in forward
    query_states = query_states.view(bsz, q_len, -1, self.head_dim).transpose(1, 2)
RuntimeError: cannot reshape tensor of 0 elements into shape [1, 0, -1, 128] because the unspecified dimension size -1 can be any value and is ambiguous
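For anyone reading the traceback: the target shape `[1, 0, -1, 128]` means `q_len` was 0 when the attention layer ran, so the query tensor had no elements and the `-1` dimension could not be inferred. The traceback alone does not show why the sequence length was 0. Below is a minimal pure-Python sketch of the shape-inference rule the message describes. `infer_view_shape` is a hypothetical helper written for illustration, not PyTorch's implementation, and the element count 10240 in the usage note is an assumed example value.

```python
from math import prod


def infer_view_shape(numel, shape):
    """Resolve a single -1 in `shape` for a tensor holding `numel` elements.

    Simplified illustration of the rule behind the error message above;
    this is NOT PyTorch's actual code.
    """
    if shape.count(-1) > 1:
        raise ValueError("only one dimension can be inferred")

    # Product of all explicitly given dimensions (1 if there are none).
    known = prod(d for d in shape if d != -1)

    if -1 not in shape:
        if known != numel:
            raise ValueError(f"shape {shape} is invalid for input of size {numel}")
        return list(shape)

    if known == 0:
        # 0 * x == 0 holds for every x, so the -1 cannot be determined.
        raise ValueError(
            f"cannot reshape {numel} elements into {shape}: "
            "the -1 dimension is ambiguous"
        )

    if numel % known != 0:
        raise ValueError(f"shape {shape} is invalid for input of size {numel}")

    return [numel // known if d == -1 else d for d in shape]
```

With a non-empty sequence the inference succeeds, e.g. `infer_view_shape(10240, [1, 5, -1, 128])` gives `[1, 5, 16, 128]`, while `infer_view_shape(0, [1, 0, -1, 128])` raises, matching the failure above. A useful first check is therefore the sequence dimension of whatever is passed to `vl_gpt.language.generate(` in `mydemo.py`.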
Environment: