Reminder
System Info
template: qwen2_vl
cutoff_len: 4096
max_samples: 1000
overwrite_cache: true
preprocessing_num_workers: 16
Reproduction
position_ids[..., i, attention_mask[i] == 1] = llm_positions.to(position_ids.device)
RuntimeError: shape mismatch: value tensor of shape [3, 4255] cannot be broadcast to indexing result of shape [3, 4095]
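The two shapes in the error differ only in the last dimension: the mask selects 4095 slots, while `llm_positions` carries 4255 entries. One possible reading, not confirmed anywhere in this report, is that the sample was truncated near `cutoff_len: 4096` while the positions were still computed for the full multimodal sequence. The sketch below only illustrates that size check in plain Python; `check_position_fit` is a hypothetical helper, not part of any library, and the mask is a stand-in built from the numbers in the traceback.

```python
def check_position_fit(attention_mask_row, num_positions):
    """Compare the number of active mask slots with the number of positions.

    Hypothetical helper for illustration only. The masked assignment in the
    traceback can succeed only when both counts are equal.
    """
    active = sum(1 for m in attention_mask_row if m == 1)
    return active == num_positions, active


# Numbers taken from the error message; the mask itself is an assumption.
mask = [1] * 4095            # slots selected by attention_mask[i] == 1
num_positions = 4255         # last dimension of llm_positions

fits, active = check_position_fit(mask, num_positions)
overflow = num_positions - active
print(fits, active, overflow)  # False 4095 160
```

If this reading is right, the sample needs 160 more token slots than survive truncation, so logging sample lengths before the masked assignment should show which inputs exceed the limit.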
Expected behavior
......
Others
No response