You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I met the following error that keeps interrupting my training process.
This happened after 1000 steps and is a ONNXRuntimeError error
Could you help to check if there is anything wrong?
Environment:
1x RTX 5880 Ada 48G (Ada architecture)
batch size: 1
python environment: conda environment you provided
Hi @xumingw
Inference works well. Is it possible the onnx version you provided is compatible with Ampere architecture, but not with Ada architecture..?
Any suggestions?
Dear Sir or Madam,
I met the following error that keeps interrupting my training process.
This happened after 1000 steps and is a
ONNXRuntimeError
errorCould you help to check if there is anything wrong?
Environment:
commands:
CUDA_VISIBLE_DEVICES=1 accelerate launch -m --config_file accelerate_config.yaml --machine_rank 0 --main_process_ip 0.0.0.0 --main_process_port 20055 --num_machines 1 --num_processes 1 scripts.train_stage1 --config ./configs/train/stage1.yaml
error:
Looking forward your reply. Thanks.
The text was updated successfully, but these errors were encountered: