You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The version of Transformers is 4.47.1, but ms-swift raise the following error:
ValueError: The checkpoint you are trying to load has model type deepseek_vl2 but Transformers does not recognize this architecture. This could be because of an issue with the checkpoint, or because your version of Transformers is out of date.
The version of Transformers is 4.47.1, but ms-swift raise the following error:
ValueError: The checkpoint you are trying to load has model type
deepseek_vl2but Transformers does not recognize this architecture. This could be because of an issue with the checkpoint, or because your version of Transformers is out of date.
ms-swift:
CUDA_VISIBLE_DEVICES=0,1,2,3 \ VIDEO_MAX_PIXELS=50176 \ swift sft \ --model /gpu/nfs/raymodel/deepseek-ai/deepseek-vl2-small \ --train_type lora \ --dataset accident/train.jsonl \ --num_train_epochs 10 \ --per_device_train_batch_size 2 \ --learning_rate 1e-4 \ --lora_rank 8 \ --lora_alpha 32 \ --gradient_accumulation_steps 16 \ --eval_steps 50 \ --save_steps 50 \ --save_total_limit 2 \ --logging_steps 10 \ --target_modules all-linear \ --freeze_llm false \ --freeze_vit false \ --freeze_aligner false
The text was updated successfully, but these errors were encountered: