Skip to content

how to avoid oom when inference qwen2-vl 7B with batch=2? #125

how to avoid oom when inference qwen2-vl 7B with batch=2?

how to avoid oom when inference qwen2-vl 7B with batch=2? #125

Triggered via issue December 11, 2024 08:58
@sunnyqggsunnyqgg
commented on #2496 aaacc9b
Status Skipped
Total duration 5s
Artifacts

blossom-ci.yml

on: issue_comment
Authorization
0s
Authorization
Upload log
0s
Upload log
Vulnerability scan
0s
Vulnerability scan
Start ci job
0s
Start ci job
Fit to window
Zoom out
Zoom in