gpt_big_code: make flash attention impl quantization friendly #5117

Re-run triggered September 25, 2024 15:32
Status Failure
Total duration 25m 21s

fast_tests.yml

on: pull_request
Run tests for optimum.habana.transformers
5m 28s
Run tests for optimum.habana.diffusers
25m 4s

Annotations

1 error and 4 warnings
Run tests for optimum.habana.diffusers
Process completed with exit code 2.
Run tests for optimum.habana.transformers
The following actions uses node12 which is deprecated and will be forced to run on node16: actions/checkout@v2. For more info: https://github.blog/changelog/2023-06-13-github-actions-all-actions-will-run-on-node16-instead-of-node12-by-default/
Run tests for optimum.habana.transformers
The following actions use a deprecated Node.js version and will be forced to run on node20: actions/checkout@v2. For more info: https://github.blog/changelog/2024-03-07-github-actions-all-actions-will-run-on-node20-instead-of-node16-by-default/
Run tests for optimum.habana.diffusers
The following actions uses node12 which is deprecated and will be forced to run on node16: actions/checkout@v2. For more info: https://github.blog/changelog/2023-06-13-github-actions-all-actions-will-run-on-node16-instead-of-node12-by-default/
Run tests for optimum.habana.diffusers
The following actions use a deprecated Node.js version and will be forced to run on node20: actions/checkout@v2. For more info: https://github.blog/changelog/2024-03-07-github-actions-all-actions-will-run-on-node20-instead-of-node16-by-default/