Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Disable the exllama on all non-cuda devices. (#2003)
* Disable the exllama on all non-cuda devices. 1. Disable the exllama on all non-cuda devices. 2. Don't raise the error when running on non-cuda device. Signed-off-by: yuanwu <[email protected]> * Refine the code Signed-off-by: yuanwu <[email protected]> * Fix errors of make style Signed-off-by: yuanwu <[email protected]> * Add hpu device Signed-off-by: yuanwu <[email protected]> * Update optimum/gptq/constants.py Co-authored-by: Ilyas Moutawwakil <[email protected]> * Update optimum/gptq/quantizer.py Co-authored-by: Ilyas Moutawwakil <[email protected]> * Update optimum/gptq/quantizer.py Co-authored-by: Ilyas Moutawwakil <[email protected]> * Update optimum/gptq/quantizer.py Co-authored-by: Ilyas Moutawwakil <[email protected]> * Fix error of make style Signed-off-by: yuanwu <[email protected]> --------- Signed-off-by: yuanwu <[email protected]> Co-authored-by: Ilyas Moutawwakil <[email protected]>
- Loading branch information