
lora doesn't work when kv_cache is disabled #2543

Open
ShuaiShao93 opened this issue Dec 5, 2024 · 0 comments
Labels: Investigating, Lora/P-tuning, triaged (Issue has been triaged by maintainers)


System Info

x86_64, Debian 11, NVIDIA L4 GPU

Who can help?

No response

Information

  • The official example scripts
  • My own modified scripts

Tasks

  • An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
  • My own task or dataset (give details below)

Reproduction

  1. Install TensorRT-LLM 0.15.0.
  2. Build an engine with AWQ INT4 quantization and LoRA enabled, with the KV cache disabled.
  3. Run TensorRT-LLM/examples/run.py, passing the LoRA directory.
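For concreteness, the steps above might look like the following. This is a sketch, not the reporter's exact commands: the checkpoint/engine/adapter paths and the input text are placeholders, and the flag set assumes the TensorRT-LLM 0.15 `trtllm-build` and `examples/run.py` interfaces.

```shell
# Sketch of the reproduction; paths and model names are placeholders.

# 1. Install TensorRT-LLM 0.15.0
pip install tensorrt-llm==0.15.0

# 2. Build from an AWQ INT4 checkpoint with the LoRA plugin enabled
#    and the KV cache disabled.
trtllm-build \
  --checkpoint_dir ./ckpt_awq_int4 \
  --output_dir ./engine \
  --lora_plugin auto \
  --lora_dir ./my_lora_adapter \
  --kv_cache_type disabled

# 3. Run with the LoRA directory; this is the step that segfaults.
python TensorRT-LLM/examples/run.py \
  --engine_dir ./engine \
  --tokenizer_dir ./base_model \
  --lora_dir ./my_lora_adapter \
  --lora_task_uids 0 \
  --input_text "Hello" \
  --max_output_len 32
```

Omitting `--kv_cache_type disabled` (i.e. leaving the KV cache enabled) is the configuration under which LoRA reportedly works.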

Expected behavior

Generation completes and results are returned.

Actual behavior

The run crashes with a segmentation fault.

Additional notes

N/A

ShuaiShao93 added the bug label (Dec 5, 2024)
github-actions bot added the triaged and Investigating labels (Dec 6, 2024)
nv-guomingz added the Lora/P-tuning label and removed the bug, triaged, and Investigating labels (Dec 10, 2024)
github-actions bot re-added the triaged and Investigating labels (Dec 10, 2024)

3 participants