Whisper Large-V2 encoder hangs when building the engine
@yuekaizhang I'm sticking with v0.15 for now. In the meantime, the issue about fetching the encoder output from the executor response is also a priority, so that we can have word timestamps: #2338 (comment)
System Info
Who can help?
@byshiue
Information
Tasks
examples folder (such as GLUE/SQuAD, ...)
Reproduction
trtllm-build --checkpoint_dir trt/whisper_large-v2_weights_int8/encoder \
    --output_dir trt/whisper_large-v2_int8_2/encoder \
    --kv_cache_type paged \
    --moe_plugin disable \
    --enable_xqa enable \
    --max_batch_size 16 \
    --gemm_plugin disable \
    --bert_attention_plugin float16 \
    --max_input_len 3000 \
    --max_seq_len 3000 \
    --log_level debug \
    --enable_debug_output
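For anyone trying to reproduce the hang without watching the terminal for an hour, a wrapper along these lines may help. It is only a sketch: `run_with_timeout` and the log file name are hypothetical helpers, not part of TensorRT-LLM, and it assumes GNU coreutils `timeout` is available (it exits with status 124 when the time limit is hit).

```shell
# Hypothetical helper (not part of TensorRT-LLM): run a command with a time
# limit, save its output to a log file, and print the last log line if the
# limit is reached. Assumes GNU coreutils `timeout`.
run_with_timeout() {
  limit="$1"   # duration understood by `timeout`, e.g. 3600 or 1h
  log="$2"     # file that receives stdout and stderr of the command
  shift 2
  timeout "$limit" "$@" >"$log" 2>&1
  status=$?
  if [ "$status" -eq 124 ]; then
    # 124 is the exit status `timeout` uses when the command ran too long
    echo "timed out after $limit; last log line:"
    tail -n 1 "$log"
  fi
  return "$status"
}

# Example call (same flags as the reproduction command above):
#   run_with_timeout 1h build_encoder.log trtllm-build --checkpoint_dir ... --log_level debug
```

While the build runs, `nvidia-smi --query-compute-apps=pid,used_memory --format=csv -l 5` in a second terminal shows whether the process's GPU memory usage is still changing.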
Expected behavior
The engine builds successfully and quickly.
actual behavior
Whisper Large-V2 encoder hangs when building the engine
this is the last thing printed before it hangs:
[11/27/2024-23:00:09] [TRT] [V] Disabling unused tactic source: JIT_CONVOLUTIONS
I waited for around an hour, but the build still didn't complete, and the nvidia-smi memory usage for the process stayed constant.
The Tiny model builds fine, but it's much slower than with older versions.
additional notes
None