TensorRT-LLM often hangs using both tp_size 2
and enable_context_fmha
.
#390
Labels
bug
Something isn't working
tp_size 2
and enable_context_fmha
.
#390
System Info
Who can help?
No response
Information
Tasks
examples
folder (such as GLUE/SQuAD, ...)Reproduction
Expected behavior
It works well without hanging.
actual behavior
TensorRT-LLM often hangs using both
tp_size 2
andenable_context_fmha
.additional notes
NA
The text was updated successfully, but these errors were encountered: