Skip to content

Commit 2b40863

Browse files
yewentao256usberkeley
authored andcommitted
[Feature] Batch Invariant for R1 TP 8 on Blackwell (vllm-project#27229)
Signed-off-by: yewentao256 <[email protected]>
1 parent 0143de9 commit 2b40863

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

vllm/model_executor/layers/batch_invariant.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -756,10 +756,10 @@ def override_envs_for_invariance():
756756
"FLEX_ATTENTION",
757757
"FLASHINFER",
758758
"FLASH_ATTN_MLA",
759+
"FLASHINFER_MLA",
759760
"TRITON_MLA",
760761
# Not yet supported MLA backends
761762
# "FLASHMLA",
762-
# "FLASHINFER_MLA",
763763
]
764764
if curr_attn_backend not in supported_backends:
765765
warning = (

0 commit comments

Comments
 (0)