Skip to content

Conversation

@nngokhale
Copy link
Contributor

  1. Increase reserved memory
  2. Round down max num seqs
  3. Disable prefix caching

@github-actions
Copy link

✅ CI Passed

All checks passed successfully against the following vllm commit:
f71952c1c49fb86686b0b300b727b26282362bf4

@nngokhale nngokhale force-pushed the plugin-cd-0.11.0_wa1 branch from 4cf55b9 to 76fd021 Compare October 17, 2025 12:30
@github-actions
Copy link

✅ CI Passed

All checks passed successfully against the following vllm commit:
f71952c1c49fb86686b0b300b727b26282362bf4

@mgawarkiewicz-intel mgawarkiewicz-intel enabled auto-merge (squash) November 3, 2025 14:35
@mgawarkiewicz-intel mgawarkiewicz-intel merged commit ec41590 into vllm-project:releases/v0.11.0 Nov 3, 2025
4 checks passed
@github-actions
Copy link

github-actions bot commented Nov 3, 2025

✅ CI Passed

All checks passed successfully against the following vllm commit:
f71952c1c49fb86686b0b300b727b26282362bf4

nngokhale added a commit to nngokhale/vllm-gaudi that referenced this pull request Nov 17, 2025
1. Increase reserved memory
2. Round down max num seqs
3. Disable prefix caching

Signed-off-by: Neelesh Gokhale <[email protected]>
Co-authored-by: Michal Gawarkiewicz <[email protected]>
nngokhale added a commit to nngokhale/vllm-gaudi that referenced this pull request Nov 19, 2025
1. Increase reserved memory
2. Round down max num seqs
3. Disable prefix caching

Signed-off-by: Neelesh Gokhale <[email protected]>
Co-authored-by: Michal Gawarkiewicz <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants