Skip to content

Commit d92a609

Browse files
author
Vladimir Leskov
committed
Fix issue with cuda.
Details: vllm-project#963
1 parent 606aab2 commit d92a609

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

src/llmcompressor/modifiers/utils/pytorch_helpers.py

+1-1
Original file line numberDiff line numberDiff line change
@@ -102,7 +102,7 @@ def run_calibration_forward(
102102

103103
# TODO: not ideal, figure out where we aren't freeing memory instead
104104
# currently without this we run OOM on the 2nd forward pass
105-
torch.cuda.empty_cache()
105+
# torch.cuda.empty_cache()
106106

107107
return intermediates
108108

0 commit comments

Comments
 (0)