Commit 7f9c890

[Hardware][AWS] update neuron to 2.20 (vllm-project#8676)
Signed-off-by: omrishiv <[email protected]>
1 parent 7c8566a commit 7f9c890

2 files changed: 4 additions, 4 deletions

Dockerfile.neuron

Lines changed: 2 additions & 2 deletions
@@ -1,5 +1,5 @@
 # default base image
-ARG BASE_IMAGE="public.ecr.aws/neuron/pytorch-inference-neuronx:2.1.2-neuronx-py310-sdk2.19.1-ubuntu20.04"
+ARG BASE_IMAGE="public.ecr.aws/neuron/pytorch-inference-neuronx:2.1.2-neuronx-py310-sdk2.20.0-ubuntu20.04"

 FROM $BASE_IMAGE

@@ -20,7 +20,7 @@ RUN python3 -m pip install --upgrade pip
 RUN python3 -m pip install --no-cache-dir fastapi ninja tokenizers pandas
 RUN python3 -m pip install sentencepiece transformers==4.36.2 -U
 RUN python3 -m pip install transformers-neuronx --extra-index-url=https://pip.repos.neuron.amazonaws.com -U
-RUN python3 -m pip install --pre neuronx-cc==2.12.* --extra-index-url=https://pip.repos.neuron.amazonaws.com -U
+RUN python3 -m pip install --pre neuronx-cc==2.15.* --extra-index-url=https://pip.repos.neuron.amazonaws.com -U

 COPY ./vllm /app/vllm/vllm
 COPY ./setup.py /app/vllm/setup.py

requirements-neuron.txt

Lines changed: 2 additions & 2 deletions
@@ -2,6 +2,6 @@
 -r requirements-common.txt

 # Dependencies for Neuron devices
-transformers-neuronx >= 0.9.0
-torch-neuronx >= 2.1.0
+transformers-neuronx >= 0.12.0
+torch-neuronx >= 2.1.2
 neuronx-cc
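
The raised lower bounds in requirements-neuron.txt (0.9.0 to 0.12.0, 2.1.0 to 2.1.2) only compare correctly when versions are treated as numbers, not strings: as text, "0.9.0" sorts after "0.12.0". The following is a minimal sketch of that comparison, not part of this commit. The helper names `parse_version`, `parse_minimum`, and `meets_minimum` are hypothetical, and the sketch assumes plain dotted-integer versions with a single `>=` bound; a real check would more likely rely on pip itself or on `packaging.version.Version`.

```python
import re

# Lower bounds copied from the updated requirements-neuron.txt in this commit.
REQUIREMENT_LINES = [
    "transformers-neuronx >= 0.12.0",
    "torch-neuronx >= 2.1.2",
]


def parse_version(text):
    """Turn '2.1.2' into (2, 1, 2). Assumes plain dotted integers only."""
    return tuple(int(part) for part in text.split("."))


def parse_minimum(line):
    """Split a 'name >= X.Y.Z' line into (name, version tuple).

    Hypothetical helper: handles only the single '>=' form used above.
    """
    match = re.fullmatch(r"\s*([A-Za-z0-9_.-]+)\s*>=\s*(\d+(?:\.\d+)*)\s*", line)
    if match is None:
        raise ValueError(f"unsupported requirement line: {line!r}")
    return match.group(1), parse_version(match.group(2))


def meets_minimum(line, installed):
    """Return True if the installed version string satisfies the '>=' bound."""
    _, minimum = parse_minimum(line)
    return parse_version(installed) >= minimum
```

For example, `meets_minimum(REQUIREMENT_LINES[0], "0.9.0")` is false under the new bound, whereas a naive string comparison of "0.9.0" against "0.12.0" would wrongly accept it.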
