Currently, all models with sharded checkpoints, such as the two below, fail to deploy: the toolkit filters downloaded files against a predefined allowlist, and the sharded checkpoint format isn't included in that list.
I've opened a PR that fixes this issue in #93, but until it is merged you may be able to work around it by building a custom Docker image from my fork, like so:
FROM 763104351884.dkr.ecr.us-east-1.amazonaws.com/huggingface-pytorch-inference:2.0.0-transformers4.28.1-gpu-py310-cu118-ubuntu20.04
RUN pip install --no-cache-dir \
git+https://github.com/JimAllanson/sagemaker-huggingface-inference-toolkit@sharded-checkpoint-support
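To make the failure mode concrete, here is a minimal sketch of the kind of filename filtering described above. The `ALLOWED` set and helper names are hypothetical (they are not the toolkit's actual identifiers); the point is that an allowlist of exact, non-sharded filenames silently drops every shard of a sharded checkpoint, so no weights reach the container:

```python
import re

# Hypothetical allowlist in the spirit of the toolkit's filter: exact
# well-known filenames only, with no entry for sharded weight files.
ALLOWED = {
    "config.json",
    "pytorch_model.bin",
    "tokenizer.json",
    "tokenizer_config.json",
    "special_tokens_map.json",
}

# Sharded checkpoints split the weights across numbered files plus an index:
SHARDED_FILES = [
    "pytorch_model-00001-of-00003.bin",
    "pytorch_model-00002-of-00003.bin",
    "pytorch_model-00003-of-00003.bin",
    "pytorch_model.bin.index.json",
]

def is_allowed(filename: str) -> bool:
    """Exact-match allowlist check, as a stand-in for the real filter."""
    return filename in ALLOWED

# Every shard is filtered out, so the model has no weights to load
# and the endpoint fails its health check.
kept = [f for f in SHARDED_FILES if is_allowed(f)]
print(kept)  # []

# A fix along the lines of the PR would also admit the sharded pattern:
SHARD_RE = re.compile(
    r"pytorch_model-\d{5}-of-\d{5}\.bin|pytorch_model\.bin\.index\.json"
)
kept_fixed = [f for f in SHARDED_FILES if is_allowed(f) or SHARD_RE.fullmatch(f)]
print(kept_fixed)  # all four sharded files survive the filter
```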
Background
We are attempting to deploy SageMaker endpoints using the code provided under "Deploy → Amazon SageMaker" on huggingface.co for these two models:
https://huggingface.co/Salesforce/codegen25-7b-multi
https://huggingface.co/openchat/opencoderplus
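For reference, the huggingface.co "Deploy → Amazon SageMaker" snippet follows this general shape (a sketch, not our exact script: the instance type and version pins here are assumptions, and running it requires AWS credentials and a SageMaker execution role):

```python
import sagemaker
from sagemaker.huggingface import HuggingFaceModel

role = sagemaker.get_execution_role()

# Hub model configuration; HF_MODEL_ID triggers a download of the
# checkpoint at startup, which is where the file filtering happens.
hub = {
    "HF_MODEL_ID": "Salesforce/codegen25-7b-multi",
    "HF_TASK": "text-generation",
}

huggingface_model = HuggingFaceModel(
    transformers_version="4.28.1",  # assumed; match your container
    pytorch_version="2.0.0",
    py_version="py310",
    env=hub,
    role=role,
)

# Deployment hangs and then fails the health check for sharded models.
predictor = huggingface_model.deploy(
    initial_instance_count=1,
    instance_type="ml.g5.2xlarge",  # assumed instance type
)
```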
Error
Both endpoints consistently fail to deploy; each fails its health check. Error logs are available on request, as it does not appear I can attach them here.