Skip to content

Log training image used for the SM Training Job when user error #2063

@ChaiBapchya

Description

@ChaiBapchya

Describe the bug
Error is

botocore.exceptions.ClientError: An error occurred (ValidationException) when calling the CreateTrainingJob operation: Invalid training image. Please provide a valid Amazon Elastic Container Registry path of the Docker image to run.

In such a case, we should report the training image that is being used [which is supposedly invalid]
This would improve user experience.

To reproduce
Pass image uri that doesn't exist in the ECR

Expected behavior
Fail with the same error but describes the incorrect image name

System information
A description of your system. Please provide:

  • SageMaker Python SDK version: 2.23.1
  • Python version:3.7
  • CPU or GPU:gpu
  • Custom Docker image (Y/N):Y

Additional context
NA

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions