Skip to content

SagemakerEstimator in Spark ML Pipeline issue #98

Open
@hdamani09

Description

@hdamani09

Reference: MLFW-2726

System Information

  • Spark
  • SDK Version:spark_2.2.0-1.2.5
  • Spark Version:2.4.3
  • Algorithm: XGBoost

Describe the problem

Hi,
I tried to add a sagemakerEstimator within a Spark ML Pipeline and fit the training dataset on the pipeline which worked without any issues. When I tried to save the pipeline itself, it threw an exception stating the pipeline contains a stage that is not writable.
Is it intended to be that way since when fit runs on the sagemakerEstimator, it automatically persists the model to trainingOutputS3DataPath ?
If I wish to have a pipeline persisted which contains other transformer stages along with the sagemakerEstimator instance how would I do it?

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions