mlops-multi-account-cdk template: batch transform option #61

Open · wants to merge 4 commits into main

Conversation

marymaj (Contributor) commented on Jan 20, 2023

Issue #, if available:

Description of changes:

This PR adds the capability to create a Batch Transform SageMaker Pipeline as an alternative to deployment with a SageMaker real-time endpoint.

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.
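
For context, a minimal sketch of what a Batch Transform pipeline looks like in the SageMaker Python SDK; the model name and S3 paths are placeholders, not the exact values used in this PR:

```python
from sagemaker.inputs import TransformInput
from sagemaker.transformer import Transformer
from sagemaker.workflow.pipeline import Pipeline
from sagemaker.workflow.steps import TransformStep

# Transformer wraps an already-created SageMaker model for batch scoring.
transformer = Transformer(
    model_name="my-approved-model",              # placeholder
    instance_count=1,
    instance_type="ml.m5.xlarge",
    output_path="s3://my-bucket/batch-output",   # placeholder
)

# TransformStep runs a batch transform job as one step of the pipeline.
step_transform = TransformStep(
    name="BatchTransform",
    transformer=transformer,
    inputs=TransformInput(data="s3://my-bucket/batch-input"),  # placeholder
)

pipeline = Pipeline(name="batch-inference-pipeline", steps=[step_transform])
```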

id=f"{prj_name}-source_scripts",
destination_bucket=i_bucket,
destination_key_prefix=f"{self.pipeline_name}/{self.timestamp}/source-scripts",
sources=[s3_deployment.Source.asset(path=f"{BASE_DIR}/source_scripts")],
Collaborator commented:

As per our discussion, the source scripts seem to be missing in this deploy repository. I'll test as soon as you add them :)

marymaj (Author) replied:

Source scripts added

@pasiunaite left a comment:

A couple of issues in the inference pipeline:

  • the get_approved_package_desc function is not implemented in get_approved_package.py, so the import in batch_inference_pipeline.py fails
  • a data quality step is added to the inference pipeline, but the baseline calculation it depends on is missing


```python
'''
from deploy_endpoint.get_approved_package import get_approved_package_desc
spec = get_approved_package_desc()
```
@pasiunaite commented on Feb 15, 2023:

The get_approved_package_desc() function is not implemented in the get_approved_package.py file, so this import fails.
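
For what it's worth, a minimal sketch of what get_approved_package_desc could look like, assuming it should return the full description of the latest approved model package (the signature and the group-name parameter are assumptions, not the PR's actual code):

```python
import boto3

sm_client = boto3.client("sagemaker")

def get_approved_package_desc(model_package_group_name: str) -> dict:
    """Return the describe-model-package output for the most recently
    approved model package in the given model package group."""
    response = sm_client.list_model_packages(
        ModelPackageGroupName=model_package_group_name,
        ModelApprovalStatus="Approved",
        SortBy="CreationTime",
        SortOrder="Descending",
        MaxResults=1,
    )
    packages = response["ModelPackageSummaryList"]
    if not packages:
        raise ValueError(
            f"No approved model package found in {model_package_group_name}"
        )
    return sm_client.describe_model_package(
        ModelPackageName=packages[0]["ModelPackageArn"]
    )
```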

```python
'''

step_process = self.get_process_step()
step_data_quality = self.get_data_quality_step(step_process)
```
@pasiunaite commented on Feb 15, 2023:

The data quality baseline calculation step is not in the training pipeline, so the data quality check in the inference pipeline fails. Please add the data quality baseline calculation to the repo.
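
For reference, a minimal sketch of a data quality baseline step in the training pipeline using the SageMaker SDK's QualityCheckStep; the role, pipeline session, step_process reference, and output S3 URI are placeholders:

```python
from sagemaker.model_monitor import DatasetFormat
from sagemaker.workflow.check_job_config import CheckJobConfig
from sagemaker.workflow.quality_check_step import (
    DataQualityCheckConfig,
    QualityCheckStep,
)

check_job_config = CheckJobConfig(
    role=role,                           # placeholder: pipeline execution role
    instance_count=1,
    instance_type="ml.m5.xlarge",
    volume_size_in_gb=30,
    sagemaker_session=pipeline_session,  # placeholder
)

data_quality_check_config = DataQualityCheckConfig(
    # placeholder: training data output of the processing step
    baseline_dataset=step_process.properties.ProcessingOutputConfig.Outputs[
        "train"
    ].S3Output.S3Uri,
    dataset_format=DatasetFormat.csv(header=True),
    output_s3_uri="s3://my-bucket/data-quality-baseline",  # placeholder
)

# skip_check=True computes and registers a new baseline instead of
# validating against an existing one.
step_data_quality_baseline = QualityCheckStep(
    name="DataQualityBaseline",
    skip_check=True,
    register_new_baseline=True,
    quality_check_config=data_quality_check_config,
    check_job_config=check_job_config,
)
```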

```python
# Excerpt; imports added for context.
import logging
from os import getenv

logger = logging.getLogger(__file__.split('/')[-1])
logger.setLevel(getenv("LOGLEVEL", "INFO"))

def upload_assets_to_s3(account_id):
```
Collaborator commented:

Hey Maria, do we actually need that part?
Can we just keep the pipeline definition and code in the dev account and point to that file in "deploy_endpoint_stack"?
Copying the pipeline definitions and scripts could be avoided as long as the role for preprod and prod has access to the S3 bucket in the dev account.
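
A minimal sketch of that alternative in CDK, assuming i_bucket is the dev-account bucket; the account IDs and role name below are hypothetical placeholders:

```python
from aws_cdk import aws_iam as iam

# Hypothetical placeholders: substitute the real preprod/prod account IDs
# and the name of the pipeline execution role in those accounts.
for account_id in (PREPROD_ACCOUNT_ID, PROD_ACCOUNT_ID):
    i_bucket.add_to_resource_policy(
        iam.PolicyStatement(
            actions=["s3:GetObject", "s3:ListBucket"],
            resources=[i_bucket.bucket_arn, i_bucket.arn_for_objects("*")],
            principals=[
                iam.ArnPrincipal(
                    f"arn:aws:iam::{account_id}:role/{EXECUTION_ROLE_NAME}"
                )
            ],
        )
    )
```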
