Amazon SageMaker AI

Type: full-code · Vendor: Amazon Web Services · Language: Python · License: proprietary · Status: active · Status in practice: mature · First released: 2017

Links: homepage docs

Amazon SageMaker AI is a managed platform for building, training, deploying, and monitoring machine learning models, including shadow testing and production model monitoring.

Description. SageMaker AI provides managed infrastructure for the model lifecycle from training through hosted inference endpoints. Shadow testing deploys a challenger model version alongside the production champion and mirrors a portion of real requests to the shadow variant without returning shadow responses to users. SageMaker Model Monitor watches deployed models in production, using rules to detect drift in data and model quality and alerting operators when deviations occur.

Agent loop shape. SageMaker AI is a managed ML platform rather than an agent loop. Models are trained and deployed to inference endpoints. Shadow testing routes a copied portion of live inference requests to a shadow fleet and compares metrics without serving shadow responses. Model Monitor runs scheduled or continuous monitoring jobs that compare captured production inputs and outputs against a baseline and raise alerts on drift.

Primary use cases

training and hosting machine learning models
shadow testing a challenger model against the champion
production monitoring of model and data quality drift

flowchart TD fw["Amazon SageMaker AI"] fw --> p1["Shadow Canary<br/>(first-class)"] fw --> p2["Scorer Live Monitoring<br/>(supported)"] fw --> p3["Lineage Tracking<br/>(supported)"] fw --> p4["Dual Evaluation (Offline + Online)<br/>(supported)"]

Key concepts

Shadow test → shadow-canary (docs) — A deployment that runs a challenger model on a copied portion of live inference traffic alongside the production model, comparing metrics without ever returning the shadow model's responses to users.
Production variant (docs) — A model-plus-resource configuration hosted behind a SageMaker endpoint; multiple variants can share one endpoint and receive a weighted share of traffic so two models can be A/B-tested on live requests.
Model Monitor → scorer-live-monitoring (docs) — A scheduled monitoring job that captures production inputs/outputs, compares them against a baseline computed from the training dataset, and raises alerts when data or model quality drifts.
Model Registry → lineage-tracking (docs) — A catalog of versioned model packages organized into Model Groups, holding training metrics and lineage so each model version is traceable and approvable for deployment.