Amazon SageMaker AI
Type: full-code · Vendor: Amazon Web Services · Language: Python · License: proprietary · Status: active · Status in practice: mature · First released: 2017
Amazon SageMaker AI is a managed platform for building, training, deploying, and monitoring machine learning models, including shadow testing and production model monitoring.
Description. SageMaker AI provides managed infrastructure for the model lifecycle from training through hosted inference endpoints. Shadow testing deploys a challenger model version alongside the production champion and mirrors a portion of real requests to the shadow variant without returning shadow responses to users. SageMaker Model Monitor watches deployed models in production, using rules to detect drift in data and model quality and alerting operators when deviations occur.
Agent loop shape. SageMaker AI is a managed ML platform rather than an agent loop. Models are trained and deployed to inference endpoints. Shadow testing routes a copied portion of live inference requests to a shadow fleet and compares metrics without serving shadow responses. Model Monitor runs scheduled or continuous monitoring jobs that compare captured production inputs and outputs against a baseline and raise alerts on drift.
Primary use cases
- training and hosting machine learning models
- shadow testing a challenger model against the champion
- production monitoring of model and data quality drift
Key concepts
- Shadow test → shadow-canary (docs) — A deployment that runs a challenger model on a copied portion of live inference traffic alongside the production model, comparing metrics without ever returning the shadow model's responses to users.
- Production variant (docs) — A model-plus-resource configuration hosted behind a SageMaker endpoint; multiple variants can share one endpoint and receive a weighted share of traffic so two models can be A/B-tested on live requests.
- Model Monitor → scorer-live-monitoring (docs) — A scheduled monitoring job that captures production inputs/outputs, compares them against a baseline computed from the training dataset, and raises alerts when data or model quality drifts.
- Model Registry → lineage-tracking (docs) — A catalog of versioned model packages organized into Model Groups, holding training metrics and lineage so each model version is traceable and approvable for deployment.
Patterns this full-code implements —
- ★★Shadow Canary
SageMaker shadow testing deploys a challenger model version alongside the production champion, mirroring a portion of real inference requests to the shadow variant and comparing outputs/metrics witho…
- ★Scorer Live Monitoring
SageMaker Model Monitor observes deployed models in production, applying rules to detect drift in data and model quality and alerting operators without altering the served predictions; note it monito…
- ★★Lineage Tracking
The SageMaker Model Registry catalogs each trained model version into Model Groups, associates training metrics and data with it, and records model lineage so a given prediction can be traced back to…
- ★Dual Evaluation (Offline + Online)
SageMaker pairs an offline track (training-time evaluation and a baseline computed from a held-out dataset) with an online track (Model Monitor on captured production traffic plus live A/B production…
Neighbourhood
Click any neighbour to follow the lineage. Scroll to zoom, drag to pan.