Operations

ML Model Operations (MLOps)

Advanced

A machine-learning model is software that also depends on data and gives probabilistic answers. So it needs everything normal software needs (versioning, testing, CI/CD, monitoring) plus its own concerns: data and model versioning, evaluation, drift detection, and rollback. This is the engineering of getting models into production and keeping them healthy, the operational counterpart to High-Risk AI's governance.

High-Risk AI covers the legal and oversight duties for automated decisions. This covers the engineering: how we train, version, deploy, monitor, and retire models reliably. The key difference from ordinary code is that a model's behaviour depends on data and can get worse over time even if the code never changes (drift). So monitoring and re-evaluation are not optional.

For our risk-scoring and AML use, this connects tightly to fail-closed behaviour, human oversight, and evidence. A model's output must be validated, explainable, and never the only gate on a regulated decision (see High-Risk AI, AML Screening, Designing for Failure).

Version and reproduce

Evaluate, deploy, monitor

Decisions, safety & oversight

Self-review checklist

Why it matters: Models fail differently from ordinary code. They get worse silently as data shifts, can encode bias, and make decisions about real people. So they need versioning, evaluation, monitoring, and human oversight as first-class engineering. For our regulated risk and AML use, disciplined MLOps is what makes automated intelligence safe, explainable, and defensible rather than an unaccountable black box.