Engineering Excellence

Measuring What Matters

Intermediate

Metrics are powerful, and they can also do harm. The right ones focus a team and drive real improvement. The wrong ones, or the right ones used badly, change behaviour for the worse, reward small things, and break trust. This page is about choosing metrics that reflect real value, knowing their limits, and never letting a number replace judgement.

Goodhart's Law warns: "when a measure becomes a target, it stops being a good measure." People work to improve whatever is counted. So a badly chosen metric slowly changes behaviour around it, often in ways that hurt the real goal. A related idea is the difference between actionable metrics, which tell you what to do, and vanity metrics, which look good but change nothing.

This page adds judgement on top of DORA, SPACE, and benchmarking. It explains how to pick, read, and use metrics well, and how to avoid the common traps that make measurement pointless or harmful.

Choose metrics that reflect value

Avoid the traps

Velocity as a target // team told: increase story-point velocity 20% each quarter
// result: points inflated, corners cut, quality drops, trust erodes

Velocity became a target, so it stopped measuring anything real. Estimates grow and quality drops. This is Goodhart's Law: the number rises while value falls.

Outcome, balance, and the team's view // track DORA (speed + stability) + a satisfaction signal,
// review trends in retros, ask the team what is slowing them down,
// run an experiment, re-measure

Balanced outcome metrics show the real bottlenecks. The team's input explains them. Improvement comes from experiments, not from hitting a number people can game.

Self-review checklist

Why it matters: Metrics shape behaviour whether you mean them to or not. So the wrong ones make a team worse: gaming, cut corners, and lost trust, all while looking like progress. Choose metrics that are actionable, balanced, and tied to real value. Hold them as signals, not targets. That is what makes measurement drive real improvement instead of empty reporting.