Operations

Backup, Recovery & Business Continuity

Foundational

A backup you have never restored is a hope, not a backup. Data will be deleted, corrupted, ransomed, or lost to a regional outage. When that happens, the only thing that matters is whether you can get the data and the service back fast enough, with little enough lost. Plan for recovery on purpose, set targets, and prove it works.

Resilience (Designing for Failure) keeps the system running through faults. Backup and disaster recovery are about getting it back after the worst has happened: data loss, corruption, a ransomware event, or a whole region going down. Two numbers frame everything. RPO is how much data you can afford to lose, measured in time. RTO is how long recovery may take. Set them per dataset, and design backups and DR to meet them.

For a regulated platform holding irreplaceable customer, KYC, and AML records, this is both an operational need and a certification requirement (see Marketplace & Certification Readiness). Recovery must follow the same rules as everything else: regulated records are retained and restored, never quietly lost, and backups of sensitive data are protected as strongly as the live data.

Back up with intent

Be able to recover

Untested, co-located backup // nightly dump written to the same storage account as production
// never restored; no RPO/RTO defined

If that account is compromised, deleted, or ransomed, both production and its backup go together. And nobody knows if the dump can even be restored. This is a backup in name only.

Isolated, immutable, tested // automated backups -> separate region/account, immutable, encrypted
// RPO 15m / RTO 2h defined per dataset
// quarterly restore drill verifies recovery and timings

Backups survive a compromise of production, cannot be tampered with, meet defined targets, and are proven by regular restore drills. Recovery is something we know works, not something we hope works.

Self-review checklist

Why it matters: Data loss is a serious threat to a business built on irreplaceable customer and regulatory records. The moment you need backups is the worst possible moment to discover they do not work. Defined recovery targets, isolated and immutable backups, and rehearsed restores turn a potential catastrophe into a contained, recoverable event, and meet a baseline every certification and serious customer expects.