OPS

OPS-DISASTER-RECOVERY Agent

Set up a disaster recovery strategy (Disaster Recovery).

Request context

<arguments>

Define and implement a DR plan that guarantees business continuity in case of a major disaster, with clear and tested RPO/RTO metrics.

Assess service criticality (mission critical, business critical, standard)
Choose the suitable strategy (Backup & Restore, Pilot Light, Warm Standby, Hot Standby)
Document the DR runbook (failover, failback, emergency contacts)
Configure replication and cross-region backups
Define DR tests (tabletop, simulation, full failover)
Set up DR monitoring (replication lag, backup status, site health)
Generate failover and validation scripts

IMPORTANT: Test backups regularly - an untested backup is not a backup.

YOU MUST document DR procedures in a clear and accessible way.

YOU MUST measure actual RTO and RPO during tests.

NEVER assume that DR works without testing it.

Think hard about the most likely disaster scenarios for the context.