Disaster Recovery Documentation & Runbook Specialist
Job Description
REQUIREMENTS
- Demonstrable experience in developing and managing DR documentation frameworks, runbooks, and Disaster Recovery Plans (DRPs) within complex enterprise environments.
- Strong understanding of DR and Business Continuity principles, including RTO/RPO, dependency mapping, failover/failback sequencing, and impact analysis.
- Experience working alongside technical teams (infrastructure, application, network) to translate technical designs and test outcomes into accurate, structured documentation.
- Proficiency with document management and version control tools and processes.
- Experience supporting DR tabletop exercises, DR tests, and live failover events in a documentation capacity.
- Strong organisational skills with the ability to manage multiple concurrent documentation streams across different technical domains.
- Excellent written communication skills, with the ability to produce documentation that is clear, concise, and suitable for both technical and non-technical audiences.
- Familiarity with audit and regulatory requirements for DR documentation and evidence retention.
PREFERRED
- Experience working within a Managed Services or Professional Services DR engagement.
- Exposure to DR automation platforms and familiarity with how automation workflows translate to runbook procedures.
- Knowledge of ITIL, ISO 22301, BCI Good Practice Guidelines, or equivalent business continuity and DR frameworks.
- Experience with Dell Technologies infrastructure environments.
- Relevant certifications: BCI (CBCI/MBCI), DRII (CBCP), ITIL, or equivalent.
RESPONSIBILITIES
- Own the end-to-end DR documentation framework, including the definition and maintenance of standards, templates, and version control processes.
- Develop, maintain, and continuously update DRPs in alignment with automation workflow documentation, reflecting changes in applications, infrastructure, and network as they occur.
- Ensure that application and infrastructure dependency maps, startup/shutdown sequences, and impact analyses are accurately represented in DR runbooks and operational guides.
- Coordinate with the Technical Lead, Application SME, Infrastructure SME, Network SME, and Business Continuity stakeholders to capture design decisions, test results, lessons learned, and remediation actions as formal documentation updates.
- Support DR tests, exercises, and live events by preparing and distributing documentation packs, and capturing outcomes, deviations, and improvement actions in post-event documentation.
- Maintain traceability between automation workflow logic and corresponding runbook steps, ensuring alignment is preserved through every change cycle.
- Establish and enforce documentation governance processes, including review cycles, approval workflows, and change control integration for DR documentation.
- Work with the broader team to identify gaps between documented procedures and actual DR automation behaviour, and drive resolution through structured updates.
- Produce clear, audit-ready documentation that satisfies regulatory and compliance requirements for DR evidence and operational records.