Senior Site Reliability Engineer
Job Description
REQUIREMENTS
- 7+ years of experience in Cloud, SRE, Systems, or DevOps Engineering
- 5+ years of experience operating production workloads on AWS
- 3+ years of experience supporting blockchain infrastructure, nodes, or Web3 applications
- Proficiency with IaC tools such as Terraform and Helm
- Extensive experience with containers and Kubernetes orchestration
- Strong knowledge of CI/CD pipelines and automation/scripting (Python, Bash, or Go)
- Experience with observability tools like Datadog, Prometheus, or OpenTelemetry
- Understanding of smart contracts and working experience with Solidity
PREFERRED
- AWS Certifications (Solutions Architect, DevOps Engineer, or SysOps Administrator)
- Deep experience with decentralized system operations and Web3 ecosystems
- Proven experience with SRE methodologies and GitOps workflows
- Experience working in compliance-driven environments (SOC 2, PCI DSS, or ISO 27001)
RESPONSIBILITIES
- Architect, deploy, and operate highly available AWS infrastructure optimized for blockchain workloads
- Implement Infrastructure as Code (IaC) using Terraform and Helm for repeatable provisioning
- Manage production container platforms including EKS, ECS, Kubernetes, and Docker
- Deploy and maintain blockchain nodes (full, archive, and light clients) and RPC endpoints on EVM-compatible chains
- Build and manage CI/CD pipelines using tools like GitHub Actions or Jenkins
- Implement observability strategies using Datadog, Prometheus, and Grafana to monitor blockchain health and system performance
- Lead incident response for S1/S2 events and conduct post-incident reviews
- Drive automation to reduce operational toil and implement AIOps for predictive diagnostics and self-healing