Team Lead Site Reliability Engineer
Job Description
REQUIREMENTS
Bachelor’s or Master’s degree in Computer Science, Engineering, or a related technical field.
At least 5 years of experience in Site Reliability Engineering, with 2+ years in a leadership or management role.
Proven, hands‑on expertise in Microsoft Azure, including designing, deploying, and managing cloud-native infrastructure.
Experience with container orchestration (e.g., Kubernetes) is required.
A deep understanding of network protocols, load balancing, and high availability configurations.
Experience in applying software development solutions to SRE and familiarity with programming languages such as (preferably) PowerShell and C# or else Python, Go, Java etc.Experience with automation tools, infrastructure as code (e.g., Terraform, Ansible).
Proficiency in monitoring and logging tools (e.g., Prometheus, Grafana, ELK Stack) and in implementing comprehensive monitoring solutions.
Dynatrace knowledge is a plus.
Excellent problem‑solving skills, with a proven ability to tackle complex issues under pressure.
Outstanding leadership qualities, with a track record of mentoring and developing high‑performing teams.
Exceptional communication and collaboration skills, capable of working effectively with cross‑functional teams.
RESPONSIBILITIES
- Lead SRE team, setting objectives and guiding reliability.
- Embed reliability best practices into development.
- Develop and implement SRE policies (SLOs, SLIs).
- Drive automation to reduce manual work and improve performance.
- Oversee incident management and root cause analysis.
Are you interested in this position?
Apply by clicking on the “Apply Now” button below!
#CrossChannelJobs #JobSearch
#CareerOpportunities #HiringNow
#Employment #JobOpenings
#JobSeekers
#FacebookLinkedIn