Senior SRE Tech Lead

New

Skills

ELK Grafana Helm Kubernetes Prometheus Terraform Vault

Job Overview

Seeking a Senior Site Reliability Engineer to architect highly available distributed systems, manage error budgets, and lead post-mortems.

Responsibilities
  • Architect globally distributed systems for performance and DR
  • Define and enforce SLOs/SLIs
  • Lead post-mortems
  • Participate in on-call rotations
  • Identify and automate manual operations
  • Design multi-layer monitoring using Prometheus, Grafana, ELK
  • Mentor team members globally
Requirements & Qualifications
  • 10+ years in high-traffic environments
  • Experience with Kubernetes and Helm/ArgoCD
  • Proficient in IaC (Terraform)
  • Hands-on experience with Consul, Vault, HAProxy
  • Experience with large-scale MTAs and Postfix
  • Proficiency in Go or Python
  • Good to have: Experience with NGFW and LDAP infrastructure

Job Type: Remote

Salary: Not Disclosed

Experience: Entry

Duration: 12 Months

Share this job:

Similar Jobs

Environment Automation Engineer

Posted 6 days ago

Building and scaling multi-tenant infrastructure using Terraform, Ansible, and Kubernetes.

Debugging production issues across various services and applications.

Ansible ELK Gitlab Go

Security Analyst

Posted 27 days ago

Deliver exceptional security support with advanced expertise and clear communication.

Serve as technical leader and mentor, guiding teammates through knowledge sharing.

Cybersecurity ELK Google Workspace Splunk

Systems Engineer - WordPress VIP

Posted 337 days ago

Recruiting Systems Engineers for WordPress VIP

Building and maintaining global infrastructure

Devops Docker ELK Engineer

WordPress VIP Systems Engineer

Posted 359 days ago

Develop and maintain scalable infrastructure.

Enhance system performance and fault tolerance.

Devops Docker ELK Engineer
overtime