Staff Site Reliability Engineer

Added
3 days ago
Type
Full time
Salary
Salary not provided

Related skills

gitops ansible terraform grafana prometheus

๐Ÿ“‹ Description

  • Design, build, and operate scalable infra across AWS and GCP.
  • Lead reliability initiatives, incl. ECS to EKS/GKE migrations.
  • Act as technical authority on Kubernetes (EKS/GKE), cloud infra (AWS/GCP), and CI/CD.
  • Partner with teams to enable microservices with production readiness.
  • Implement infrastructure as code with Terraform and Ansible.
  • Drive observability, performance, and cost improvements.

๐ŸŽฏ Requirements

  • 8+ years in SRE/DevOps/Infrastructure.
  • 3โ€“5 years Kubernetes (EKS/GKE) in production.
  • 3โ€“5 years AWS and GCP.
  • 3โ€“5 years Terraform for multi-cloud infra.
  • 5+ years coding in Python, Go, or similar.
  • Experience implementing SLOs/SLIs and RCAs.

๐ŸŽ Benefits

  • Well-being benefits and programs.
  • Social impact initiatives and community.
  • In-person onboarding to accelerate impact.
  • Global community across 20+ offices.
Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Engineering Jobs. Just set your preferences and Job Copilot will do the rest โ€” finding, filtering, and applying while you focus on what matters.

Related Engineering Jobs

See more Engineering jobs โ†’