Site Reliability Engineer

Added
15 days ago
Type
Full time
Salary
Salary not provided

Related skills

github datadog terraform kubernetes playwright

๐Ÿ“‹ Description

  • Support engineers by creating, maintaining, and improving observability and alerting tools.
  • Own the SLO framework; design and maintain SLI indicators for reliability.
  • Own incident management; define standards and coordinate high-severity incidents.
  • Develop and maintain tooling (Terraform modules, Go apps) to automate reliability.
  • Build and promote reporting on operational metrics and incidents.

๐ŸŽฏ Requirements

  • 1-5 years of experience in SRE, DevOps, or Software Eng.
  • Strong communication across multidisciplinary teams.
  • Observability tools expertise (Datadog); metrics, logging, tracing.
  • On-call troubleshooting in production; Kubernetes a plus.
  • English proficiency.
  • Familiarity with SLOs/SLIs.

๐ŸŽ Benefits

  • Hybrid role: 2-3 days in the office.
  • 4 extra weeks maternity/paternity leave.
  • 50% healthcare coverage.
  • Home office equipment allowance.
  • Minimum 25 days of annual leave.
  • 50% transportation paid.
Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Engineering Jobs. Just set your preferences and Job Copilot will do the rest โ€” finding, filtering, and applying while you focus on what matters.

Related Engineering Jobs

See more Engineering jobs โ†’