Support engineers by creating, maintaining, and improving observability and alerting tools.
Own the SLO framework; design and maintain SLIs for reliability.
Own incident management; define standards and coordinate high-severity incidents.
Develop and maintain tooling (Terraform modules, Go apps) to automate reliability.
Build and promote reporting on operational metrics and incidents.

🎯 Requirements

1-5 years of experience in SRE, DevOps, or Software Engineering.
Strong communication across multidisciplinary teams.
Observability tools expertise (Datadog); metrics, logging, tracing.
On-call troubleshooting in production; Kubernetes a plus.
English proficiency.
Familiarity with SLOs/SLIs.

🎁 Benefits

Hybrid role: 2-3 days in the office.
4 extra weeks of maternity/paternity leave.
50% healthcare coverage.
Home office equipment allowance.
Minimum 25 days of annual leave.
50% of transportation costs paid.
