Added
1 day ago
Type
Full time
Salary
Salary not provided

Related skills

cloud linux python go incident response

๐Ÿ“‹ Description

  • Partner with Ads Engineering to improve reliability, scalability, and ops.
  • Build infra, tooling, and automation to boost reliability and productivity.
  • Improve observability with monitoring, alerting, tracing, and dashboards.
  • Participate in on-call rotations and lead incident response.
  • Run root cause analysis and drive corrective actions after incidents.
  • Collaborate with software engineers through the service lifecycle.

๐ŸŽฏ Requirements

  • 5+ years in SRE/Infrastructure on large-scale distributed systems.
  • Strong experience supporting high-traffic, user-facing production environments.
  • Distributed systems, networking, Linux, and cloud-native architectures.
  • Go and Python (or similar) programming skills.
  • Troubleshooting across apps, infrastructure, networking, and services.
  • Observability platforms, monitoring, alerting, and incident response.

๐ŸŽ Benefits

  • Global benefits for lifestyle, development, caregiving.
  • Family Planning Support
  • Gender-Affirming Care
  • Mental Health & Coaching Benefits
  • Group pension with employer match
  • Private Medical and Dental Scheme
Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Engineering Jobs. Just set your preferences and Job Copilot will do the rest โ€” finding, filtering, and applying while you focus on what matters.

Related Engineering Jobs

See more Engineering jobs โ†’