Applied AI Engineer, Site Reliability Engineer - EMEA

Related skills

golang ansible terraform grafana prometheus

📋 Description

  • Build and operate a fleet of Mistral platforms and apps.
  • Productize reliability; write runbooks; create SLO templates.
  • Run: operate Tier-1 customer environments; ensure SLOs.
  • Enable: provision, secure, and scale Applied AI solutions; automate.
  • Secure: own security ops; lead CVE response and SBOM controls.
  • Framework-first, fleet-management role; scale impact across accounts.

🎯 Requirements

  • Fluent in English.
  • 5+ years in SRE, Production Engineering, or DevOps, with a track record of building tooling.
  • Kubernetes fluency: multi-tenant, namespaces, network policy, RBAC.
  • On-call discipline: incident response, blameless post-mortems.
  • Observability stack: Prometheus, Grafana, OpenTelemetry, Loki, Tempo, SigNoz.
  • Infrastructure as code: Terraform, Ansible; Python and/or Golang.