Senior Software Engineer, Machine Learning (ML Ops)

Added: 12 days ago
Type: Full time
Salary: not provided

Related skills

Java, AWS, Python, Kubernetes, Scala

📋 Description

  • Lead scalable cloud infra for ML workloads on AWS and GCP with GPUs/TPUs.
  • Architect CI/CD for ML models and platform services for fast, safe releases.
  • Own low-latency infra for real-time inference, including key-value and vector stores.
  • Define observability standards: model performance, drift, capacity, health metrics.
  • Participate in on-call rotation and incident RCA for ML infra.
  • Collaborate with data scientists to improve platform usability and ML workflows.

🎯 Requirements

  • BS or MS in CS/Engineering or related quantitative field.
  • 8+ years in DevOps/SRE/ML infra; 4+ years on large-scale ML.
  • Strong Python and/or Scala/Java for tooling.
  • Kubernetes on GCP (GKE) and AWS (EKS).
  • NoSQL/low-latency stores like Aerospike.
  • Terraform for IaC; CI/CD with Jenkins or GitLab Runner.

๐ŸŽ Benefits

  • Hybrid schedule: in-office Mon-Thu, Fridays remote.
  • Global benefits include healthcare, mental health, retirement options.
  • Paid time off and leave policies to support work-life needs.
  • Inclusive, collaborative Roku culture.