Fornia.dev
Site Reliability Engineer

Zoox

Foster City, full-time

Posted on: 6/18/2025

Required Skills:

AWS, Kubernetes, Terraform

Job Description:

Platform/Site Reliability Engineer

Zoox is looking for a platform/site reliability engineer who will be responsible for measuring and maintaining the uptime of the many services critical to the development process for autonomous vehicles. In this role, you will be heavily involved in all phases of rolling out a service, from designing systems that are fault-tolerant and easy to maintain, through deployment and operation, to continual improvement. Zoox is a robotics company, and our ethos of automation extends throughout the infrastructure components we build. Be prepared to work with systems handling large volumes of data and data-processing pipelines performing compute-intensive tasks on CPUs and GPUs.

In this role, you will:

  • Design and implement highly scalable and reliable systems to support Zoox's autonomous vehicle platform.
  • Optimize system performance, reliability, and scalability.
  • Develop and maintain monitoring, alerting, and reporting systems to ensure proactive identification and resolution of issues.
  • Collaborate with software engineering teams to improve deployment processes and automation.
  • Conduct root cause analysis of production issues and implement corrective actions.
  • Implement disaster recovery and business continuity plans.

Qualifications

  • 6+ years of experience in site reliability engineering or a similar role, with a strong background in working with large-scale distributed systems.
  • Proven experience with cloud platforms such as AWS, GCP, or Azure.
  • Expertise in container orchestration technologies like Kubernetes.
  • Deep understanding of networking, storage, and database technologies.
  • Strong programming skills in languages such as Python, Go, C/C++, or Java.
  • Experience with infrastructure-as-code tools such as Ansible, Salt, Terraform, or CloudFormation.

Preferred Qualifications

  • Experience in the automotive or autonomous vehicle industry.
  • Knowledge of security best practices and compliance requirements.
  • Previous experience in a leadership or mentorship role.