Rev logo

Senior Site Reliability Engineer

Rev
Full-time
Remote
United States

There’s never been a more exciting time to be part of Rev.

Every role here plays a critical part in shaping the future of speech technology and empowering our customers to do more, faster. We didn’t disrupt the industry by playing it safe. We did it by embracing bold thinking, welcoming diverse perspectives, and giving our team the freedom and responsibility to innovate. At Rev, you won’t just have a seat at the table — you’ll help redesign it.

Come build what’s next with us 🚀


Senior Site Reliability Engineer

How this role will Serve, Own and Grow at Rev:

We’re looking for a Senior Site Reliability Engineer (SRE) to join our Platform Engineering team. As a key contributor, you’ll collaborate across Engineering, QA, and DevOps to design, scale, and optimize our cloud-based production infrastructure. This is a high-impact role for someone who thrives in startup environments and is excited to help shape the future of Rev’s platform as we scale.

This role is ideal for someone who is passionate about automation, observability, reliability, and continuous improvement—and who loves solving problems at scale.

🔍 Responsibilities:

  • Manage the infrastructure and observability of Rev’s cloud based applications

  • Design and maintain CI/CD pipelines to support scalable, testable deployments across services

  • Automate key development and infrastructure workflows to drive team velocity

  • Collaborate with engineers and QA to deliver reliable environments and robust tooling for evolving needs

  • Analyze infrastructure performance and implement data-driven optimizations

  • Enhance monitoring and alerting systems to detect anomalies and prevent service degradation

  • Help define and deliver on the DevOps roadmap in support of our fast-growing Engineering org

  • Contribute to and maintain internal tools like our custom chatbot (“Chopper”)

Qualifications:

  • Bachelor's or Master’s degree in Computer Science or a related technical field

  • 5+ years of experience in Site Reliability Engineering, DevOps, or Software Engineering

  • Strong experience managing cloud infrastructure in AWS using Terraform or other Infrastructure as Code tools

  • Experience managing containerized workloads in Kubernetes and EKS or similar cloud environments

  • Deep familiarity with building and optimizing CI/CD pipelines (for APIs, web apps, and data services)

  • Proficiency in configuring observability platforms (monitoring, alerting, logging) such as Grafana

  • Proactive and clear communication skills to collaborate with remote teams across time zones

  • Fluency across multiple languages, frameworks, and deployment environments

  • Ability to thrive in a fast-paced, evolving startup culture

Nice to have knowledge of:

  • Experience with the LGTM observability stack (Loki, Grafana, Tempo, Mimir)

  • Hands-on experience managing technologies like Redis, SQL Server, Elasticsearch

  • Prior experience supporting large, distributed teams or remote-first organizations

  • Experience defining CI pipelines in Jenkins and GitHub Actions

  • Experience with Spinnaker for deployments

#LI-Remote

Please note: If you're based in Austin, TX, this is a hybrid role with an expectation of being onsite at our office 1–2 days per week. Our office is located at 1717 W 6th St, Suite 310, Austin, TX 78703.