Site Reliability Engineer

Starz Entertainment
Full-time
On-site
Greenwood Village, Colorado, United States
$105,000 - $130,000 USD yearly

Job Description

Starz is looking for a skilled, motivated, and hands-on Site Reliability Engineer. As an SRE, you will help ensure the reliability, scalability, and availability of our cloud-based systems. You’ll apply SRE principles to develop tooling, automate operational tasks, and work with cross-functional teams to deliver an outstanding customer experience.

Responsibilities

  • Develop, implement, and maintain tooling, dashboards, and alerting systems to identify and address reliability risks.
  • Monitor system performance and reliability metrics and recommend configuration and infrastructure improvements based on analysis and evolving business needs.
  • Develop and maintain runbooks to standardize incident response procedures, ensuring quick resolution and consistent handling of operational issues.
  • Actively participate in incident response to triage, mitigate, and resolve issues across various infrastructure layers, collaborating with infrastructure and development teams to minimize downtime. Document incident resolutions, postmortem analyses, and reliability improvements to drive knowledge sharing and future prevention.
  • Engage in predictive scaling and capacity planning exercises to ensure the infrastructure can handle content launches and scale to meet growing demand.
  • Identify and automate repetitive tasks to reduce operational toil, manual intervention, and human error.
  • Troubleshoot, manage, and optimize CI/CD pipelines to streamline deployments, improve automation, and ensure consistent and reliable code delivery.
  • Configure, manage, and optimize log forwarding systems to ensure centralized log collection, improve monitoring, and streamline troubleshooting across infrastructure and applications.
  • Iteratively improve our adherence to security best practices to minimize vulnerabilities and maintain operational reliability.
  • Continuously learn and stay up to date with the latest technology trends in SRE, proactively applying new knowledge to improve system reliability and operational efficiency.
  • Participate in on-call rotations, troubleshooting and resolving incidents to maintain system reliability and minimize service disruptions.

Qualifications & Skills

  • Strong written communication, interpersonal, and documentation skills.
  • Proactive problem-solving attitude and a desire for continuous learning and improvement.
  • Solid understanding of complex, large-scale, multi-region systems from a reliability perspective.
  • Strong programming and scripting skills in languages such as Python, Java, Bash, C / C# / C++, or similar.
  • Experience implementing observability and application monitoring tools such as Splunk, Datadog, or New Relic.
  • Knowledge of implementing and managing log aggregation tools such as Splunk, Sumo Logic, or Logstash.
  • Demonstrable experience with Cloud Computing platforms (AWS, GCP, Azure).
  • Experience with one or more container and orchestration technologies (Docker, Kubernetes).
  • Experience implementing and managing CI/CD pipelines with tools such as Bitbucket / Bamboo, GitLab, Jenkins, or similar.

Compensation

$105,000 - $130,000 USD yearly

Our Benefits

  • Full Coverage – Medical, Vision, and Dental
  • Annual discretionary bonus and merit increase
  • Work/Life Balance – generous sick days, vacation days, holidays, and wellness days
  • 401(k) company matching
  • Tuition Reimbursement (up to graduate degree)

About the Company

STARZ (www.starz.com), a Lionsgate company, is a leading media streaming platform committed to delivering premium content that amplifies narratives by, about and for women and underrepresented audiences. STARZ is home to the highly rated and first-of-its-kind STARZ app that offers the ability to stream or download STARZ premium content, as well as the flagship domestic STARZ® service, including STARZ ENCORE, 17 premium pay TV channels, and the associated on-demand and online services. STARZ is available across digital OTT platforms and multichannel video distributors, including cable operators, satellite television providers, and telecommunications companies. In February 2021, STARZ launched #TakeTheLead, a multi-faceted and innovative inclusion initiative expanding its existing efforts to improve representation on screen, behind the camera and throughout the company.

EEO Statement

Starz is an equal employment opportunity employer. All employees and applicants are evaluated on the basis of their qualifications, consistent with applicable state and federal laws. In addition, Starz will provide reasonable accommodations for qualified individuals with disabilities. Starz will consider for employment qualified applicants with criminal histories in a manner consistent with the requirements of applicable state and federal law.