About Us
At Cast & Crew, we’ve empowered creativity and supported the global entertainment industry for decades. Together with our family of brands (Backstage, CAPS, Checks & Balances, Final Draft, Media Services, Sargent-Disc, and The TEAM Companies), we operate as a combined entertainment technology and services provider offering industry-standard screenwriting and accounting software, digital payroll products, data & reporting, and a host of creative tools. The industry continues to move faster than ever, and the need for our expertise, our technology, and our people has never been greater. We are a production’s best ally every step of the way. #OneCastOneCrew
Position Overview
We are a mid-size entertainment company delivering captivating digital experiences to millions of customers worldwide. Our IT organization powers the infrastructure and systems behind our cutting-edge payroll and accounting applications. We are seeking a Senior Site Reliability Engineer (SRE) to enhance the performance, scalability, and reliability of our infrastructure and help bring our next-generation solutions to life.
As a Senior Site Reliability Engineer, you will ensure the reliability and scalability of our IT systems. You will leverage your skills in cloud technologies, infrastructure operations, Kubernetes orchestration, application development, database administration, and Oracle E-Business Suite (EBS) to maintain robust infrastructure that supports business-critical platforms. This role will also involve collaborating with cross-functional teams to implement engineering best practices, monitoring, and automation while exploring opportunities to enhance operations with emerging AI technologies.
Key Responsibilities
- Infrastructure as Code: Develop and maintain automated infrastructure provisioning with Terraform for hybrid cloud environments.
- Cloud Expertise: Design and manage robust multi-cloud environments using AWS and Azure, with a focus on optimizing Kubernetes clusters (EKS and AKS).
- Oracle E-Business Suite (EBS): Support, optimize, and ensure the reliability of Oracle EBS deployments, integrating them with other IT systems to maintain smooth business operations.
- Operating Systems Management: Administer and optimize Linux (RHEL) and Windows Server environments to ensure high availability and security.
- Application Performance: Collaborate with development teams to enhance applications built on React, Node.js, .NET, C#, and Java for reliability and performance.
- Networking & Security: Leverage advanced AWS networking skills to implement secure and scalable architectures, including VPC design, load balancing, and advanced routing.
- Database Optimization: Monitor and tune database performance and manage relational and NoSQL databases to support high-traffic entertainment services.
- Monitoring & Troubleshooting: Implement observability tools and proactively address performance issues using platforms like Prometheus, Grafana, Splunk, or CloudWatch.
- Incident Response & Automation: Lead incident management, postmortem reviews, and automation efforts to prevent recurrence and improve overall resilience.
- Cross-Team Collaboration: Work closely with developers, system administrators, and security teams to align infrastructure needs with business and technical goals.
Qualifications
Required Technical Skills
- Expert-level knowledge of Terraform for infrastructure automation.
- Hands-on experience managing Azure Kubernetes Service (AKS) and Amazon Elastic Kubernetes Service (EKS) clusters.
- Advanced knowledge of AWS and Azure cloud ecosystems, including networking, security, and cost optimization.
- Proficiency in Linux (RHEL) and Windows Server environments.
- Proven experience supporting and optimizing Oracle E-Business Suite (EBS) in a complex IT environment.
- Proven application development experience with React, Node.js, .NET, C#, and Java.
- Strong database administration and performance-tuning skills for both relational (e.g., MySQL, PostgreSQL, MSSQL) and NoSQL (e.g., DynamoDB, MongoDB) databases.
- Advanced networking skills, including VPC design, transit gateways, and hybrid cloud connectivity.
- Expertise in monitoring, logging, and troubleshooting tools such as New Relic, Prometheus, Grafana, Splunk, and CloudWatch.
Desired Soft Skills
- Strategic thinking to design scalable and reliable systems for high-demand entertainment platforms.
- Strong collaboration and mentorship abilities to guide teams in adopting SRE best practices.
- Excellent communication skills to work with technical and non-technical stakeholders.
- Adaptability to a fast-paced, dynamic environment.
Nice-to-Have Skills
- Experience with AI-powered Operations (AIOps) to automate troubleshooting and predictive maintenance.
- Experience in high-traffic or live-streaming applications.
- Certifications such as AWS Certified Solutions Architect or Azure Solutions Architect Expert.
- Familiarity with industry-specific compliance standards, e.g., SOC 2, GDPR.
Special Work Conditions
- Sedentary – Involves sitting most of the time but may involve walking or standing for brief periods of time. Some positions may entail exerting up to 15 lbs. of force occasionally and/or a negligible amount of force to lift, carry, push, or pull.
Benefits
Cast & Crew provides a comprehensive package of employee benefits, including medical, dental, vision, PTO, health and wellness programs, employee discounts, and more! Note: Cast & Crew benefits are subject to eligibility requirements.
Cast & Crew is an equal opportunity employer committed to hiring a diverse workforce and sustaining an inclusive culture. It is our policy to provide equal employment opportunities to all individuals based on job-related qualifications and ability to perform a job, without regard to age, gender, gender identity, sexual orientation, race, color, religion, creed, national origin, disability, genetic information, veteran status, citizenship or marital status, and to maintain a non-discriminatory environment free from intimidation, harassment or bias based upon these grounds.
CA residents
Your personal information may be collected in connection with certain services provided by Cast & Crew or its affiliated companies. A summary of your California privacy rights can be found at: https://www.castandcrew.com/privacy-policy/
Compensation is commensurate with various factors including, but not limited to, relevant experience, qualifications, skills, training, licensure, certifications, geographic cost of labor, and other business and organizational needs. Compensation range for candidates in other locations may differ based on the cost of labor in that location. The compensation range for this position is: $130,000.00 - $165,000.00 per year.