Senior Site Reliability Engineer | 2025-020

StarRez

Full-time

Remote

United Kingdom, United Kingdom

About StarRez

StarRez, Inc. is the leading student housing and property management platform in the world. Our cloud software solutions serve 1,300 institutions, in 25 countries, with over 3 million beds. With a customer satisfaction score of 99%, many of the most prestigious Universities, Colleges and Property Managers across the globe rely on StarRez to transform their student residential experience. Along with the recent combination of Adirondack Solutions and RMS, this growing scale enables even greater opportunities to expand community value through our product capabilities and services. We provide opportunities for students and residents to Thrive!

The Role

Site Reliability Engineers at StarRez are responsible for ensuring the smooth operation of StarRez products and platforms. By applying software and systems engineering principles, they enhance system reliability while minimising manual intervention. SREs are expected to be experienced in software engineering principles, operational discipline, and automation.

As a Senior Site Reliability Engineer, you’ll be joining our Platforms teams with SRE and Platform Engineers based out of three regions in a “follow the sun” model to operate a multi-product/multi-region cloud platform.

Role Specifics

Work Location: Remote - United Kingdom
Travel: <5% [The percent of travel is an estimation, and it could vary up or down based on business needs throughout the year.]
Reporting Structure: Reports to Lead Site Reliability Engineer

What You Will Own

Provide technical leadership and mentoring within the team through knowledge sharing sessions, pair programming, code reviews and solution design
Identify and implement solutions to improve platform reliability, including the creation of mitigation strategies and operational playbooks.
Implement and maintain monitoring/alerting/logging systems to identify and respond to incidents
Conduct/participate in Root Cause Analyses (RCAs) and blameless post-mortems
Participate in on-call rotations to ensure system reliability and rapid incident response.
Ensure scalability and efficiency of cloud infrastructure and systems to handle traffic and data growth
Conduct performance tests to identify and remediate bottlenecks
Develop and maintain platform solutions, automate infrastructure provisioning, configuration, and management tasks using Infrastructure as Code.
Monitor, review and tune databases to ensure high availability and performance
Collaborate with product engineering teams to design/build fit-for-purpose and observable software
Contribute and collaborate across teams to define Service Level Indicators (SLIs), Service Level Objectives (SLOs) and Service Level Agreements (SLAs) as required

Required Qualifications

Bachelor's degree in Computer Science, Information Technology, or similar
Proven experience (2+ Years) in a Platform Engineering, Site Reliability Engineering or Software Engineering role.
Proficiency in at least one (or more) object-oriented programming language (C# preferable)
Production experience operating containerization technologies (Kubernetes).
Proficiency with one or more public cloud providers such as Azure, AWS or GCP
Proficiency using Infrastructure as Code (IaC) tools such as Terraform (preferred), Ansible, or CloudFormation.
Proficiency in scripting and automation using languages like Bash, PowerShell or Python.
Experience with monitoring, observability and logging tools such as DataDog, Prometheus, Grafana, or similar.
Proven track record of maintaining highly-available and performant production environments.
Ability to identify and implement effective mitigation strategies and operational playbooks.

Preferred Qualifications

Experience in CI/CD tooling: Azure DevOps/GitHub Actions, Octopus Deploy
Relevant certifications in cloud platforms (e.g., Microsoft Certified: Azure Solutions Architect) and DevOps practices (e.g., Certified Kubernetes Administrator) are a plus
Experience in database management/performance tuning, particularly MSSQL.

Reasons to join our Team:

Opportunity to be a part of a well-established, high-performance company that has been in business for over 30+ years
Full benefits including health care, paid time off, life insurance, and 401k plan with company match for eligible team members.
A supportive team environment with emphasis on learning and development opportunities
Our Promise: You will learn, grow, and be appreciated for your impact and contributions.
Z-Factor: Our most celebrated value, you will work with a team of caring, high-performing, and passionate people who have fun supporting our vision, innovation, and continuous improvement.

We are proud of our diverse workforce and are dedicated to creating a safe and welcoming environment for all employees. People from various ethnicities, ages, genders, and abilities are encouraged to apply.

Notice to external Recruiters and Recruitment Agencies:

StarRez will not accept unsolicited resumes from recruitment agencies, headhunters, or any other third parties for this role through this website or directly to any employee. StarRez and any of our subsidiaries will not pay fees to any third-party agency or company. In addition, we ask that you do not reach out to any employee with regards to this position, or any other positions, now, or in the future.

Apply now

Share this job

Twitter Facebook Linkedin Email