Senior Site Reliability Engineer | 2025-020

StarRez
Full-time
Remote
United Kingdom

About StarRez

StarRez, Inc. is the leading student housing and property management platform in the world. Our cloud software solutions serve 1,300 institutions in 25 countries, with over 3 million beds. With a customer satisfaction score of 99%, many of the most prestigious universities, colleges and property managers across the globe rely on StarRez to transform their student residential experience. Along with the recent combination of Adirondack Solutions and RMS, this growing scale enables even greater opportunities to expand community value through our product capabilities and services. We provide opportunities for students and residents to Thrive!


The Role

Site Reliability Engineers at StarRez are responsible for ensuring the smooth operation of StarRez products and platforms. By applying software and systems engineering principles, they enhance system reliability while minimising manual intervention. SREs are expected to be experienced in software engineering principles, operational discipline, and automation.

 

As a Senior Site Reliability Engineer, you’ll join our Platforms teams, whose SREs and Platform Engineers are based in three regions and work in a “follow the sun” model to operate a multi-product, multi-region cloud platform.


Role Specifics

  • Work Location: Remote - United Kingdom
  • Travel: <5% (this percentage is an estimate and may vary up or down throughout the year based on business needs)
  • Reporting Structure: Reports to Lead Site Reliability Engineer 


What You Will Own

  • Provide technical leadership and mentoring within the team through knowledge sharing sessions, pair programming, code reviews and solution design
  • Identify and implement solutions to improve platform reliability, including the creation of mitigation strategies and operational playbooks
  • Implement and maintain monitoring/alerting/logging systems to identify and respond to incidents
  • Conduct/participate in Root Cause Analyses (RCAs) and blameless post-mortems
  • Participate in on-call rotations to ensure system reliability and rapid incident response
  • Ensure scalability and efficiency of cloud infrastructure and systems to handle traffic and data growth
  • Conduct performance tests to identify and remediate bottlenecks
  • Develop and maintain platform solutions, and automate infrastructure provisioning, configuration, and management tasks using Infrastructure as Code
  • Monitor, review and tune databases to ensure high availability and performance
  • Collaborate with product engineering teams to design/build fit-for-purpose and observable software
  • Contribute and collaborate across teams to define Service Level Indicators (SLIs), Service Level Objectives (SLOs) and Service Level Agreements (SLAs) as required


Required Qualifications

  • Bachelor's degree in Computer Science, Information Technology, or a similar field.
  • Proven experience (2+ years) in a Platform Engineering, Site Reliability Engineering or Software Engineering role.
  • Proficiency in at least one object-oriented programming language (C# preferred).
  • Production experience operating containerization technologies (Kubernetes).
  • Proficiency with one or more public cloud providers such as Azure, AWS or GCP.
  • Proficiency using Infrastructure as Code (IaC) tools such as Terraform (preferred), Ansible, or CloudFormation.
  • Proficiency in scripting and automation using languages like Bash, PowerShell or Python.
  • Experience with monitoring, observability and logging tools such as DataDog, Prometheus, Grafana, or similar.
  • Proven track record of maintaining highly-available and performant production environments.
  • Ability to identify and implement effective mitigation strategies and operational playbooks.


Preferred Qualifications

  • Experience in CI/CD tooling: Azure DevOps/GitHub Actions, Octopus Deploy
  • Relevant certifications in cloud platforms (e.g., Microsoft Certified: Azure Solutions Architect) and DevOps practices (e.g., Certified Kubernetes Administrator) are a plus
  • Experience in database management/performance tuning, particularly MSSQL


Reasons to join our Team:

  • Opportunity to be part of a well-established, high-performance company that has been in business for over 30 years
  • Full benefits including health care, paid time off, life insurance, and 401k plan with company match for eligible team members.
  • A supportive team environment with emphasis on learning and development opportunities
  • Our Promise: You will learn, grow, and be appreciated for your impact and contributions.
  • Z-Factor: Our most celebrated value, you will work with a team of caring, high-performing, and passionate people who have fun supporting our vision, innovation, and continuous improvement.


We are proud of our diverse workforce and are dedicated to creating a safe and welcoming environment for all employees. People from various ethnicities, ages, genders, and abilities are encouraged to apply.



Notice to external Recruiters and Recruitment Agencies:

StarRez will not accept unsolicited resumes from recruitment agencies, headhunters, or any other third parties for this role, whether submitted through this website or sent directly to any employee. StarRez and its subsidiaries will not pay fees to any third-party agency or company. In addition, we ask that you do not contact any employee regarding this position, or any other position, now or in the future.