Confluent logo

Staff Site Reliability Engineer - Incident Management (Remote - Canada)

Confluent
Full-time
Remote
Canada, Canada


Position at Infinitem Canada Ltd.


With Confluent, organizations can harness the full power of continuously flowing data to innovate and win in the modern digital world. We have a purpose that drives us to do better every day – we're creating an entirely new category within data infrastructure - data streaming. This technology will allow every organization to create experiences and use the power of data in ways that profoundly impact the way we all live. This impact is our purpose and drives us to do better every day.

One Confluent. One team. One Data Streaming Platform.

Data Connects Us.

About the Role:

Do you have a passion for data that can turn events into outcomes, enabling intelligent, real-time apps, and empowering teams and systems to be able to act on data instantly? Have you ever dreamt about the opportunity to work with key agencies of the public sector? Confluent's team of Site Reliability Engineers, will allow you to do just that by putting you in the driver seat to deliver highly performant, reliable systems that enable prominent public sector agencies to make real time decisions with their data to solve real time problems through Confluent Cloud. Confluent Cloud delivers a complete end-to-end streaming experience as a Software as a Service (SaaS) model.Β 

What You Will Do:

  • Partner with our Cloud Architecture and Engineering teams to build upon the operational resiliency of the Confluent Cloud systems
  • Collaborate broadly across teams to verify and deploy production changes to Confluent Cloud systems and infrastructure
  • Be an active partner with peer engineering teams, engaging during incidents and driving towards positive outcomes for our customers
  • Maintain critical monitoring used for triage and escalations in the federal space and improve upon automated recovery
  • Adhere to established incident and change management processes and help drive continuous improvementsΒ 
  • Strong writing and verbal skills, with experience in communicating with Enterprise Customers

What You Will Bring:

  • 10+ years of relevant experience
  • Expertise in Cloud Native technologies with experience operating production services in the cloud
  • Strong fundamentals of Distributed Systems and their design
  • Deep knowledge of Kubernetes and containerization
  • Experience with telemetry tooling to monitor production systems
  • Confidence with problem-solving and troubleshooting critical services
  • Proficiency with scripting and automation (e.g Go, Java, Python, Bash)
  • Working knowledge of infrastructure as code (e.g Terraform, Cloudformation, AWS CDK, Pulumi)
  • Exceptional teamwork, collaboration skills, and the ability to act critically with minimal supervision at times in a remote first environment
  • Experience with a rotating on-call schedule to provide 24/7 support
  • BS Degree in Computer Science, Engineering, or equivalent experience

Β 

Come As You Are

At Confluent, equality is a core tenet of our culture. We are committed to building an inclusive global team that represents a variety of backgrounds, perspectives, beliefs, and experiences. The more diverse we are, the richer our community and the broader our impact. Employment decisions are made on the basis of job-related criteria without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, veteran status, or any other classification protected by applicable law.

Click HERE to review our Candidate Privacy Notice which describes how and when Confluent, Inc., and its group companies, collects, uses, and shares certain personal information of California job applicants and prospective employees.
#LI-Remote