P

Site Reliability Engineer

Private Limited
Full-time
Remote
India, India

Job Description:
This team is responsible for managing email systems for various brands which cater to about 7 million mailboxes in total. These systems deliver over 25 million mails a day.
As a Site Reliability Engineer, you will work with a team of highly technical Site Reliability Engineers, Software engineers, System Administrators, Product Managers, and business leads to engineer great products, deliver five 9’s uptime, build reliable systems, tools &, automate processes. You will become an expert detective, diving into complex escalations involving technical challenges, engineering problems, customer connects, and platform growth concerns.

Key Responsibilities:
    •    Be responsible for downtime and maintain the product SLA.
    •    Participate in weekly oncall rotation, solving escalated tickets, resolving outages, and debugging production issues.
    •    Provide great customer service by responding to escalations and building self‐serving tools/methods for customers and support teams. 
    •    Work closely with various stakeholders like Engineering, Monitoring and Operations teams, NetOps &, business development teams.
    •    Strict adherence to automating routine tasks by scripting.
    •    Learning and continuous improvement - Devote time for learning innovative technology and practices.

Basic Skillset:
    •    Excellent knowledge of Linux internals & OS fundamentals like scheduler,  memory, storage, networking, Filesystems etc.
    •    OSI, TCP/IP & networking fundamentals
    •    Exposure to RDBMS like MySQL, PostgreSQL etc.
    •    Exposure to any configuration management tools like Puppet, Ansible, Chef etc
    •    Understanding of GIT concepts / terminologies.
    •    Shell  scripting
    •    Strong interpersonal communication skills (including listening, speaking, and writing) and ability to work well in a diverse, team-focused environment with other teams

Preferred Skillset:
    •    Understanding of DNS-to-Deployments and everything in between.
    •    Experience of managing large-scale infrastructure.
    •    Knowledge of HA-Proxy, Nginx, Heartbeat/KeepAlived, pacemaker etc.
    •    Prior experience of managing global DNS infrastructure and large-scale Email systems is a bonus.
    •    Exposure to RDBMS and NoSQL Databases like MySQL / PostgreSQL, Redis, Cassandra, etc.
    •    Understanding of virtualization and containerization. Working knowledge of Docker, KVM/Libvirt. Exposure to infrastructure orchestration platforms like Kubernetes, OpenShift, OpenStack, & GCP a plus.
    •    Has prior experience in managing mail servers running Postfix, Dovecot, Exim, Roundcube Webmail etc.
    •    Proficient in at least one scripting language like Python, Ruby, Golang, Perl, PowerShell, etc.
 

This Job Description includes the essential job functions required to perform the job described above, as well as additional duties and responsibilities. This Job Description is not an exhaustive list of all functions that the employee performing this job may be required to perform. The Company reserves the right to revise the Job Description at any time, and to require the employee to perform functions in addition to those listed above.