The key responsibilities include:
• Be responsible for the planning, implementation, and growth of the AWS cloud infrastructure
• Understand the current AWS infrastructure and suggest changes to it; develop and advocate for
operations best practices, standards, and processes
• Define and document best practices and strategies regarding infrastructure deployment and maintenance
• Build, release, and manage the configuration of all production systems and manage a continuous integration
and deployment methodology for server-based and serverless technologies
• Work alongside architecture and engineering teams to design and implement scalable software
services and to automate the most common tasks
• Ensure necessary system security by using best-in-class network and cloud security solutions
• Stay current with new technology options and vendor products, evaluate which ones would be the best fit,
and recommend process and architecture improvements
• Implement continuous integration/continuous delivery (CI/CD) pipelines when necessary
• Troubleshoot the system and solve problems across all platform and application domains
• Support pre-production acceptance testing to ensure the high quality of services and products
• Actively participate in building out new environments for existing and new applications as required.
• Build and maintain scripts to support automation of build and release tasks.
• Solve complex installation and integration challenges while documenting solutions, analysis, and
alternatives, and help develop strategies for zero-downtime deployments and patching.
• Support build and branching strategies in coordination with the development teams, with the working
knowledge needed to resolve merge conflicts.
• Implement and maintain off-premises and cloud-based infrastructure, security access policies, and
automated provisioning.
• Support monitoring solutions to ensure application performance and availability goals are met
• Maintain systems by applying patches, software updates, and upgrades, and by resolving security vulnerabilities
• Participate in the on-call support rotation and take ownership of escalated problem resolution
General Requirements:
• BSc in Computer Science or equivalent (BTech in Computer Science candidates may also apply).
• Typically 2-4 years of experience in systems engineering, reliability engineering, and/or DevOps,
including at least 2 years with cloud systems such as AWS
• Good knowledge of the AWS cloud platform is a must; AWS certifications are a plus
• Proven track record of building and supporting distributed teams, architecture, and infrastructure (cloud,
on-premises, off-premises)
• Experience with container technologies such as Docker, Kubernetes, and EKS
• Understanding of security concepts such as firewalls, IPS, IDS, VPN, and MFA
• Working knowledge of DevOps concepts and tools, and experience with continuous integration and
continuous delivery systems such as Jenkins (including installation, maintenance, provisioning, and
administration)
• Strong experience with monitoring tools such as Zabbix, the ELK stack, or similar
• Experience automating software build, test, and deployment pipelines following agile methodologies.
• Experience safely automating deployments of cloud infrastructure and services.
• Working knowledge of core TCP/IP networking and web services.
• Excellent communication skills
• Good background in Linux/Unix administration, including writing PowerShell scripts, and sound scripting
knowledge in Python
• Strong understanding of security best practices and the ability to troubleshoot distributed systems.