Oracle Health Applications & Infrastructure (OHAI) is hiring in its OHAI Platform & Production Engineering organization!”
Are you a creative person who loves a challenge? Solve the complex puzzles you’ve been dreaming of as our Engineer. If you have a passion for innovation in tech, we want you on our team! Thrive in this crucial automation role. Oracle is a technology leader that’s changing how the world does business. We’re looking for an experienced and self-motivated person. We appreciate you taking the time to review the list of qualifications and to apply for the position.
Come and join us! Building off our Cloud momentum, Oracle has formed a new organization - Oracle Health. This team will focus on product deployment, sustainability, troubleshooting and product strategy for Oracle Health, while building out a complete platform supporting modernized, automated healthcare. This is a net new line of business, constructed with an entrepreneurial spirit that promotes an energetic and creative environment. We are unencumbered and will need your contribution to make it a world class engineering center with the focus on excellence.
As a Site Reliability DevOps Engineer, you will be responsible for defining and deploying key services with deep focus on architecture, production operations, capacity planning, performance management, deployment, and release engineering. You will work with multiple cross-functional teams helping deliver new and outstanding experiences to our collaborators while ensuring reliability and performance.
Responsibilities includes:• Take ownership of the architecture, analysis, design, implementation and production operations of a wide array of Core System Framework solutions• React to production deficiencies by continuously implementing automation, self-healing, and real-time monitoring to production systems• Be a strong contributor to supporting and development of platform services including architecture, provisioning, configuration, deployment, and support• Partner with the distributed team in prototyping new platform services• Stay informed of new technologies• Innovate• Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence• Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services• Develop designs, architectures, standards, and methods for large-scale distributed systems• Facilitate service capacity planning and demand forecasting, software performance analysis, and system tuning and performance.
Career Level -IC3 and IC4
Senior and Principal Level Engineers
Career Level - IC4
Key Requirements & Experience:
• U.S. Citizenship & Security Clearance: This role requires the ability to acquire and maintain a federal security clearance, which mandates U.S. citizenship.
• Large-Scale Systems Development & Operations: Experience designing, developing, and managing large-scale distributed services and applications.
• Containerization Expertise: Proficiency in container administration and development, with hands-on experience using Kubernetes, Docker, Mesos, or similar technologies.
• Infrastructure Automation: Expertise in automating infrastructure with tools like Terraform, Chef, Ansible, Puppet, or Packer to streamline operations.
• Cloud Orchestration & SRE Support: Knowledge in cloud orchestration frameworks and a strong background in Site Reliability Engineering (SRE) support for cloud-native systems.
• CI/CD Pipelines: Solid experience in continuous integration and continuous delivery (CI/CD) pipelines, including version control systems (Git, SVN, etc.), GitLab Runners, Jenkins, and Rundeck.
• Environment Support: Proven ability to work in production, test, and development environments, supporting medium to large-scale user bases.
• Automation Scripting: Proficient in scripting for software deployments and installations using PowerShell, Bash, or similar tools to enhance efficiency.
• Cloud Computing & Analytics: Deep understanding of cloud compute technologies, network monitoring, data processing, and analytics to support cloud-native solutions.
• Programming Expertise: Proficiency in modern programming languages such as Java, Python, or C++ (or equivalent) to build robust systems.
• High Availability & Scalability: Experience with fault-tolerant, highly available, high-throughput, distributed, and scalable systems.
• Cloud Service Operations: Hands-on experience operating services within major cloud platforms such as AWS, OCI, Azure, or similar, ensuring optimal performance and reliability.
#LI-ND1