D

DevOps Engineer - Infrastructure

Data Direct Networks
Full-time
On-site
United States

Overview

DDN Storage is seeking great candidates to join our dynamic team of passionate customer-enabling technologists!

 

This is an incredible opportunity to be part of a company that has been at the forefront of AI and high-performance data storage innovation for over two decades. DDN Storage is a global market leader renowned for powering many of the world's most demanding AI data centers, in industries ranging from life sciences and healthcare to financial services, autonomous cars, Government, academia, research and manufacturing.

 

"DDN's A3I solutions are transforming the landscape of AI infrastructure." – IDC

 

“The real differentiator is DDN. I never hesitate to recommend DDN. DDN is the de facto name for AI Storage in high performance environments” - ~ Marc Hamilton VP, Solutions Architecture & Engineering | NVIDIA

 

DDN Storage is the global leader in AI and multi-cloud data management at scale. Our cutting-edge storage and data management solutions are designed to accelerate AI workloads, enabling organizations to extract maximum value from their data. With a proven track record of performance, reliability, and scalability, DDN Storage empowers businesses to tackle the most challenging AI and data-intensive workloads with confidence.

 

Our success is driven by our unwavering commitment to innovation, customer-centricity, and a team of passionate professionals who bring their expertise and dedication to every project. This is a chance to make a significant impact at a company that is shaping the future of AI and data management.

 

Our commitment to innovation, customer success, and market leadership makes this an exciting and rewarding role for a driven professional looking to make a lasting impact in the world of AI and data storage.

Job Description

We build the largest scale-out storage and Data management products in support of the world's largest GPU supercomputing clusters, designed for both AI training and serving production models. We are committed to implementing Infrastructure as Code (IaC) best practices, enhancing deployment pipelines, and ensuring robust, secure service delivery across our production environments. Our work spans across on-premise clusters and major cloud providers.

 

DevOps Engineer - Infrastructure (Remote location, Europe or U.S.)

 

Key Responsibilities:

  • Manage and operate extensive scale-out storage systems that support GPU supercomputing clusters.
  • Implement and maintain Infrastructure as Code (IaC) practices.
  • Enhance development as well as deployment pipelines for efficient, secure service delivery.
  • Collaborate on both on-premise and cloud infrastructure.
  • Develop and enforce security best practices for internal and external environments.

Tech Stack:

  • Kubernetes
  • Ansible
  • Pulumi
  • Golang and Python

Ideal Candidate Profile:

  • Proficient in writing scalable and highly available containerized applications using Golang
  • Experienced in managing compute fleets using Pulumi, Terraform, Ansible, or similar stateful automation tools.
  • Strong understanding of IaC best practices and deployment pipeline enhancements.
  • Familiar with security best practices for both internal researchers and live external traffic.

Preferred Qualifications:

  • Hands-on experience with large-scale storage systems.
  • Expertise in Kubernetes for container orchestration.
  • Proven ability to work with both on-premise and cloud-based infrastructure.
  • Knowledge of modern security practices and protocols.

DDN

Our team is highly motivated and focused on engineering excellence.

We look for individuals who appreciate challenging themselves and thrive on curiosity.

Engineers are encouraged to work across multiple areas of the company.

We operate with a flat organizational structure.

All employees are expected to be hands-on and to contribute directly to the company’s mission.

Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important.

All engineers and researchers are expected to have strong communication skills.

They should be able to concisely and accurately share knowledge with their teammates.

 

Interview Process:

 

After submitting your application, the team reviews your CV and statement of exceptional work. If your application passes this stage, you will be invited to a 30-minute interview (“phone interview”) during which a member of our team will ask some basic questions. If you clear the initial phone interview, you will enter the main process, which consists of four technical interviews:

 

  • Coding assessment in a language of your choice.
  • Systems design: Translate high-level requirements into a scalable, fault-tolerant service.
  • Systems hands-on: Demonstrate practical skills in a live problem-solving session.
  • Project deep-dive: Present your past exceptional work to a small audience.
  • Meet and greet with the wider team.
  • Our goal is to finish the main process within one week.
  • We don’t rely on recruiters for assessments.
  • Every application is reviewed by a member of our technical team.

 

DataDirect Networks, Inc. is an Equal Opportunity/Affirmative Action employer.  All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity, gender expression, transgender, sex stereotyping, sexual orientation, national origin, disability, protected Veteran Status, or any other characteristic protected by applicable federal, state, or local law.