Position Description:
Support a mission office with infrastructure and related services for a data/ML environment. Duties will include implementation of infrastructure as code, CI/CD pipelines to automate deployment of new and updated elements of the data pipeline, performance monitoring, auto-scaling, and support for integration of new tools and capabilities. Elements of the environment are to be increasingly containerized and deployed on GPUs.
Must have:
- 5 years of experience and a bachelor's degree, or 10 years of experience
- Experience with Python, Linux and Bash
- Experience with Docker & Kubernetes for containerization
- Experience creating unit tests, documentation, and participating in code reviews
- Experience developing CI/CD pipelines to automate building, testing, and deployment across multiple environments
- Experience with Terraform or Ansible for infrastructure scripting
Desired:
- Experience with ELK stack
- Experience with managing Kubernetes deployments