B

DevOps Engineer

BioVid
Full-time
Remote
United States

DevOps Engineer / Cloud Engineer 


As a DevOps Engineer / Cloud Engineer, you will play a crucial role in designing, deploying, and maintaining cloud-based infrastructure to support AI applications. You will work closely with software engineers, data scientists, and security teams to ensure the reliability, scalability, and security of the AI platform. This role requires expertise in cloud platforms, automation, and best practices for DevOps and security. 


Responsibilities: 

Cloud Infrastructure Management 

  • Design, deploy, and manage cloud infrastructure on GCP ensuring high availability and cost efficiency. 
  • Configure and manage compute, storage, networking, and security components. 
  • Optimize cloud resources to balance performance and cost-effectiveness. 
  • Implement Infrastructure as Code (IaC) using Terraform, CloudFormation, or similar tools. 
  • Maintain access control, identity management, and security policies in cloud environments. 

Deployment Automation & CI/CD Pipelines 

  • Design and implement CI/CD pipelines to automate the deployment of applications and AI models. 
  • Maintain version control and automated rollback strategies to ensure zero-downtime deployments. 
  • Integrate testing, security scanning, and performance monitoring within CI/CD pipelines. 
  • Work with software and ML engineers to streamline model deployment and retraining workflows. 

Monitoring, Alerting & Incident Response 

  • Set up monitoring systems to track application performance, model accuracy, and infrastructure health using tools like Prometheus, Grafana, or the ELK stack. 
  • Implement automated alerting and anomaly detection to proactively identify issues. 
  • Develop incident response playbooks and troubleshoot system failures. 

Scalability & Reliability 

  • Architect scalable cloud solutions to handle high-traffic loads and AI workloads. 
  • Implement load balancing, auto-scaling, and caching mechanisms for optimal performance. 
  • Optimize database and storage solutions for efficient data processing and retrieval. 
  • Conduct capacity planning and stress testing to prevent bottlenecks and failures. 

Security & Compliance 

  • Implement cloud security best practices, including encryption, firewalls, and IAM policies. 
  • Ensure compliance with industry standards such as SOC 2, GDPR, HIPAA, or ISO 27001. 
  • Perform regular security audits, vulnerability assessments, and risk mitigation strategies. 
  • Manage backups and disaster recovery plans to ensure data integrity and business continuity. 

Required Skills & Experience: 

  • 3+ years of experience in a DevOps, Cloud Engineering, or Site Reliability Engineering (SRE) role. 
  • Strong expertise in GCP  
  • Hands-on experience with Infrastructure as Code (IaC) tools such as Terraform, CloudFormation, or Pulumi. 
  • Proficiency in containerization and orchestration using Docker and Kubernetes. 
  • Experience implementing and maintaining CI/CD pipelines with tools like Jenkins, GitLab CI, or CircleCI. 
  • Familiarity with monitoring and logging tools (Prometheus, Grafana, ELK stack, Datadog, New Relic). 
  • Strong knowledge of networking, security, and cloud architecture best practices. 
  • Programming or scripting experience in Python, Bash, or Go for automation and tooling. 
  • Experience with database and storage solutions in the cloud (e.g., PostgreSQL, DynamoDB, S3, BigQuery). 
  • Knowledge of machine learning model deployment and MLOps best practices is a plus. 

BioVid is an Equal Employment Opportunity Employer. We provide equal opportunity in all of our employment practices to all qualified employees and applicants without regard to race, color, religion, gender, national origin, age, disability, marital status, military status, genetic information or any other category protected by federal, state and local laws. This policy applies to all aspects of the employment relationship, including recruitment, hiring, compensation, promotion, transfer, disciplinary action, layoff, return from layoff, training, and social, and recreational programs. All such employment decisions will be made without unlawfully discriminating on any prohibited basis