As Site Reliability Engineer, you will be responsible for deploying and maintaining diagnostic software across multiple centers, ensuring system reliability, performance, observability and security. You will collaborate with development and support teams, automate infrastructure using tools like Kubernetes, Ansible, and Terraform, and document IT solutions while managing the full product lifecycle.
Position is based in Paris and can be offered remotely.
Deploy diagnostic software products across multiple centers (hospitals or pathology laboratories), both on-premise and on the cloud.
Be familiar with a wide range of technologies (DevOps, GitOps, Open Source software) and infrastructure (from physical server via lower layers to Kubernetes architecture concepts & solutions)
Provide detailed specifications for the proposed IT solutions including hosting specifications, network flow matrix, RACI and security and you will document them.
Support the Customer Support team by providing primary operational support and engineering to centers at which products are deployed.
Run the production environment by monitoring availability and taking a holistic view of system health, and manage the full product lifecycle (including decommissioning).
Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding.
Partner with development teams to improve services through rigorous testing and release procedures and to create sustainable systems and services through automation and uplifts.
Provide operation support and engineering to centers for dataset import
Deploy data science environments with Sagemaker to meet the needs of data scientists.