J

Senior Lead Software Engineer - AI/ML Platform Engineering

JPMorganChase
Full-time
On-site
Jersey City, New Jersey, United States
$171,000 - $260,000 USD yearly
Description

Be an integral part of an agile team that's constantly pushing the envelope to enhance, build, and deliver ML technology products.

As a Senior Lead Software Engineer at JPMorgan Chase within the Corporate Sector, AI/ML Technology, you play an integral role in an agile team that works to enhance, build, and deliver trusted market-leading technology products in a secure, stable, and scalable way. In this role you will drive significant business impact using your capabilities and contributions. You will apply your deep technical expertise and problem-solving methodologies to tackle a diverse array of challenges that span multiple technologies and applications.

Job responsibilities

  • Architects and implements distributed ML infrastructure, including inference, training, scheduling, orchestration, and storage.
  • Develops advanced monitoring and management tools for high reliability and scalability.
  • Optimizes system performance by identifying and resolving inefficiencies and bottlenecks.
  • Collaborates with product teams to deliver tailored, technology-driven solutions.
  • Drives decisions that influence the product design, application functionality, and technical operations and processes
  • Integrates Generative AI within the ML Platform using state-of-the-art techniques.
  • Adds to the team culture of diversity, equity, inclusion, and respect
  • Provides hands on experience with the ability to analyze, write, develop, test, and  release products using Python on AWS

 Required qualifications, capabilities, and skills

  • Formal training or certification on software engineering concepts and 5+ years applied experience
  • Deep expertise in AWS / Azure and Kubernetes ecosystem, including EKS, Helm, Custom Operators and Terraform.
  • Advanced in Python programming language
  • Background in High Performance Computing, ML Hardware Acceleration (e.g., GPU, TPU, RDMA), or ML for Systems.
  • Strong coding skills and experience in developing large-scale ML systems.
  • Extensive hands-on experience with ML frameworks (TensorFlow, PyTorch, JAX, scikit-learn).
  • Proven track record in contributing to and optimizing open-source ML frameworks.
  • Strategic thinker with the ability to craft and drive a technical vision for maximum business impact.
  • Demonstrated leadership in working effectively with engineers, data scientists, and ML practitioners.
  • Proven ability to identify trade-offs, clarify project ambiguities, and drive decision-making
  • Ability to tackle design and functionality problems independently with little to no oversigh

Preferred qualifications, capabilities, and skills 

  • Excellent problem-solving and analytical skills
  • Ability to work independently and in a team.
  • Passion for Innovations and continuous Learning
  • Experince with Java is a plus