Amazon Web Services (AWS) logo

Senior Software Engineer - AI/ML, AWS Neuron Distributed Training

Amazon Web Services (AWS)

Job Overview

Amazon Web Services (AWS) is seeking a Senior Software Engineer to join the Machine Learning Applications (ML Apps) team, focusing on AWS Neuron for distributed training. This role involves building, delivering, and maintaining complex products that impact millions globally, designing fault-tolerant systems that operate at massive scale in the AWS Cloud.

Responsibilities

  • Lead the development of distributed training support in Pytorch and Tensorflow using XLA and the Neuron compiler and runtime stacks.
  • Tune ML models to ensure high performance and efficiency on AWS Trainium and Inferentia silicon and TRn1, Inf1 servers.
  • Collaborate with chip architects, compiler engineers, and runtime engineers to build and optimize distributed training solutions.

Qualifications

Basic Qualifications

  • 3+ years of non-internship professional software development experience.
  • Experience in design or architecture of new and existing systems.
  • Proficiency in programming with at least one software programming language.
  • Deep learning industry experience.

Preferred Qualifications

  • Experience with full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations.
  • Bachelor's degree in computer science or equivalent.
  • Expertise in Pytorch, Jax, Tensorflow, distributed libraries, and frameworks.
  • Experience in end-to-end model training.

About the Team

The ML Apps team at AWS Neuron works closely with various disciplines including silicon engineering, hardware design and verification, software, and operations. The team is dedicated to supporting new members, promoting knowledge sharing and mentorship, and is committed to providing a work environment that balances professional challenges with personal life.

Benefits

  • Flexible working hours to support work-life balance.
  • Opportunities for mentorship and career growth within the company.
  • Inclusive team culture with a focus on employee well-being.

Benefits
Extracted with AI

  • Flexible working hours
  • Work-life balance support
  • Mentorship and career growth opportunities

Similar jobs

Last update: 23 minutes ago

Amazon Web Services (AWS) logo
Amazon Web Services (AWS)

Senior Machine Learning Engineer, AWS Neuron Apps

Senior ML Engineer needed for AWS Neuron Apps, focusing on ML Inference with expertise in Python, TensorFlow, and distributed computing.

Amazon Web Services (AWS) logo
Amazon Web Services (AWS)

Senior Software Development Manager, ML Acceleration

Lead the development of ML Inference features in a senior managerial role at AWS, focusing on distributed systems and ML frameworks.

Amazon logo
Amazon

Senior Software Engineer, Machine Learning Infrastructure

Join Amazon's Search team as a Senior Software Engineer in ML Infrastructure, focusing on large-scale distributed systems and deep learning.

Amazon Web Services (AWS) logo
Amazon Web Services (AWS)

Senior ML Engineer - Generative AI Innovation Center

Senior ML Engineer needed in Milan for AWS, focusing on generative AI innovations, requiring expertise in AI, ML, Python, and cloud technologies.

Amazon Web Services (AWS) logo
Amazon Web Services (AWS)

Senior Worldwide Specialist, GenAI Model Training & Inference

Join AWS as a Senior Specialist in GenAI Model Training & Inference, driving customer adoption and scaling workloads.

Amazon logo
Amazon

Senior Software Development Engineer, Applied AI

Join Amazon's Applied AI team as a Senior Software Development Engineer to innovate with AI technologies.

Amazon Web Services (AWS) logo
Amazon Web Services (AWS)

Software Development Engineer Intern - Machine Learning Chip Architect

Join AWS as a Software Development Engineer Intern focusing on Machine Learning Chip Architecture. Enhance your skills in a dynamic environment.

Lambda logo
Lambda

Senior Software Engineer - Cloud

Join Lambda as a Senior Software Engineer to build the world's best deep learning cloud using AWS, Python, and distributed systems.

Amazon logo
Amazon

Senior Machine Learning Engineer

Join Amazon as a Senior Machine Learning Engineer to build scalable AI/ML infrastructure and MLOps platforms.

Amazon Web Services (AWS) logo
Amazon Web Services (AWS)

Senior Software Development Engineer III, AWS IDEs

Join AWS as a Senior Software Development Engineer to build AI/ML tools for developers, enhancing cloud experiences.

Amazon Web Services (AWS) logo
Amazon Web Services (AWS)

Applied Scientist, Artificial General Intelligence

Join AWS as an Applied Scientist in Artificial General Intelligence, driving AI innovation in cloud computing.

Amazon Web Services (AWS) logo
Amazon Web Services (AWS)

Senior Applied Scientist, AWS Marketing AI/ML

Join AWS as a Senior Applied Scientist in Marketing AI/ML, leading personalization and targeting initiatives.

Amazon Web Services (AWS) logo
Amazon Web Services (AWS)

Deep Learning Architect, Generative AI Innovation Center

Join AWS as a Deep Learning Architect in Milan to innovate with Generative AI and transform business opportunities.

Amazon Web Services (AWS) logo
Amazon Web Services (AWS)

Senior Data Scientist, Generative AI Innovation Center

Join AWS as a Senior Data Scientist in Milan to innovate with Generative AI and solve real-world challenges.

Amazon Web Services (AWS) logo
Amazon Web Services (AWS)

Deep Learning Architect, AWS Generative AI Innovation Center

Join AWS as a Deep Learning Architect to innovate with Generative AI, solving real-world problems in a fast-paced environment.

Amazon Web Services (AWS) logo
Amazon Web Services (AWS)

Senior Deep Learning Architect, Generative AI

Join AWS as a Senior Deep Learning Architect to innovate with Generative AI and transform business opportunities.

Amazon Web Services (AWS) logo
Amazon Web Services (AWS)

Software Development Engineer, AWS Compute Services

Join AWS as a Software Development Engineer to innovate in serverless computing. Work on large-scale systems in Austin, Texas.

Amazon logo
Amazon

Senior Software Engineer - Generative AI, AGI Inference Engine

Join Amazon as a Senior Software Engineer to advance Generative AI capabilities, focusing on high-performance inference.

Amazon Web Services (AWS) logo
Amazon Web Services (AWS)

Software Development Engineer, AWS Training and Certifications

Join AWS as a Software Development Engineer to build learning systems for millions of users, focusing on performance, scalability, and innovation.

Amazon Web Services (AWS) logo
Amazon Web Services (AWS) logo
Amazon Web Services (AWS)

Senior Software Development Engineer, Amazon Connect Cases (AWS)

Join AWS as a Senior Software Development Engineer to lead impactful projects in cloud-based contact centers.

Amazon Web Services (AWS) logo
Amazon Web Services (AWS)

Senior Specialist, GenAI Frameworks at AWS

Senior Specialist in GenAI Frameworks at AWS, focusing on market development and customer engagement through innovative AI solutions.

Amazon Web Services (AWS) logo
Amazon Web Services (AWS)

Deep Learning Architect, Generative AI

Join AWS as a Deep Learning Architect in Rome to innovate with Generative AI and transform business opportunities.

Amazon logo
Amazon

Senior Software Engineer - Generative AI

Join Amazon as a Senior Software Engineer in Generative AI, focusing on high-performance inference capabilities.