Senior Software Engineer - AI/ML, AWS Neuron Distributed Training

Amazon Web Services (AWS)

Job Overview

Amazon Web Services (AWS) is seeking a Senior Software Engineer to join the Machine Learning Applications (ML Apps) team, focusing on AWS Neuron for distributed training. This role involves building, delivering, and maintaining complex products that impact millions globally, designing fault-tolerant systems that operate at massive scale in the AWS Cloud.

Responsibilities

  • Lead the development of distributed training support in PyTorch and TensorFlow using XLA and the Neuron compiler and runtime stacks.
  • Tune ML models for high performance and efficiency on AWS Trainium and Inferentia silicon and on Trn1 and Inf1 servers.
  • Collaborate with chip architects, compiler engineers, and runtime engineers to build and optimize distributed training solutions.

Qualifications

Basic Qualifications

  • 3+ years of non-internship professional software development experience.
  • Experience in design or architecture of new and existing systems.
  • Proficiency in programming with at least one software programming language.
  • Deep learning industry experience.

Preferred Qualifications

  • Experience with full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations.
  • Bachelor's degree in computer science or equivalent.
  • Expertise in PyTorch, JAX, TensorFlow, distributed libraries, and frameworks.
  • Experience in end-to-end model training.

About the Team

The ML Apps team at AWS Neuron works closely with several disciplines, including silicon engineering, hardware design and verification, software, and operations. The team supports new members, promotes knowledge sharing and mentorship, and is committed to a work environment that balances professional challenges with personal life.

Benefits

  • Flexible working hours to support work-life balance.
  • Opportunities for mentorship and career growth within the company.
  • Inclusive team culture with a focus on employee well-being.
