Amazon Web Services (AWS) logo

Senior Software Engineer - AI/ML, AWS Neuron Distributed Training

Amazon Web Services (AWS)

Job Overview

Amazon Web Services (AWS) is seeking a Senior Software Engineer to join the Machine Learning Applications (ML Apps) team, focusing on AWS Neuron for distributed training. This role involves building, delivering, and maintaining complex products that impact millions globally, designing fault-tolerant systems that operate at massive scale in the AWS Cloud.

Responsibilities

  • Lead the development of distributed training support in Pytorch and Tensorflow using XLA and the Neuron compiler and runtime stacks.
  • Tune ML models to ensure high performance and efficiency on AWS Trainium and Inferentia silicon and TRn1, Inf1 servers.
  • Collaborate with chip architects, compiler engineers, and runtime engineers to build and optimize distributed training solutions.

Qualifications

Basic Qualifications

  • 3+ years of non-internship professional software development experience.
  • Experience in design or architecture of new and existing systems.
  • Proficiency in programming with at least one software programming language.
  • Deep learning industry experience.

Preferred Qualifications

  • Experience with full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations.
  • Bachelor's degree in computer science or equivalent.
  • Expertise in Pytorch, Jax, Tensorflow, distributed libraries, and frameworks.
  • Experience in end-to-end model training.

About the Team

The ML Apps team at AWS Neuron works closely with various disciplines including silicon engineering, hardware design and verification, software, and operations. The team is dedicated to supporting new members, promoting knowledge sharing and mentorship, and is committed to providing a work environment that balances professional challenges with personal life.

Benefits

  • Flexible working hours to support work-life balance.
  • Opportunities for mentorship and career growth within the company.
  • Inclusive team culture with a focus on employee well-being.

Benefits
Extracted with AI

  • Flexible working hours
  • Work-life balance support
  • Mentorship and career growth opportunities

Similar jobs

Last update: 23 minutes ago

Amazon Web Services (AWS) logo
Amazon Web Services (AWS)

Senior Systems Engineer, Managed Operations

Join AWS as a Senior Systems Engineer in Berlin to lead operations for the European Sovereign Cloud, ensuring high-availability AWS services.

Nebius AI logo
Nebius AI

Senior Software Engineer (C++)

Join Nebius as a Senior Software Engineer (C++) to develop reliable cloud services in a hybrid work environment.

Nebius AI logo
Nebius AI

Senior Backend Engineer (Go)

Join Nebius as a Senior Backend Engineer (Go) to develop fault-tolerant cloud services in a hybrid work environment.

HeyJobs logo
HeyJobs

Senior Software Engineer - AWS, Python, Ruby on Rails

Join HeyJobs as a Senior Software Engineer to design scalable systems using AWS, Python, and Ruby on Rails in a dynamic team.

Aiven logo
Aiven

Staff Software Engineer

Join Aiven as a Staff Software Engineer to develop cloud operations platforms using open-source technologies. Hybrid work in Berlin.

Haufe Akademie logo
Haufe Akademie

Software Engineer AWS & TypeScript

Join Haufe Akademie as a Software Engineer specializing in AWS & TypeScript to develop innovative cloud solutions in Freiburg.

Basetime BV logo
Basetime BV

Senior Python Developer with AWS Experience

Join Basetime BV as a Senior Python Developer to develop and maintain AWS cloud solutions. Hybrid work, competitive salary, and growth opportunities.

DeepL logo
DeepL

Senior Backend Engineer C++

Join DeepL as a Senior Backend Engineer C++ to design and maintain scalable backend services using C++ and AI technologies.

Computer Futures logo
Computer Futures

Cloud Data Engineer

Seeking a Cloud Data Engineer with expertise in AWS, Python, and CI/CD for a hybrid role in Hannover. Join our dynamic team!

Together AI logo
Together AI

Senior Backend Engineer - Java, Rust, Go

Join Together AI as a Senior Backend Engineer in Amsterdam. Work with Java, Rust, and Go to build scalable backend systems.

Motius logo
Motius

Senior Backend Developer

Join Motius as a Senior Backend Developer to work on cutting-edge R&D projects using AWS, Docker, GraphQL, and more in a hybrid work environment.

9am logo
9am

Senior Solution Architect AWS (Freelance)

Seeking a Senior Solution Architect AWS for a freelance role in Frankfurt, Germany. Remote work available. Extensive AWS experience required.

Cere Network logo
Cere Network

Principal AI Engineer

Join Cere Network as a Principal AI Engineer to drive AI innovation in Web3. Requires 10+ years in AI/ML, NLP, and software development.

Aiven logo
Aiven

Senior Software Engineer - Python, Apache Kafka

Join Aiven as a Senior Software Engineer in Berlin, focusing on Python and Apache Kafka in a hybrid work environment.

Covestro logo
Covestro

Senior DevOps Engineer - Price & Deal Management

Join Covestro as a Senior DevOps Engineer to drive digital transformation in pricing and deal management with AWS, Docker, and Java expertise.

dataroots logo
dataroots

Expert Machine Learning Engineer

Join Dataroots as an Expert Machine Learning Engineer to design and deliver AI-powered solutions, focusing on machine learning models.

PostNord logo
PostNord

Senior Node.js Backend Developer

Join PostNord as a Senior Node.js Backend Developer to drive innovation in our online shipping tool. Work in an agile team with AWS, Node.js, and more.

Omnia Retail logo
Omnia Retail

Senior Software Developer - AWS, Java, Python, Scala

Join Omnia Retail as a Senior Software Developer, working with AWS, Java, Python, and Scala in a hybrid role in Frankfurt.

Applied Intuition logo
Applied Intuition

Software Engineer - Autonomous Driving

Join Applied Intuition as a Software Engineer in Munich to tackle autonomous driving challenges with top ADAS/AV programs.

CHEEX logo
CHEEX

Backend Developer with AWS and Spring Boot Experience

Join CHEEX as a Backend Developer in Berlin. Work with AWS, Spring Boot, and Angular in a progressive, inclusive environment.

VIAVI Solutions logo
VIAVI Solutions

Senior Software Engineer (C++, Python & Cloud)

Join VIAVI Solutions as a Senior Software Engineer specializing in C++, Python, and cloud technologies. Work in a hybrid environment in Berlin.

netgo logo
netgo

Senior Cloud DevOps Engineer

Join netgo as a Senior Cloud DevOps Engineer in Berlin. Work with Kubernetes, GitOps, and more in a dynamic team environment.

Nederlandse Spoorwegen logo
Nederlandse Spoorwegen

Senior Lead Fullstack Developer with Frontend Focus

Join NS as a Senior Lead Fullstack Developer focusing on frontend, working with Angular, AWS, and TypeScript in a dynamic e-commerce environment.

CARFAX Europe logo
CARFAX Europe

Senior DevOps Engineer

Join CARFAX Europe as a Senior DevOps Engineer to manage AWS infrastructure, develop CI/CD pipelines, and enhance system observability.