Job Description
Are you interested in advancing Amazon's Generative AI capabilities? Join a talented team of engineers and scientists in a highly collaborative and friendly environment. We are building state-of-the-art Generative AI technology that will benefit all Amazon businesses and customers.
Key Responsibilities
As a Software Development Engineer, you will be responsible for designing, developing, testing, and deploying high-performance inference capabilities. This includes support for multi-modality and state-of-the-art model architectures, as well as latency, throughput, and cost optimization. You will collaborate closely with a team of engineers and scientists to influence our overall strategy and define the team’s roadmap. You will drive system architecture, spearhead best practices, and mentor junior engineers.
A Day in the Life
- Read papers and consult with scientists to get inspiration from emerging techniques and integrate them into our roadmap.
- Design and experiment with new algorithms, and benchmark the latency and accuracy of your implementations.
- Implement production-grade solutions and oversee their deployment.
- Collaborate with other science and engineering teams to ensure proper execution.
- Maintain high standards in operational excellence and support production systems, constantly creating solutions to minimize operational load.
About The Team
Our mission is to build best-in-class, fast, accurate, and cost-efficient large language model inference solutions and infrastructure that will enable Amazon businesses to deliver more value to their customers.
Basic Qualifications
- 3+ years of non-internship professional software development experience.
- 2+ years of non-internship experience designing or architecting new and existing systems.
- Experience programming with at least one software programming language.
- Prior experience with software performance optimization or knowledge of Machine Learning and Deep Learning.
Preferred Qualifications
- 3+ years of full software development life cycle experience, including coding standards, code reviews, source control management, build processes, testing, and operations.
- Bachelor's degree in computer science or equivalent.
- Experience with Large Language Model inference.
- Experience with Trainium and Inferentia development.
- Experience with GPU programming (TensorRT-LLM).
- Experience with Python, PyTorch, and C++ programming and performance optimization.
Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status. For individuals with disabilities who would like to request an accommodation, please visit Amazon's disability accommodations page.
Compensation
Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $129,300/year in our lowest geographic market up to $223,600/year in our highest geographic market. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience. Amazon is a total compensation company. Dependent on the position offered, equity, sign-on payments, and other forms of compensation may be provided as part of a total compensation package, in addition to a full range of medical, financial, and/or other benefits.