About the Role
Wayve is seeking a Senior Machine Learning Performance Engineer to join our Machine Learning Platform team. This role is pivotal in optimizing large-scale training jobs as we aim to scale our models to the next order of magnitude. The Machine Learning Platform team is responsible for our GPU training infrastructure and the software abstractions around it. Your primary focus will be on improving training efficiency.
Key Responsibilities
- Maximize MFU: Maximize the Model FLOPs Utilization (MFU) of our large-scale training jobs.
- Code Profiling: Profile and identify bottlenecks in training code to enhance performance.
- GPU Kernel Implementation: Implement GPU kernels to improve training throughput.
- Collaboration: Work closely with research teams to integrate and test training efficiency improvements.
- Cluster Management: Own and improve our GPU training clusters.
About You
Essential Qualifications:
- Over 5 years of experience in performance optimization or machine learning engineering.
- Proven experience in optimizing large-scale training jobs on GPU compute clusters.
- Experience working in platform teams and collaborating with research teams.
- Ability to report and track benchmarked performance over time in an open and accessible way.
- Proficiency in writing high-quality, well-structured, and tested Python code.
- BS or MS in Machine Learning, Computer Science, Engineering, or a related technical discipline, or equivalent experience.
Desirable Skills:
- Solid experience with concurrent, parallel, and distributed computing.
- Experience using NVIDIA Nsight Systems.
- Experience implementing GPU kernels.
- Knowledge of computing fundamentals, including what makes code fast, secure, and reliable.
Why Join Us?
At Wayve, we are committed to creating a diverse, fair, and respectful culture that is inclusive of everyone based on their unique skills and perspectives. We value diversity, embrace new perspectives, and foster an inclusive work environment. Join us to tackle today's most complex challenges and pave the way for a smarter, safer future.
Work Environment
This is a full-time role based in our office in Mountain View, California. We operate a hybrid working policy that combines time together in our offices and workshops to fuel innovation, culture, relationships, and learning, with time spent working from home. We operate core working hours so you can determine the schedule that works best for you and your team.
If you’re passionate about self-driving cars and think you have what it takes to make a positive impact on the world, we encourage you to apply.