About the Role
Wayve is seeking a Senior Machine Learning Performance Engineer to join our Machine Learning Platform team. This role is pivotal in optimizing large-scale training jobs as we aim to scale our models to the next order of magnitude. The Machine Learning Platform team is responsible for our GPU training infrastructure and the software abstractions around it. Your primary focus will be on improving training efficiency.
Key Responsibilities
- Maximize MFU: Work on maximizing the Machine Fractional Utilization (MFU) of our large-scale training jobs.
- Code Profiling: Profile and identify bottlenecks in training code to enhance performance.
- GPU Kernel Implementation: Implement GPU kernels to improve training throughput.
- Collaboration: Work closely with research teams to integrate and test training efficiency improvements.
- Cluster Management: Own and improve our GPU training clusters.
About You
Essential Qualifications:
- Over 5 years of experience in performance optimization or machine learning engineering.
- Proven experience in optimizing large-scale training jobs on GPU compute clusters.
- Experience working in platform teams and collaborating with research teams.
- Ability to report and track benchmarked performance over time in an open and accessible way.
- Proficiency in writing high-quality, well-structured, and tested Python code.
- BS or MS in Machine Learning, Computer Science, Engineering, or a related technical discipline, or equivalent experience.
Desirable Skills:
- Solid experience with concurrent, parallel, and distributed computing.
- Experience using Nvidia NSight Systems.
- Experience implementing GPU kernels.
- Knowledge of computing fundamentals, including what makes code fast, secure, and reliable.
Why Join Us?
At Wayve, we are committed to creating a diverse, fair, and respectful culture that is inclusive of everyone based on their unique skills and perspectives. We value diversity, embrace new perspectives, and foster an inclusive work environment. Join us to tackle today's most complex challenges and pave the way for a smarter, safer future.
Work Environment
This is a full-time role based in our office in Mountain View, California. We operate a hybrid working policy that combines time together in our offices and workshops to fuel innovation, culture, relationships, and learning, with time spent working from home. We operate core working hours so you can determine the schedule that works best for you and your team.
If you’re passionate about self-driving cars and think you have what it takes to make a positive impact on the world, we encourage you to apply.
Benefits Extracted with AI
- Hybrid working policy
- Core working hours
Similar jobs
Last update: 23 minutes ago
Machine Learning Engineer, Training
Join Waymo as a Machine Learning Engineer to develop distributed training infrastructure for autonomous driving.
Machine Learning Engineer - LLM Fine-tuning and Performance
Join NVIDIA as a Machine Learning Engineer specializing in LLM fine-tuning and performance optimization. Work with cutting-edge ML technologies.
ML Compiler Engineer, Compute
Join Waymo as an ML Compiler Engineer in NYC, enhancing neural network performance with cutting-edge technology.
Senior Deep Learning Performance Software Engineer
Senior role optimizing deep learning performance at NVIDIA, involving Python, HPC, and AI technologies.
Machine Learning Engineer
Join Vectara as a Machine Learning Engineer to work on advanced language understanding and neural methods.
Senior Software Engineer, Autonomous Driving Behaviors
Senior Software Engineer for Autonomous Driving Behaviors, focusing on simulation software development with ML expertise.
Senior Machine Learning Engineer - Scaling and Performance Optimization
Join InstaDeep as a Senior Machine Learning Engineer focusing on scaling and performance optimization in Paris. Work with cutting-edge AI technologies.
Principal Software Engineer, Storage & Compute
Lead software engineer role focusing on storage & compute infrastructure for autonomous driving tech at Waymo.
AI Engineer for Software-Defined Vehicles
AI Engineer role focusing on software-defined vehicles, requiring skills in AI, ML, and cloud technologies.
Software Engineer, Commercialization Engineering
Join Waymo as a Software Engineer in Commercialization Engineering, focusing on infrastructure for driverless cars.
Senior Software Engineer, Machine Learning Infrastructure
Join Scale AI as a Senior Software Engineer in Machine Learning Infrastructure, focusing on backend system design and ML Infrastructure.
Software Engineer - Autonomous Vehicle Technology
Join Waabi as a Software Engineer to develop cutting-edge autonomous vehicle technology. Remote work with competitive benefits.
Senior Deep Learning Engineer
Join NVIDIA as a Senior Deep Learning Engineer to optimize AI performance using PyTorch, TensorFlow, and more in Berlin.
Senior Artificial Intelligence Algorithms Engineer
Senior AI Algorithms Engineer role focusing on AI/DL, data analytics, and machine learning in Santa Clara, CA.
Senior Software Engineer - Machine Learning and Data Science
Join EyePop.ai as a Senior Software Engineer to develop and scale machine learning and data science software pipelines.
Senior AI Engineer
Join Zep AI as a Senior AI Engineer to lead LLM-based AI solutions development in a hybrid work environment.
Machine Learning Engineer
Join Ema Unlimited as a Machine Learning Engineer in SF Bay Area, working on cutting-edge AI solutions with a focus on NLP and ML technologies.
Machine Learning Engineer
Join Refuel as a Machine Learning Engineer to develop core ML algorithms, improve datasets, and collaborate on product scalability.
Senior Machine Learning Researcher
Join Lambda as a Senior Machine Learning Researcher to develop AI models and optimize ML workloads. Work in San Jose with flexible benefits.
Next-Gen Deep Learning Engineer
Join Zenseact as a Deep Learning Engineer in Munich. Work on cutting-edge AI for autonomous driving.
Senior AI Engineer
Join Poggio as a Senior AI Engineer to innovate AI systems for enterprise sales, focusing on AI capabilities and system performance.
Software Engineer, Machine Learning Infrastructure
Join Tesla as a Software Engineer in ML Infrastructure to optimize and scale neural network training with Python, C++, and PyTorch.
Machine Learning Software Engineer
Join Arm's Machine Learning Software team in Galway to develop high-performance ML applications on Arm hardware.
Senior Software Engineer - LLM
Join Bonfy.AI as a Senior Software Engineer to develop and optimize scalable machine learning models using Python, TensorFlow, and cloud platforms.