About the Role
Wayve is seeking a Senior Machine Learning Performance Engineer to join our Machine Learning Platform team. This role is pivotal in optimizing large-scale training jobs as we aim to scale our models to the next order of magnitude. The Machine Learning Platform team is responsible for our GPU training infrastructure and the software abstractions around it. Your primary focus will be on improving training efficiency.
Key Responsibilities
- Maximize MFU: Maximize the Model FLOPs Utilization (MFU) of our large-scale training jobs.
- Code Profiling: Profile and identify bottlenecks in training code to enhance performance.
- GPU Kernel Implementation: Implement GPU kernels to improve training throughput.
- Collaboration: Work closely with research teams to integrate and test training efficiency improvements.
- Cluster Management: Own and improve our GPU training clusters.
About You
Essential Qualifications:
- Over 5 years of experience in performance optimization or machine learning engineering.
- Proven experience in optimizing large-scale training jobs on GPU compute clusters.
- Experience working in platform teams and collaborating with research teams.
- Ability to report and track benchmarked performance over time in an open and accessible way.
- Proficiency in writing high-quality, well-structured, and tested Python code.
- BS or MS in Machine Learning, Computer Science, Engineering, or a related technical discipline, or equivalent experience.
Desirable Skills:
- Solid experience with concurrent, parallel, and distributed computing.
- Experience using NVIDIA Nsight Systems.
- Experience implementing GPU kernels.
- Knowledge of computing fundamentals, including what makes code fast, secure, and reliable.
Why Join Us?
At Wayve, we are committed to creating a diverse, fair, and respectful culture that is inclusive of everyone based on their unique skills and perspectives. We value diversity, embrace new perspectives, and foster an inclusive work environment. Join us to tackle today's most complex challenges and pave the way for a smarter, safer future.
Work Environment
This is a full-time role based in our office in Mountain View, California. We operate a hybrid working policy that combines time together in our offices and workshops to fuel innovation, culture, relationships, and learning, with time spent working from home. We operate core working hours so you can determine the schedule that works best for you and your team.
If you’re passionate about self-driving cars and think you have what it takes to make a positive impact on the world, we encourage you to apply.