Senior Machine Learning Engineer - Scaling and Performance Optimization

InstaDeep

About the Team

Our team plays a pivotal role in enhancing the capabilities and efficiency of our advanced AI systems. We design solutions that enable our machine learning models to scale seamlessly and perform optimally in real-world applications and large-scale research. Collaborating across InstaDeep, we directly impact projects in diverse fields including Life Sciences, Logistics, Chip Design, and Quantum ML.

The Role

We seek a highly skilled Machine Learning Engineer with a passion for tackling the challenges of large-scale ML development. You'll play a vital role in making our ambitious AI solutions a practical reality. If you thrive on system-level analysis, find joy in squeezing every ounce of performance from hardware, and love diving deep into algorithm optimization, this is the position for you.

Responsibilities

  • Scaling Expertise: Design and implement strategies to efficiently scale machine learning models across diverse hardware platforms (GPU/TPU).
  • Performance Optimization: Analyze and profile ML systems under heavy load, pinpoint bottlenecks, and implement targeted optimizations.
  • Distributed Systems Architecture: Create robust distributed training and inference solutions for maximum computational efficiency.
  • Algorithmic Optimization: Research and understand the latest deep learning literature to implement and optimize state-of-the-art algorithms and architectures, ensuring compute efficiency and performance.
  • Low-Level Mastery: Write high-quality Python, C/C++, XLA, Pallas, Triton, and/or CUDA code to achieve performance breakthroughs.

Required Skills

  • Understanding of Linux systems, performance analysis tools, and hardware optimization techniques.
  • Experience with distributed training frameworks (Ray, Dask, PyTorch Lightning, etc.).
  • Expertise with Python and/or C++.
  • Development experience with machine learning frameworks (JAX, TensorFlow, PyTorch, etc.).
  • Passion for profiling, identifying bottlenecks, and delivering efficient solutions.

Highly Desirable

  • Track record of successfully scaling ML models.
  • Experience writing custom CUDA kernels or XLA operations.
  • Understanding of GPU/TPU architectures and their implications for efficient ML systems.
  • Solid grasp of the fundamentals of modern deep learning.
  • Active interest in ML trends and a desire to push boundaries.

Example Projects

  • Profile algorithm traces, identifying opportunities for custom XLA operations and CUDA kernel development.
  • Implement and apply state-of-the-art architectures (Mamba, Griffin, Hyena) to research and applied projects.
  • Adapt algorithms for large-scale distributed architectures across HPC clusters.
  • Employ memory-efficient techniques within models for increased parameter counts and longer context lengths.

What We Offer

  • Real-World Impact: Directly contribute to the performance and reach of our AI solutions.
  • Cutting-Edge Challenges: Tackle complex problems at the forefront of machine learning and large-scale system design.
  • Growth-Oriented Environment: Expand your expertise in a team of talented engineers dedicated to advancing ML scalability.

