InstaDeep logo

Senior Machine Learning Engineer - Scaling and Performance Optimization

InstaDeep

About the Team

Our team plays a pivotal role in enhancing the capabilities and efficiency of our advanced AI systems. We design solutions that enable our machine learning models to scale seamlessly and perform optimally in real-world applications and large-scale research. Collaborating across InstaDeep, we directly impact projects in diverse fields including Life Sciences, Logistics, Chip Design, and Quantum ML.

The Role

We seek a highly skilled Machine Learning Engineer with a passion for tackling the challenges of large-scale ML development. You'll play a vital role in making our ambitious AI solutions a practical reality. If you thrive on system-level analysis, find joy in squeezing every ounce of performance from hardware, and love diving deep into algorithm optimisation, this is the position for you.

Responsibilities

  • Scaling Expertise: Design and implement strategies to efficiently scale machine learning models across diverse hardware platforms (GPU/TPU).
  • Performance Optimization: Analyse and profile ML systems under heavy load, pinpointing bottlenecks, and implementing targeted optimizations.
  • Distributed Systems Architecture: Create robust distributed training and inference solutions for maximum computational efficiency.
  • Algorithmic Optimization: Research and understand the latest deep learning literature to implement and optimize state-of-the-art algorithms and architectures, ensuring compute efficiency and performance.
  • Low-Level Mastery: Write high-quality Python, C/C++, XLA, Pallas, Triton, and/or CUDA code to achieve performance breakthroughs.

Required Skills

  • Understanding of Linux systems, performance analysis tools, and hardware optimisation techniques.
  • Experience with distributed training frameworks (Ray, Dask, PyTorch Lightning, etc.).
  • Expertise with Python and/or C++.
  • Development with machine learning frameworks (JAX, Tensorflow, PyTorch etc.).
  • Passion for profiling, identifying bottlenecks, and delivering efficient solutions.

Highly Desirable

  • Track record of successfully scaling ML models.
  • Experience writing custom CUDA kernels or XLA operations.
  • Understanding of GPU/TPU architectures and their implications for efficient ML systems.
  • Fundamentals of modern Deep Learning.
  • Actively following ML trends and a desire to push boundaries.

Example Projects

  • Profile algorithm traces, identifying opportunities for custom XLA operations and CUDA kernel development.
  • Implement and apply SOTA architectures (MAMBA, Griffin, Hyena) to research and applied projects.
  • Adapt algorithms for large-scale distributed architectures across HPC clusters.
  • Employ memory-efficient techniques within models for increased parameter counts and longer context lengths.

What We Offer

  • Real-World Impact: Directly contribute to the performance and reach of our AI solutions.
  • Cutting-Edge Challenges: Tackle complex problems at the forefront of machine learning and large-scale system design.
  • Growth-Oriented Environment: Expand your expertise in a team of talented engineers dedicated to advancing ML scalability.

Benefits
Extracted with AI

  • Real-World Impact
  • Cutting-Edge Challenges
  • Growth-Oriented Environment

Similar jobs

Last update: 23 minutes ago

InstaDeep logo
InstaDeep

Senior DevOps Engineer

Join InstaDeep as a Senior DevOps Engineer in Paris, working at the intersection of machine learning and engineering.

Wayve logo
Wayve

Senior Machine Learning Performance Engineer

Join Wayve as a Senior Machine Learning Performance Engineer to optimize large-scale training jobs and improve GPU efficiency.

Adobe logo
Adobe

Senior Machine Learning Engineer

Join Adobe as a Senior Machine Learning Engineer to lead Gen AI projects, focusing on document cloud technologies.

Amazon logo
Amazon

Senior Software Engineer - Generative AI, AGI Inference Engine

Join Amazon as a Senior Software Engineer to advance Generative AI capabilities, focusing on high-performance inference.

Amazon logo
Amazon

Senior Software Engineer - Generative AI

Join Amazon as a Senior Software Engineer in Generative AI, focusing on high-performance inference capabilities.

Meero logo
Meero

Senior Computer Vision Engineer

Join Meero as a Senior Computer Vision Engineer in Paris. Work on AI-driven image processing for e-commerce with cutting-edge technologies.

Zenseact logo
Zenseact

Next-Gen Deep Learning Engineer

Join Zenseact as a Deep Learning Engineer in Munich. Work on cutting-edge AI for autonomous driving.

Accrete AI logo
Accrete AI

Backend Engineer with Machine Learning Focus

Join Accrete AI as a Backend Engineer with a focus on machine learning, building scalable AI solutions.

Adobe logo
Adobe

Senior Generative AI/Machine Learning Engineer and Data Scientist

Join Adobe as a Senior Generative AI/Machine Learning Engineer and Data Scientist to innovate with AI models.

OpenAI logo
OpenAI

Software Engineer, Applied Engineering

Join OpenAI as a Software Engineer in Applied Engineering to develop innovative AI products using JavaScript, React, and Python.

Resolve AI logo
Resolve AI

AI Engineer with LLM Expertise

Join Resolve AI as an AI Engineer in San Francisco to build AI-powered workflows with LLM expertise.

Keysight Technologies logo
Keysight Technologies

Machine Learning/AI Engineer

Join Keysight Technologies as a Machine Learning/AI Engineer to develop and optimize AI/ML models for EDA applications.

Ampere logo
Ampere

Senior Applied AI Model Researcher

Join Ampere as a Senior Applied AI Model Researcher to lead AI model development and optimization in a remote role.

Epidemic Sound logo
Epidemic Sound

Senior Machine Learning Engineer

Join Epidemic Sound as a Senior Machine Learning Engineer to enhance search and discovery with ML in Stockholm.

Bloomreach logo
Bloomreach

Senior Machine Learning Engineer

Join Bloomreach as a Senior Machine Learning Engineer to design and implement AI-driven components for personalized digital experiences.

BIP logo
BIP

AI Engineer

Join BIP as an AI Engineer in Milan, leveraging AI, ML, and data science to create scalable solutions.

Airbnb logo
Airbnb

Senior Machine Learning Engineer, Marketing Technology

Join Airbnb as a Senior Machine Learning Engineer to enhance personalized customer experiences using AI/ML.

Meta logo
Meta

Software Engineer, Generative AI

Join Meta as a Software Engineer in Generative AI, focusing on NLP and large language models. Work with a global team to innovate AI products.

Intuit logo
Intuit

Senior Machine Learning Engineer

Join Intuit as a Senior Machine Learning Engineer to develop and deploy scalable data science models.

Partoo logo
Partoo

Lead Data Engineer

Join Partoo as a Lead Data Engineer in Paris, managing data pipelines, AI projects, and a team, with a focus on innovation and data security.

Swift logo
Swift

Head of Data Science - AI

Lead AI development at Swift, driving innovation and business value in a hybrid role in Brussels.

GlobalLogic logo
GlobalLogic

Senior Machine Learning/Generative AI Engineer

Join GlobalLogic as a Senior ML/GenAI Engineer to develop and optimize AI chatbots using LLMs. Remote work available.

Adobe logo
Adobe

Senior Software Engineer (Machine Learning & Infrastructure)

Join Adobe's AI team as a Senior Software Engineer to enhance Acrobat AI Assistant features, focusing on backend infrastructure and machine learning.

Clay logo
Clay

Senior Machine Learning Engineer

Join Clay as a Senior Machine Learning Engineer to lead ML-powered features and build models in a hybrid role.