InstaDeep logo

Senior Machine Learning Engineer - Scaling and Performance Optimization

InstaDeep

About the Team

Our team plays a pivotal role in enhancing the capabilities and efficiency of our advanced AI systems. We design solutions that enable our machine learning models to scale seamlessly and perform optimally in real-world applications and large-scale research. Collaborating across InstaDeep, we directly impact projects in diverse fields including Life Sciences, Logistics, Chip Design, and Quantum ML.

The Role

We seek a highly skilled Machine Learning Engineer with a passion for tackling the challenges of large-scale ML development. You'll play a vital role in making our ambitious AI solutions a practical reality. If you thrive on system-level analysis, find joy in squeezing every ounce of performance from hardware, and love diving deep into algorithm optimisation, this is the position for you.

Responsibilities

  • Scaling Expertise: Design and implement strategies to efficiently scale machine learning models across diverse hardware platforms (GPU/TPU).
  • Performance Optimization: Analyse and profile ML systems under heavy load, pinpointing bottlenecks, and implementing targeted optimizations.
  • Distributed Systems Architecture: Create robust distributed training and inference solutions for maximum computational efficiency.
  • Algorithmic Optimization: Research and understand the latest deep learning literature to implement and optimize state-of-the-art algorithms and architectures, ensuring compute efficiency and performance.
  • Low-Level Mastery: Write high-quality Python, C/C++, XLA, Pallas, Triton, and/or CUDA code to achieve performance breakthroughs.

Required Skills

  • Understanding of Linux systems, performance analysis tools, and hardware optimisation techniques.
  • Experience with distributed training frameworks (Ray, Dask, PyTorch Lightning, etc.).
  • Expertise with Python and/or C++.
  • Development with machine learning frameworks (JAX, Tensorflow, PyTorch etc.).
  • Passion for profiling, identifying bottlenecks, and delivering efficient solutions.

Highly Desirable

  • Track record of successfully scaling ML models.
  • Experience writing custom CUDA kernels or XLA operations.
  • Understanding of GPU/TPU architectures and their implications for efficient ML systems.
  • Fundamentals of modern Deep Learning.
  • Actively following ML trends and a desire to push boundaries.

Example Projects

  • Profile algorithm traces, identifying opportunities for custom XLA operations and CUDA kernel development.
  • Implement and apply SOTA architectures (MAMBA, Griffin, Hyena) to research and applied projects.
  • Adapt algorithms for large-scale distributed architectures across HPC clusters.
  • Employ memory-efficient techniques within models for increased parameter counts and longer context lengths.

What We Offer

  • Real-World Impact: Directly contribute to the performance and reach of our AI solutions.
  • Cutting-Edge Challenges: Tackle complex problems at the forefront of machine learning and large-scale system design.
  • Growth-Oriented Environment: Expand your expertise in a team of talented engineers dedicated to advancing ML scalability.

Benefits
Extracted with AI

  • Real-World Impact
  • Cutting-Edge Challenges
  • Growth-Oriented Environment

Similar jobs

Last update: 23 minutes ago

InstaDeep logo
InstaDeep

Senior DevOps Engineer

Join InstaDeep as a Senior DevOps Engineer in Paris, working at the intersection of machine learning and engineering.

InstaDeep logo
InstaDeep

Senior Backend Engineer

Join InstaDeep as a Senior Backend Engineer in Paris, working on cutting-edge AI technologies and backend development.

Clario logo
Clario

Senior AI Engineer - Machine Learning & Deep Learning

Senior AI Engineer specializing in Machine Learning and Deep Learning for healthcare technology in Belgium.

Index Exchange logo
Index Exchange

Senior Machine Learning & Optimization Engineer

Join Index Exchange as a Senior Machine Learning & Optimization Engineer in Berlin, focusing on MLOps, data science, and high-performance computing.

Scale AI logo
Scale AI

Senior Software Engineer, Machine Learning Infrastructure

Join Scale AI as a Senior Software Engineer in Machine Learning Infrastructure, focusing on backend system design and ML Infrastructure.

Wayve logo
Wayve

Senior Machine Learning Performance Engineer

Join Wayve as a Senior Machine Learning Performance Engineer to optimize large-scale training jobs and improve GPU efficiency.

FlexAI logo
FlexAI

Senior Integration Software Engineer

Join FlexAI as a Senior Integration Software Engineer to revolutionize AI infrastructure with cutting-edge integration solutions.

Echo Analytics logo
Echo Analytics

Senior Machine Learning Engineer

Join Echo Analytics as a Senior Machine Learning Engineer in Paris. Leverage ML to drive data modeling and design intelligent data flows.

eyepop.ai logo
eyepop.ai

Senior Software Engineer - Machine Learning and Data Science

Join EyePop.ai as a Senior Software Engineer to develop and scale machine learning and data science software pipelines.

NVIDIA logo
NVIDIA

Senior Deep Learning Engineer

Join NVIDIA as a Senior Deep Learning Engineer to optimize AI performance using PyTorch, TensorFlow, and more in Berlin.

Ubisoft logo
Ubisoft

Senior Machine Learning Engineer - AI Program

Join Ubisoft as a Senior Machine Learning Engineer to innovate AI projects. Work with cutting-edge technologies in a hybrid environment.

Enchanted Tools logo
Enchanted Tools

Senior Software Engineer, LLM/VLM

Senior Software Engineer specializing in LLM/VLM for robotics integration in Paris. Deep Learning, MLOps, Embedded Systems expertise required.

Front logo
Front

Senior Applied AI Engineer

Join Front as a Senior Applied AI Engineer in Paris to develop and implement generative AI applications, enhancing product features and user experiences.

Vedrai logo
Vedrai

Senior Machine Learning Engineer

Senior Machine Learning Engineer role in Milan, Italy. In-depth experience with AI, ML, Python, AWS, and MLOps required.

Dataiku logo
Dataiku

Software Engineer - AI & Machine Learning

Join Dataiku as a Software Engineer in AI & Machine Learning, working with Java, Scala, and Angular in a remote role.

SiPearl logo
SiPearl

AI Benchmark Engineer

Join SiPearl as an AI Benchmark Engineer to optimize AI applications and frameworks for high-performance computing.

Intuit logo
Intuit

Senior Machine Learning Engineer

Join Intuit as a Senior Machine Learning Engineer to innovate and scale AI algorithms in San Diego.

In The Pocket logo
In The Pocket

Senior Machine Learning Engineer

Join In The Pocket as a Senior Machine Learning Engineer to scale AI applications, focusing on MLOps and NLP, in Bucharest.

NVIDIA logo
NVIDIA

Senior Deep Learning Performance Software Engineer

Senior role optimizing deep learning performance at NVIDIA, involving Python, HPC, and AI technologies.

Deepgram logo
Deepgram

Senior Software Engineer, AI Inference

Senior AI Inference Engineer specializing in backend development and optimization techniques for high-performance computing.

NVIDIA logo
NVIDIA

Machine Learning Engineer - LLM Fine-tuning and Performance

Join NVIDIA as a Machine Learning Engineer specializing in LLM fine-tuning and performance optimization. Work with cutting-edge ML technologies.

Databricks logo
Databricks

Senior AI Security Engineer

Senior AI Security Engineer role focusing on AI system security, vulnerability management, and research in Paris.

AXA Group Operations logo
AXA Group Operations

Senior Machine Learning Engineer

Join AXA Group Operations as a Senior Machine Learning Engineer in Paris, leveraging AI to innovate and protect customers.

Clario logo
Clario

Senior AI Engineer - Machine Learning & Deep Learning

Senior AI Engineer role focusing on Machine Learning and Deep Learning in Budapest, Hungary. Hybrid work, competitive benefits.