InstaDeep logo

Senior Machine Learning Engineer - Scaling and Performance Optimization

InstaDeep

About the Team

Our team plays a pivotal role in enhancing the capabilities and efficiency of our advanced AI systems. We design solutions that enable our machine learning models to scale seamlessly and perform optimally in real-world applications and large-scale research. Collaborating across InstaDeep, we directly impact projects in diverse fields including Life Sciences, Logistics, Chip Design, and Quantum ML.

The Role

We seek a highly skilled Machine Learning Engineer with a passion for tackling the challenges of large-scale ML development. You'll play a vital role in making our ambitious AI solutions a practical reality. If you thrive on system-level analysis, find joy in squeezing every ounce of performance from hardware, and love diving deep into algorithm optimisation, this is the position for you.

Responsibilities

  • Scaling Expertise: Design and implement strategies to efficiently scale machine learning models across diverse hardware platforms (GPU/TPU).
  • Performance Optimization: Analyse and profile ML systems under heavy load, pinpointing bottlenecks, and implementing targeted optimizations.
  • Distributed Systems Architecture: Create robust distributed training and inference solutions for maximum computational efficiency.
  • Algorithmic Optimization: Research and understand the latest deep learning literature to implement and optimize state-of-the-art algorithms and architectures, ensuring compute efficiency and performance.
  • Low-Level Mastery: Write high-quality Python, C/C++, XLA, Pallas, Triton, and/or CUDA code to achieve performance breakthroughs.

Required Skills

  • Understanding of Linux systems, performance analysis tools, and hardware optimisation techniques.
  • Experience with distributed training frameworks (Ray, Dask, PyTorch Lightning, etc.).
  • Expertise with Python and/or C++.
  • Development with machine learning frameworks (JAX, Tensorflow, PyTorch etc.).
  • Passion for profiling, identifying bottlenecks, and delivering efficient solutions.

Highly Desirable

  • Track record of successfully scaling ML models.
  • Experience writing custom CUDA kernels or XLA operations.
  • Understanding of GPU/TPU architectures and their implications for efficient ML systems.
  • Fundamentals of modern Deep Learning.
  • Actively following ML trends and a desire to push boundaries.

Example Projects

  • Profile algorithm traces, identifying opportunities for custom XLA operations and CUDA kernel development.
  • Implement and apply SOTA architectures (MAMBA, Griffin, Hyena) to research and applied projects.
  • Adapt algorithms for large-scale distributed architectures across HPC clusters.
  • Employ memory-efficient techniques within models for increased parameter counts and longer context lengths.

What We Offer

  • Real-World Impact: Directly contribute to the performance and reach of our AI solutions.
  • Cutting-Edge Challenges: Tackle complex problems at the forefront of machine learning and large-scale system design.
  • Growth-Oriented Environment: Expand your expertise in a team of talented engineers dedicated to advancing ML scalability.

Benefits
Extracted with AI

  • Real-World Impact
  • Cutting-Edge Challenges
  • Growth-Oriented Environment

Similar jobs

Last update: 23 minutes ago

FoodLabs logo
FoodLabs

Senior C++ Computer Vision Engineer

Join a cutting-edge AI-DeepTech startup in Berlin as a Senior C++ Computer Vision Engineer. Work on world-class on-device AI technology.

DeepL logo
DeepL

Senior Backend Engineer C++

Join DeepL as a Senior Backend Engineer C++ to design and maintain scalable backend services using C++ and AI technologies.

dataroots logo
dataroots

Expert Machine Learning Engineer

Join Dataroots as an Expert Machine Learning Engineer to design and deliver AI-powered solutions, focusing on machine learning models.

Instapro Group logo
Instapro Group

Senior Backend Engineer - Payments

Join Instapro Group as a Senior Backend Engineer in Berlin, focusing on PHP and payment systems in a hybrid work environment.

Instapro Group logo
Instapro Group

Senior Backend Engineer - PHP, Symfony, Laravel

Join Instapro Group as a Senior Backend Engineer, working with PHP, Symfony, and Laravel in a hybrid environment.

InstaDeep logo
InstaDeep

Senior DevOps Engineer

Join InstaDeep as a Senior DevOps Engineer in Paris, working at the intersection of machine learning and engineering.

yourfirm GmbH logo
yourfirm GmbH

Senior Fullstack Developer for AI-Driven Mission Technologies

Seeking a Senior Fullstack Developer for AI-driven mission technologies, focusing on Java, JavaScript, Python, and C++. Remote work available.

BCG X logo
BCG X

AI Engineer

Join BCG X as an AI Engineer in Milan, Italy. Develop AI solutions, partner with clients, and drive innovation in a dynamic environment.

ST Engineering iDirect logo
ST Engineering iDirect

Senior Software Engineer - Satellite Communications

Join ST Engineering iDirect as a Senior Software Engineer to develop satellite communication products using C++, Linux, and Agile methodologies.

Skytree logo
Skytree

Senior IoT Engineer

Join Skytree as a Senior IoT Engineer to lead IoT projects, focusing on Azure IoT solutions, edge computing, and data pipelines.

Darktrace logo
Darktrace

Solutions Engineer

Join Darktrace as a Solutions Engineer in Amsterdam, providing technical pre-sales and post-sales support in a hybrid work environment.

InstaDeep logo
InstaDeep

Senior Backend Engineer

Join InstaDeep as a Senior Backend Engineer in Paris, working on cutting-edge AI technologies and backend development.

Aiven logo
Aiven

Staff Software Engineer

Join Aiven as a Staff Software Engineer to develop cloud operations platforms using open-source technologies. Hybrid work in Berlin.

Aiven logo
Aiven

Senior Software Engineer - Python, Apache Kafka

Join Aiven as a Senior Software Engineer in Berlin, focusing on Python and Apache Kafka in a hybrid work environment.

Gorgias logo
Gorgias

Senior Full-Stack Engineer ReactJS/NodeJS

Join Gorgias as a Senior Full-Stack Engineer specializing in ReactJS and NodeJS, enhancing AI-powered ecommerce solutions.

Reddit, Inc. logo
Reddit, Inc.

Senior Solutions Engineer

Join Reddit as a Senior Solutions Engineer in Amsterdam to support our growing advertising business with technical expertise and problem-solving skills.

Ilkari logo
Ilkari

Senior Software Engineer - Python, Django, Angular

Join Ilkari as a Senior Software Engineer to lead development in Python, Django, and Angular, creating scalable solutions in a hybrid work environment.

Zalando logo
Zalando

Backend Software Engineer - Privacy Technology

Join Zalando as a Backend Software Engineer in Privacy Technology, focusing on data protection and privacy automation services.

Together AI logo
Together AI

Senior Backend Engineer - Java, Rust, Go

Join Together AI as a Senior Backend Engineer in Amsterdam. Work with Java, Rust, and Go to build scalable backend systems.

Optiver logo
Optiver

Production Engineer

Join Optiver as a Production Engineer in Amsterdam to manage live trading environments and enhance system reliability and performance.

Uber logo
Uber

Staff Software Engineer, Fullstack, Capacity & Efficiency Engineering

Join Uber as a Staff Software Engineer in Amsterdam, focusing on fullstack development and capacity efficiency engineering.

Catalyze Group logo
Catalyze Group

Full Stack Developer with AI and API Expertise

Join Catalyze Group as a Full Stack Developer to build AI-powered grant-writing tools. Work with React, Django, and more in Amsterdam.

Greener Power Solutions logo
Greener Power Solutions

Freelance DevOps Engineer with Python Expertise

Join Greener Power Solutions as a Freelance DevOps Engineer to drive sustainable energy solutions with Python and DevOps expertise.

Uber logo
Uber

Senior Software Engineer: Configuration Management/Deployment

Join Uber's Amsterdam team as a Senior Software Engineer focusing on configuration management and deployment. Solve infrastructure challenges at scale.