About the Team
Our team plays a pivotal role in enhancing the capabilities and efficiency of our advanced AI systems. We design solutions that enable our machine learning models to scale seamlessly and perform optimally in real-world applications and large-scale research. Collaborating across InstaDeep, we directly impact projects in diverse fields including Life Sciences, Logistics, Chip Design, and Quantum ML.
The Role
We seek a highly skilled Machine Learning Engineer with a passion for tackling the challenges of large-scale ML development. You'll play a vital role in making our ambitious AI solutions a practical reality. If you thrive on system-level analysis, find joy in squeezing every ounce of performance from hardware, and love diving deep into algorithm optimisation, this is the position for you.
Responsibilities
- Scaling Expertise: Design and implement strategies to efficiently scale machine learning models across diverse hardware platforms (GPU/TPU).
- Performance Optimization: Analyse and profile ML systems under heavy load, pinpointing bottlenecks, and implementing targeted optimizations.
- Distributed Systems Architecture: Create robust distributed training and inference solutions for maximum computational efficiency.
- Algorithmic Optimization: Research and understand the latest deep learning literature to implement and optimize state-of-the-art algorithms and architectures, ensuring compute efficiency and performance.
- Low-Level Mastery: Write high-quality Python, C/C++, XLA, Pallas, Triton, and/or CUDA code to achieve performance breakthroughs.
Required Skills
- Understanding of Linux systems, performance analysis tools, and hardware optimisation techniques.
- Experience with distributed training frameworks (Ray, Dask, PyTorch Lightning, etc.).
- Expertise with Python and/or C++.
- Development with machine learning frameworks (JAX, Tensorflow, PyTorch etc.).
- Passion for profiling, identifying bottlenecks, and delivering efficient solutions.
Highly Desirable
- Track record of successfully scaling ML models.
- Experience writing custom CUDA kernels or XLA operations.
- Understanding of GPU/TPU architectures and their implications for efficient ML systems.
- Fundamentals of modern Deep Learning.
- Actively following ML trends and a desire to push boundaries.
Example Projects
- Profile algorithm traces, identifying opportunities for custom XLA operations and CUDA kernel development.
- Implement and apply SOTA architectures (MAMBA, Griffin, Hyena) to research and applied projects.
- Adapt algorithms for large-scale distributed architectures across HPC clusters.
- Employ memory-efficient techniques within models for increased parameter counts and longer context lengths.
What We Offer
- Real-World Impact: Directly contribute to the performance and reach of our AI solutions.
- Cutting-Edge Challenges: Tackle complex problems at the forefront of machine learning and large-scale system design.
- Growth-Oriented Environment: Expand your expertise in a team of talented engineers dedicated to advancing ML scalability.
Benefits Extracted with AI
- Real-World Impact
- Cutting-Edge Challenges
- Growth-Oriented Environment
Similar jobs
Last update: 23 minutes ago
Senior C++ Computer Vision Engineer
Join a cutting-edge AI-DeepTech startup in Berlin as a Senior C++ Computer Vision Engineer. Work on world-class on-device AI technology.
Senior Backend Engineer C++
Join DeepL as a Senior Backend Engineer C++ to design and maintain scalable backend services using C++ and AI technologies.
Expert Machine Learning Engineer
Join Dataroots as an Expert Machine Learning Engineer to design and deliver AI-powered solutions, focusing on machine learning models.
Senior Backend Engineer - Payments
Join Instapro Group as a Senior Backend Engineer in Berlin, focusing on PHP and payment systems in a hybrid work environment.
Senior Backend Engineer - PHP, Symfony, Laravel
Join Instapro Group as a Senior Backend Engineer, working with PHP, Symfony, and Laravel in a hybrid environment.
Senior DevOps Engineer
Join InstaDeep as a Senior DevOps Engineer in Paris, working at the intersection of machine learning and engineering.
Senior Fullstack Developer for AI-Driven Mission Technologies
Seeking a Senior Fullstack Developer for AI-driven mission technologies, focusing on Java, JavaScript, Python, and C++. Remote work available.
AI Engineer
Join BCG X as an AI Engineer in Milan, Italy. Develop AI solutions, partner with clients, and drive innovation in a dynamic environment.
Senior Software Engineer - Satellite Communications
Join ST Engineering iDirect as a Senior Software Engineer to develop satellite communication products using C++, Linux, and Agile methodologies.
Senior IoT Engineer
Join Skytree as a Senior IoT Engineer to lead IoT projects, focusing on Azure IoT solutions, edge computing, and data pipelines.
Solutions Engineer
Join Darktrace as a Solutions Engineer in Amsterdam, providing technical pre-sales and post-sales support in a hybrid work environment.
Senior Backend Engineer
Join InstaDeep as a Senior Backend Engineer in Paris, working on cutting-edge AI technologies and backend development.
Staff Software Engineer
Join Aiven as a Staff Software Engineer to develop cloud operations platforms using open-source technologies. Hybrid work in Berlin.
Senior Software Engineer - Python, Apache Kafka
Join Aiven as a Senior Software Engineer in Berlin, focusing on Python and Apache Kafka in a hybrid work environment.
Senior Full-Stack Engineer ReactJS/NodeJS
Join Gorgias as a Senior Full-Stack Engineer specializing in ReactJS and NodeJS, enhancing AI-powered ecommerce solutions.
Senior Solutions Engineer
Join Reddit as a Senior Solutions Engineer in Amsterdam to support our growing advertising business with technical expertise and problem-solving skills.
Senior Software Engineer - Python, Django, Angular
Join Ilkari as a Senior Software Engineer to lead development in Python, Django, and Angular, creating scalable solutions in a hybrid work environment.
Backend Software Engineer - Privacy Technology
Join Zalando as a Backend Software Engineer in Privacy Technology, focusing on data protection and privacy automation services.
Senior Backend Engineer - Java, Rust, Go
Join Together AI as a Senior Backend Engineer in Amsterdam. Work with Java, Rust, and Go to build scalable backend systems.
Production Engineer
Join Optiver as a Production Engineer in Amsterdam to manage live trading environments and enhance system reliability and performance.
Staff Software Engineer, Fullstack, Capacity & Efficiency Engineering
Join Uber as a Staff Software Engineer in Amsterdam, focusing on fullstack development and capacity efficiency engineering.
Full Stack Developer with AI and API Expertise
Join Catalyze Group as a Full Stack Developer to build AI-powered grant-writing tools. Work with React, Django, and more in Amsterdam.
Freelance DevOps Engineer with Python Expertise
Join Greener Power Solutions as a Freelance DevOps Engineer to drive sustainable energy solutions with Python and DevOps expertise.
Senior Software Engineer: Configuration Management/Deployment
Join Uber's Amsterdam team as a Senior Software Engineer focusing on configuration management and deployment. Solve infrastructure challenges at scale.