About the Team
Our team plays a pivotal role in enhancing the capabilities and efficiency of our advanced AI systems. We design solutions that enable our machine learning models to scale seamlessly and perform optimally in real-world applications and large-scale research. Collaborating across InstaDeep, we directly impact projects in diverse fields including Life Sciences, Logistics, Chip Design, and Quantum ML.
The Role
We seek a highly skilled Machine Learning Engineer with a passion for tackling the challenges of large-scale ML development. You'll play a vital role in making our ambitious AI solutions a practical reality. If you thrive on system-level analysis, find joy in squeezing every ounce of performance from hardware, and love diving deep into algorithm optimisation, this is the position for you.
Responsibilities
- Scaling Expertise: Design and implement strategies to efficiently scale machine learning models across diverse hardware platforms (GPU/TPU).
- Performance Optimization: Analyse and profile ML systems under heavy load, pinpointing bottlenecks, and implementing targeted optimizations.
- Distributed Systems Architecture: Create robust distributed training and inference solutions for maximum computational efficiency.
- Algorithmic Optimization: Research and understand the latest deep learning literature to implement and optimize state-of-the-art algorithms and architectures, ensuring compute efficiency and performance.
- Low-Level Mastery: Write high-quality Python, C/C++, XLA, Pallas, Triton, and/or CUDA code to achieve performance breakthroughs.
Required Skills
- Understanding of Linux systems, performance analysis tools, and hardware optimisation techniques.
- Experience with distributed training frameworks (Ray, Dask, PyTorch Lightning, etc.).
- Expertise with Python and/or C++.
- Development with machine learning frameworks (JAX, Tensorflow, PyTorch etc.).
- Passion for profiling, identifying bottlenecks, and delivering efficient solutions.
Highly Desirable
- Track record of successfully scaling ML models.
- Experience writing custom CUDA kernels or XLA operations.
- Understanding of GPU/TPU architectures and their implications for efficient ML systems.
- Fundamentals of modern Deep Learning.
- Actively following ML trends and a desire to push boundaries.
Example Projects
- Profile algorithm traces, identifying opportunities for custom XLA operations and CUDA kernel development.
- Implement and apply SOTA architectures (MAMBA, Griffin, Hyena) to research and applied projects.
- Adapt algorithms for large-scale distributed architectures across HPC clusters.
- Employ memory-efficient techniques within models for increased parameter counts and longer context lengths.
What We Offer
- Real-World Impact: Directly contribute to the performance and reach of our AI solutions.
- Cutting-Edge Challenges: Tackle complex problems at the forefront of machine learning and large-scale system design.
- Growth-Oriented Environment: Expand your expertise in a team of talented engineers dedicated to advancing ML scalability.
Benefits Extracted with AI
- Real-World Impact
- Cutting-Edge Challenges
- Growth-Oriented Environment
Similar jobs
Last update: 23 minutes ago
Senior DevOps Engineer
Join InstaDeep as a Senior DevOps Engineer in Paris, working at the intersection of machine learning and engineering.
Senior Backend Engineer
Join InstaDeep as a Senior Backend Engineer in Paris, working on cutting-edge AI technologies and backend development.
Senior AI Engineer - Machine Learning & Deep Learning
Senior AI Engineer specializing in Machine Learning and Deep Learning for healthcare technology in Belgium.
Senior Machine Learning & Optimization Engineer
Join Index Exchange as a Senior Machine Learning & Optimization Engineer in Berlin, focusing on MLOps, data science, and high-performance computing.
Senior Software Engineer, Machine Learning Infrastructure
Join Scale AI as a Senior Software Engineer in Machine Learning Infrastructure, focusing on backend system design and ML Infrastructure.
Senior Machine Learning Performance Engineer
Join Wayve as a Senior Machine Learning Performance Engineer to optimize large-scale training jobs and improve GPU efficiency.
Senior Integration Software Engineer
Join FlexAI as a Senior Integration Software Engineer to revolutionize AI infrastructure with cutting-edge integration solutions.
Senior Machine Learning Engineer
Join Echo Analytics as a Senior Machine Learning Engineer in Paris. Leverage ML to drive data modeling and design intelligent data flows.
Senior Software Engineer - Machine Learning and Data Science
Join EyePop.ai as a Senior Software Engineer to develop and scale machine learning and data science software pipelines.
Senior Deep Learning Engineer
Join NVIDIA as a Senior Deep Learning Engineer to optimize AI performance using PyTorch, TensorFlow, and more in Berlin.
Senior Machine Learning Engineer - AI Program
Join Ubisoft as a Senior Machine Learning Engineer to innovate AI projects. Work with cutting-edge technologies in a hybrid environment.
Senior Software Engineer, LLM/VLM
Senior Software Engineer specializing in LLM/VLM for robotics integration in Paris. Deep Learning, MLOps, Embedded Systems expertise required.
Senior Applied AI Engineer
Join Front as a Senior Applied AI Engineer in Paris to develop and implement generative AI applications, enhancing product features and user experiences.
Senior Machine Learning Engineer
Senior Machine Learning Engineer role in Milan, Italy. In-depth experience with AI, ML, Python, AWS, and MLOps required.
Software Engineer - AI & Machine Learning
Join Dataiku as a Software Engineer in AI & Machine Learning, working with Java, Scala, and Angular in a remote role.
AI Benchmark Engineer
Join SiPearl as an AI Benchmark Engineer to optimize AI applications and frameworks for high-performance computing.
Senior Machine Learning Engineer
Join Intuit as a Senior Machine Learning Engineer to innovate and scale AI algorithms in San Diego.
Senior Machine Learning Engineer
Join In The Pocket as a Senior Machine Learning Engineer to scale AI applications, focusing on MLOps and NLP, in Bucharest.
Senior Deep Learning Performance Software Engineer
Senior role optimizing deep learning performance at NVIDIA, involving Python, HPC, and AI technologies.
Senior Software Engineer, AI Inference
Senior AI Inference Engineer specializing in backend development and optimization techniques for high-performance computing.
Machine Learning Engineer - LLM Fine-tuning and Performance
Join NVIDIA as a Machine Learning Engineer specializing in LLM fine-tuning and performance optimization. Work with cutting-edge ML technologies.
Senior AI Security Engineer
Senior AI Security Engineer role focusing on AI system security, vulnerability management, and research in Paris.
Senior Machine Learning Engineer
Join AXA Group Operations as a Senior Machine Learning Engineer in Paris, leveraging AI to innovate and protect customers.
Senior AI Engineer - Machine Learning & Deep Learning
Senior AI Engineer role focusing on Machine Learning and Deep Learning in Budapest, Hungary. Hybrid work, competitive benefits.