About the Team
Our team plays a pivotal role in enhancing the capabilities and efficiency of our advanced AI systems. We design solutions that enable our machine learning models to scale seamlessly and perform optimally in real-world applications and large-scale research. Collaborating across InstaDeep, we directly impact projects in diverse fields including Life Sciences, Logistics, Chip Design, and Quantum ML.
The Role
We seek a highly skilled Machine Learning Engineer with a passion for tackling the challenges of large-scale ML development. You'll play a vital role in making our ambitious AI solutions a practical reality. If you thrive on system-level analysis, find joy in squeezing every ounce of performance from hardware, and love diving deep into algorithm optimisation, this is the position for you.
Responsibilities
- Scaling Expertise: Design and implement strategies to efficiently scale machine learning models across diverse hardware platforms (GPU/TPU).
- Performance Optimization: Analyse and profile ML systems under heavy load, pinpointing bottlenecks, and implementing targeted optimizations.
- Distributed Systems Architecture: Create robust distributed training and inference solutions for maximum computational efficiency.
- Algorithmic Optimization: Research and understand the latest deep learning literature to implement and optimize state-of-the-art algorithms and architectures, ensuring compute efficiency and performance.
- Low-Level Mastery: Write high-quality Python, C/C++, XLA, Pallas, Triton, and/or CUDA code to achieve performance breakthroughs.
Required Skills
- Understanding of Linux systems, performance analysis tools, and hardware optimisation techniques.
- Experience with distributed training frameworks (Ray, Dask, PyTorch Lightning, etc.).
- Expertise with Python and/or C++.
- Development with machine learning frameworks (JAX, Tensorflow, PyTorch etc.).
- Passion for profiling, identifying bottlenecks, and delivering efficient solutions.
Highly Desirable
- Track record of successfully scaling ML models.
- Experience writing custom CUDA kernels or XLA operations.
- Understanding of GPU/TPU architectures and their implications for efficient ML systems.
- Fundamentals of modern Deep Learning.
- Actively following ML trends and a desire to push boundaries.
Example Projects
- Profile algorithm traces, identifying opportunities for custom XLA operations and CUDA kernel development.
- Implement and apply SOTA architectures (MAMBA, Griffin, Hyena) to research and applied projects.
- Adapt algorithms for large-scale distributed architectures across HPC clusters.
- Employ memory-efficient techniques within models for increased parameter counts and longer context lengths.
What We Offer
- Real-World Impact: Directly contribute to the performance and reach of our AI solutions.
- Cutting-Edge Challenges: Tackle complex problems at the forefront of machine learning and large-scale system design.
- Growth-Oriented Environment: Expand your expertise in a team of talented engineers dedicated to advancing ML scalability.
Benefits Extracted with AI
- Real-World Impact
- Cutting-Edge Challenges
- Growth-Oriented Environment
Similar jobs
Last update: 23 minutes ago
Senior DevOps Engineer
Join InstaDeep as a Senior DevOps Engineer in Paris, working at the intersection of machine learning and engineering.
Senior Machine Learning Performance Engineer
Join Wayve as a Senior Machine Learning Performance Engineer to optimize large-scale training jobs and improve GPU efficiency.
Next-Gen Deep Learning Engineer
Join Zenseact as a Deep Learning Engineer in Munich. Work on cutting-edge AI for autonomous driving.
Senior Computer Vision Engineer
Join Meero as a Senior Computer Vision Engineer in Paris. Work on AI-driven image processing for e-commerce with cutting-edge technologies.
Backend Engineer with Machine Learning Focus
Join Accrete AI as a Backend Engineer with a focus on machine learning, building scalable AI solutions.
Senior Machine Learning Engineer
Join Intuit as a Senior Machine Learning Engineer to develop and deploy scalable data science models.
Senior Machine Learning Engineer
Join Bloomreach as a Senior Machine Learning Engineer to design and implement AI-driven components for personalized digital experiences.
Senior Machine Learning Engineer, Marketing Technology
Join Airbnb as a Senior Machine Learning Engineer to enhance personalized customer experiences using AI/ML.
Senior Machine Learning Engineer
Join Intuit as a Senior Machine Learning Engineer to develop and deploy data science models at scale using cutting-edge tools.
Software Engineer 2 - AI and Machine Learning
Join Microsoft as a Software Engineer 2 in Barcelona to innovate in AI and Machine Learning with a focus on large-scale data projects.
Lead Data Engineer
Join Partoo as a Lead Data Engineer in Paris, managing data pipelines, AI projects, and a team, with a focus on innovation and data security.
Lead Engineering Manager AI
Lead Engineering Manager AI role in Paris, focusing on AI-driven image production for e-commerce with Meero's AutoRetouch product line.
AI Engineer with Computer Vision and NLP Expertise
Join Algorhythm as an AI Engineer to develop cutting-edge AI solutions with a focus on Computer Vision and NLP.
Senior Software Engineer (Machine Learning & Infrastructure)
Join Adobe's AI team as a Senior Software Engineer to enhance Acrobat AI Assistant features, focusing on backend infrastructure and machine learning.
Principal Software Engineer - AI Platform
Join Accrete AI as a Principal Software Engineer to lead AI platform development, leveraging AI/ML frameworks and cloud technologies.
Senior MLOps Engineer
Join Intapp as a Senior MLOps Engineer to design, build, and maintain secure, scalable ML platforms. Remote position in Portugal.
Machine Learning Engineer (NLP)
Join Intapp as a Machine Learning Engineer (NLP) in Berlin. Work on AI/ML solutions, leveraging NLP technologies. Competitive salary and benefits.
Senior Deep Learning Engineer
Join Owl.co as a Senior Deep Learning Engineer to develop AI solutions for insurance claims in New York County, NY.
Software Engineer, YouTube Knowledge
Join Google as a Software Engineer on the YouTube Knowledge team in Paris, working on large-scale systems and machine learning.
Senior Applied Scientist - Machine Learning and Data Science
Join Amazon as a Senior Applied Scientist in Palo Alto, focusing on machine learning and data science for advertising.
Senior Software Engineer, Machine Learning
Join as a Senior Software Engineer in Machine Learning, working remotely to build ML-driven products for user engagement.
Senior Machine Learning Engineer
Join Overjet as a Senior Machine Learning Engineer to lead AI/ML model development and deployment in dental care.
Senior Software Development Engineer, Applied AI
Join Amazon's Applied AI team as a Senior Software Development Engineer to innovate with AI technologies.
Senior Software Engineer - Python and Cloud
Join Vectra AI as a Senior Software Engineer in Dublin, focusing on Python, cloud, and cybersecurity.