Senior Software Engineer, Machine Learning Infrastructure

Scale AI

Job Overview

Scale AI is seeking a Senior Software Engineer to join our Machine Learning Infrastructure team. This role involves building and optimizing our Training Platform, working closely with Machine Learning researchers to enhance experimentation throughput. The ideal candidate will have a strong foundation in machine learning, backend system design, and prior experience in ML Infrastructure.

Key Responsibilities

  • Develop highly available, observable, performant, and cost-effective APIs for model training.
  • Participate in the team’s on-call process to ensure service availability.
  • Manage projects end-to-end, from requirements gathering to implementation, in a collaborative environment.
  • Make informed decisions on build vs. buy tradeoffs, focusing on cost efficiency.

Required Skills and Experience

  • 4+ years of experience in building machine learning training pipelines or inference services in production.
  • Proficiency with distributed training frameworks and techniques such as DeepSpeed and FSDP (Fully Sharded Data Parallel).
  • Experience in building, deploying, and monitoring complex microservice architectures.
  • Strong skills in Python, Docker, Kubernetes, and Infrastructure as Code (e.g., Terraform).

Nice to Have

  • Experience with LLM inference latency optimization techniques, such as kernel fusion, quantization, and dynamic batching.
  • Familiarity with cloud technology stacks like AWS or GCP.

Compensation and Benefits

  • Base salary range: $160,000–$225,600 USD
  • Equity-based compensation, subject to Board approval
  • Comprehensive health, dental, and vision coverage
  • Retirement benefits
  • Learning and development stipend
  • Generous PTO
  • Potential additional benefits such as a commuter stipend

About Scale AI

At Scale, we are committed to accelerating the development of AI applications. Our mission is to drive the transition from traditional software to AI across industries and to transform how organizations build and deploy AI. We power the world's most advanced LLMs, generative models, and computer vision models, and we are trusted by companies like OpenAI, Meta, and Microsoft.

Scale AI is an affirmative action employer and an inclusive and equal opportunity workplace. We are committed to providing reasonable accommodations to applicants with disabilities. If you need assistance, please contact us.

Join us in our mission to unlock the value of AI and transform industries worldwide.
