Scale AI logo

Senior Software Engineer, Machine Learning Infrastructure

Scale AI

Job Overview

Scale AI is seeking a Senior Software Engineer to join our Machine Learning Infrastructure team. This role involves building and optimizing our Training Platform, working closely with Machine Learning researchers to enhance experimentation throughput. The ideal candidate will have a strong foundation in machine learning, backend system design, and prior experience in ML Infrastructure.

Key Responsibilities

  • Develop highly available, observable, performant, and cost-effective APIs for model training.
  • Participate in the team’s on-call process to ensure service availability.
  • Manage projects end-to-end, from requirements gathering to implementation, in a collaborative environment.
  • Make informed decisions on build vs. buy tradeoffs, focusing on cost efficiency.

Required Skills and Experience

  • 4+ years of experience in building machine learning training pipelines or inference services in production.
  • Proficiency in distributed training techniques such as DeepSpeed and FSDP.
  • Experience in building, deploying, and monitoring complex microservice architectures.
  • Strong skills in Python, Docker, Kubernetes, and Infrastructure as Code (e.g., Terraform).

Nice to Have

  • Experience with LLM inference latency optimization techniques, such as kernel fusion, quantization, and dynamic batching.
  • Familiarity with cloud technology stacks like AWS or GCP.

Compensation and Benefits

  • Base salary range: $160,000—$225,600 USD
  • Equity-based compensation, subject to Board approval
  • Comprehensive health, dental, and vision coverage
  • Retirement benefits
  • Learning and development stipend
  • Generous PTO
  • Potential additional benefits such as a commuter stipend

About Scale AI

At Scale, we are committed to accelerating the development of AI applications. Our mission is to transition from traditional software to AI across industries, transforming how organizations build and deploy AI. We power the world's most advanced LLMs, generative models, and computer vision models, trusted by companies like OpenAI, Meta, and Microsoft.

Scale AI is an affirmative action employer and an inclusive and equal opportunity workplace. We are committed to providing reasonable accommodations to applicants with disabilities. If you need assistance, please contact us.

Join us in our mission to unlock the value of AI and transform industries worldwide.

Benefits
Extracted with AI

  • Comprehensive health, dental and vision coverage
  • Retirement benefits
  • Learning and development stipend
  • Generous PTO
  • Commuter stipend

Similar jobs

Last update: 23 minutes ago

Scale AI logo
Scale AI

Senior Software Engineer, GenAI Safety & Evaluation

Senior Software Engineer role in AI safety and evaluation, requiring skills in AI, backend development, and system performance.

Scale AI logo
Scale AI

Senior Fullstack Software Engineer, GenAI Horizontal Task Tooling

Join Scale AI as a Senior Fullstack Software Engineer to build web-based applications for AI data annotation.

Scale AI logo
Scale AI

Senior Software Engineer, GenAI Safety & Evaluation

Join Scale AI as a Senior Software Engineer in GenAI Safety & Evaluation, shaping AI model evaluation.

Scale AI logo
Scale AI

Senior Fullstack Software Engineer, GenAI Allocation

Senior Fullstack Engineer role focusing on AI applications, requiring AWS, OOP, and problem-solving skills.

Scale AI logo
Scale AI

Fullstack Software Engineer, GenAI Growth

Join Scale AI as a Fullstack Software Engineer to build and optimize GenAI growth products.

Scale AI logo
Scale AI

Senior Software Engineer, Generative AI

Senior Software Engineer for Generative AI at Scale AI, focusing on backend development and AI technologies.

Scale AI logo
Scale AI

Senior Fullstack Software Engineer, Pay & Incentives

Senior Fullstack Engineer role focusing on payment systems, requiring skills in both front-end and back-end development.

Scale AI logo
Scale AI

Senior Software Engineer, GenAI Model Evaluation

Join Scale AI as a Senior Software Engineer in GenAI Model Evaluation, focusing on AI model safety and performance.

Scale AI logo
Scale AI

Software Engineer, GenAI Growth

Join Scale AI as a Software Engineer in GenAI Growth, focusing on AI applications and platform growth.

Scale AI logo
Scale AI

Senior Software Engineer, Generative AI

Senior Software Engineer role at Scale AI, focusing on Generative AI technologies, requiring skills in Python, Node.js, React, and more.

Scale AI logo
Scale AI

Software Engineer, GenAI Growth

Join Scale AI as a Software Engineer in GenAI Growth, developing AI applications with a dynamic team in NYC.

Scale AI logo
Scale AI

Senior Software Engineer - Generative AI Operator

Senior Software Engineer for Generative AI, focusing on building advanced LLMs and generative models with technologies like MongoDB, React, and Node.js.

Scale AI logo
Scale AI

Machine Learning Engineer, Violations

Join Scale AI as a Machine Learning Engineer in San Francisco, focusing on violations and fraud detection using advanced AI.

Scale AI logo
Scale AI logo
Scale AI

Software Engineer - New Grad

Join Scale AI as a Software Engineer - New Grad in San Francisco. Work on AI applications with TypeScript, MongoDB, and more.

Scale AI logo
Scale AI

Lead Software Engineer, Generative AI

Lead Software Engineer role at Scale AI, focusing on generative AI, requiring skills in Python, Next.js, and AI technologies.

Scale AI logo
Scale AI

Machine Learning Research Engineer - New Grad

Join Scale AI as a Machine Learning Research Engineer - New Grad, focusing on AI, ML, and data science.

Scale AI logo
Scale AI

Senior Software Engineer - Frontend, Generative AI

Senior Software Engineer for Frontend in Generative AI at Scale AI, focusing on React.js, Node.js, and performance optimization.

Scale AI logo
Scale AI

Senior Prompt Engineer

Senior Prompt Engineer at Scale AI, specializing in LLM applications, data analysis, and AI model development.

Scale AI logo
Scale AI

Machine Learning Research Engineering Intern (Summer 2025)

Join Scale AI as a Machine Learning Research Engineering Intern for Summer 2025, focusing on AI and ML solutions.

CHAI: AI Platform logo
CHAI: AI Platform

Senior ML Infrastructure Engineer

Join CHAI: AI Platform as a Senior ML Infrastructure Engineer to build and scale ML systems in Palo Alto.

Scale AI logo
Scale AI

Tech Lead, Software Engineer, GenAI Fraud

Join Scale AI as a Tech Lead, Software Engineer in GenAI Fraud, focusing on cutting-edge fraud prevention solutions.

Scale AI logo
Scale AI

Fullstack Software Engineer, GenAI Quality

Fullstack Software Engineer role focusing on GenAI Quality, involving end-to-end feature development and system design.

Scale AI logo
Scale AI

Software Engineering Intern (Summer 2025)

Join Scale AI as a Software Engineering Intern for Summer 2025, working on AI applications with Python, TypeScript, and MongoDB.