Job Overview
Scale AI is seeking a Senior Software Engineer to join our Machine Learning Infrastructure team. This role involves building and optimizing our Training Platform, working closely with Machine Learning researchers to enhance experimentation throughput. The ideal candidate will have a strong foundation in machine learning, backend system design, and prior experience in ML Infrastructure.
Key Responsibilities
- Develop highly available, observable, performant, and cost-effective APIs for model training.
- Participate in the team’s on-call process to ensure service availability.
- Manage projects end-to-end, from requirements gathering to implementation, in a collaborative environment.
- Make informed decisions on build vs. buy tradeoffs, focusing on cost efficiency.
Required Skills and Experience
- 4+ years of experience in building machine learning training pipelines or inference services in production.
- Proficiency in distributed training techniques such as DeepSpeed and FSDP.
- Experience in building, deploying, and monitoring complex microservice architectures.
- Strong skills in Python, Docker, Kubernetes, and Infrastructure as Code (e.g., Terraform).
Nice to Have
- Experience with LLM inference latency optimization techniques, such as kernel fusion, quantization, and dynamic batching.
- Familiarity with cloud technology stacks like AWS or GCP.
Compensation and Benefits
- Base salary range: $160,000—$225,600 USD
- Equity-based compensation, subject to Board approval
- Comprehensive health, dental, and vision coverage
- Retirement benefits
- Learning and development stipend
- Generous PTO
- Potential additional benefits such as a commuter stipend
About Scale AI
At Scale, we are committed to accelerating the development of AI applications. Our mission is to transition from traditional software to AI across industries, transforming how organizations build and deploy AI. We power the world's most advanced LLMs, generative models, and computer vision models, trusted by companies like OpenAI, Meta, and Microsoft.
Scale AI is an affirmative action employer and an inclusive and equal opportunity workplace. We are committed to providing reasonable accommodations to applicants with disabilities. If you need assistance, please contact us.
Join us in our mission to unlock the value of AI and transform industries worldwide.
Benefits Extracted with AI
- Comprehensive health, dental and vision coverage
- Retirement benefits
- Learning and development stipend
- Generous PTO
- Commuter stipend
Similar jobs
Last update: 23 minutes ago
Senior Software Engineer, GenAI Safety & Evaluation
Senior Software Engineer role in AI safety and evaluation, requiring skills in AI, backend development, and system performance.
Senior Fullstack Software Engineer, GenAI Horizontal Task Tooling
Join Scale AI as a Senior Fullstack Software Engineer to build web-based applications for AI data annotation.
Senior Software Engineer, GenAI Safety & Evaluation
Join Scale AI as a Senior Software Engineer in GenAI Safety & Evaluation, shaping AI model evaluation.
Senior Fullstack Software Engineer, GenAI Allocation
Senior Fullstack Engineer role focusing on AI applications, requiring AWS, OOP, and problem-solving skills.
Fullstack Software Engineer, GenAI Growth
Join Scale AI as a Fullstack Software Engineer to build and optimize GenAI growth products.
Senior Software Engineer, Generative AI
Senior Software Engineer for Generative AI at Scale AI, focusing on backend development and AI technologies.
Senior Fullstack Software Engineer, Pay & Incentives
Senior Fullstack Engineer role focusing on payment systems, requiring skills in both front-end and back-end development.
Senior Software Engineer, GenAI Model Evaluation
Join Scale AI as a Senior Software Engineer in GenAI Model Evaluation, focusing on AI model safety and performance.
Software Engineer, GenAI Growth
Join Scale AI as a Software Engineer in GenAI Growth, focusing on AI applications and platform growth.
Senior Software Engineer, Generative AI
Senior Software Engineer role at Scale AI, focusing on Generative AI technologies, requiring skills in Python, Node.js, React, and more.
Software Engineer, GenAI Growth
Join Scale AI as a Software Engineer in GenAI Growth, developing AI applications with a dynamic team in NYC.
Senior Software Engineer - Generative AI Operator
Senior Software Engineer for Generative AI, focusing on building advanced LLMs and generative models with technologies like MongoDB, React, and Node.js.
Machine Learning Engineer, Violations
Join Scale AI as a Machine Learning Engineer in San Francisco, focusing on violations and fraud detection using advanced AI.
Manager, Machine Learning Research Engineer, Generative AI
Lead a team in developing generative AI models at Scale AI, focusing on LLMs and MLOps.
Software Engineer - New Grad
Join Scale AI as a Software Engineer - New Grad in San Francisco. Work on AI applications with TypeScript, MongoDB, and more.
Lead Software Engineer, Generative AI
Lead Software Engineer role at Scale AI, focusing on generative AI, requiring skills in Python, Next.js, and AI technologies.
Machine Learning Research Engineer - New Grad
Join Scale AI as a Machine Learning Research Engineer - New Grad, focusing on AI, ML, and data science.
Senior Software Engineer - Frontend, Generative AI
Senior Software Engineer for Frontend in Generative AI at Scale AI, focusing on React.js, Node.js, and performance optimization.
Senior Prompt Engineer
Senior Prompt Engineer at Scale AI, specializing in LLM applications, data analysis, and AI model development.
Machine Learning Research Engineering Intern (Summer 2025)
Join Scale AI as a Machine Learning Research Engineering Intern for Summer 2025, focusing on AI and ML solutions.
Senior ML Infrastructure Engineer
Join CHAI: AI Platform as a Senior ML Infrastructure Engineer to build and scale ML systems in Palo Alto.
Tech Lead, Software Engineer, GenAI Fraud
Join Scale AI as a Tech Lead, Software Engineer in GenAI Fraud, focusing on cutting-edge fraud prevention solutions.
Fullstack Software Engineer, GenAI Quality
Fullstack Software Engineer role focusing on GenAI Quality, involving end-to-end feature development and system design.
Software Engineering Intern (Summer 2025)
Join Scale AI as a Software Engineering Intern for Summer 2025, working on AI applications with Python, TypeScript, and MongoDB.