Stability AI logo

Lead Architect - Gen AI API Platform

Stability AI

About the Role

We are seeking a Lead Architect to spearhead the architecture, design, and development of our next-generation Gen AI API platform. This platform supports multiple modalities, including image, video, language, 3D, and audio. The ideal candidate will have extensive experience in architecting and building REST APIs, hosting AI/ML workflows on HPC clusters, setting up AWS infrastructure, and mentoring junior developers.

Responsibilities

  • Serve as the technical lead to drive architecture, design, and development of AI/ML SaaS services.
  • Set up inference on an HPC cluster for multiple modalities on a common Gen AI platform.
  • Build robust application backends on AWS infrastructure that support highly available AI/ML services at high scale, efficiently utilizing GPU clusters.
  • Define comprehensive API specifications and documentation.
  • Deliver customer-facing services, including account management, identity, single-sign-on, subscription billing, and self-service support tools, integrating with existing internal systems where necessary.
  • Collaborate with stakeholders such as the frontend team, product managers, and technical leadership to implement new features.
  • Lead system architecture design and decisions, helping drive consensus.
  • Manage large compute clusters for ML inference and development.
  • Deliver and manage our developer and researcher productivity tools, including CI/CD pipelines for deploying new machine learning models, orchestration, continuous/progressive deployments, test environments, feature flags, and GitHub.
  • Own the orchestration, deployments, middleware, and any other microservices required to meet the needs of our API customers.

Qualifications

  • 10+ years of experience in building REST APIs and backend infrastructure on AWS.
  • Experienced in designing and building AI/ML infrastructure and working with large GPU clusters, preferably in multiple modalities like image, video, audio, and language.
  • Distributed system architecture design knowledge and experience with delivering high traffic and highly available SaaS type services.
  • Well-versed in data structures, data modeling, and database management systems, billing and metering, as well as object and file storage systems.
  • Experienced in mentoring other engineers and collaborating with multiple stakeholders.
  • Experienced in root cause analysis and driving operational excellence initiatives of AI/ML services.
  • Highly proficient in Python and Typescript.

Compensation

The salary range for this role is between $190,000 and $250,000. Total compensation also includes stock options and benefits.

Equal Employment Opportunity

We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability, or other legally protected statuses.

Benefits
Extracted with AI

  • Stock options
  • Remote work flexibility

Similar jobs

Last update: 23 minutes ago

Stability AI logo
Stability AI

Senior Backend Engineer (AI)

Join Stability AI as a Senior Backend Engineer to develop REST APIs and AI/ML services for Generative AI models.

Stability AI logo
Stability AI

Senior Data Platform Engineer

Senior Data Platform Engineer specializing in AWS and GCP services, data pipelines, and cloud infrastructure.

Nintex logo
Nintex

AI Architect

Lead AI Architect role focusing on generative AI, computer vision, and machine learning in a transformative work environment.

Amazon Web Services (AWS) logo
Amazon Web Services (AWS)

Worldwide Specialist Solutions Architect - GenAI/ML

Join AWS as a Specialist Solutions Architect in GenAI/ML, leveraging AWS services to design scalable solutions.

Amazon Web Services (AWS) logo
Amazon Web Services (AWS)

Deep Learning Architect, Generative AI

Join AWS as a Deep Learning Architect in Rome to innovate with Generative AI and transform business opportunities.

GlobalLogic logo
GlobalLogic

Senior AI/ML Architect

Lead AI/ML architecture at GlobalLogic, designing scalable ML models and infrastructure. Requires 6+ years experience, Python, cloud platforms.

Microsoft logo
Microsoft

Cloud Solution Architect - Generative AI

Join Microsoft as a Cloud Solution Architect focusing on Generative AI, driving innovation and AI transformation on the Microsoft platform.

Wellhub logo
Wellhub

Lead Software Engineer - GenAI

Join Wellhub as a Lead Software Engineer in GenAI, focusing on AI development, API integration, and leadership in a remote role.

Amazon Web Services (AWS) logo
Amazon Web Services (AWS)

Senior Deep Learning Architect, Generative AI

Join AWS as a Senior Deep Learning Architect to innovate with Generative AI and transform business opportunities.

Thoughtful AI logo
Thoughtful AI

Senior Software Engineer, Platform

Join Thoughtful AI as a Senior Software Engineer, Platform. Lead, craft, and empower in a remote role with competitive salary and benefits.

Blend logo
Blend

Fullstack Developer - React, Python, AWS, Generative AI

Remote Fullstack Developer role focusing on React, Python, AWS, and Generative AI for short-term projects.

Stability AI logo
Stability AI

Senior Data Engineer

Join Stability AI as a Senior Data Engineer to build scalable data infrastructure for AI models. Remote work from Germany.

Amazon Web Services (AWS) logo
Amazon Web Services (AWS)

Deep Learning Architect, Generative AI Innovation Center

Join AWS as a Deep Learning Architect in Milan to innovate with Generative AI and transform business opportunities.

Scale AI logo
Scale AI

Senior Fullstack Software Engineer, GenAI Allocation

Senior Fullstack Engineer role focusing on AI applications, requiring AWS, OOP, and problem-solving skills.

FinThrive logo
FinThrive

AI Solutions Architect

Join FinThrive as an AI Solutions Architect to lead AI development and optimization in healthcare technology.

Amazon Web Services (AWS) logo
Amazon Web Services (AWS)

Deep Learning Architect, AWS Generative AI Innovation Center

Join AWS as a Deep Learning Architect to innovate with Generative AI, solving real-world problems in a fast-paced environment.

webAI logo
webAI

Staff Backend Engineer - Runtime Team Lead

Join webAI as a Staff Backend Engineer to lead the Runtime Team, focusing on distributed systems and high-performance engineering.

Caylent logo
Caylent

Principal Software Architect

Join Caylent as a Principal Software Architect to lead cloud-native projects, engage with clients, and drive innovation using AWS.

Microsoft logo
Microsoft

Cloud Solution Architect - Artificial Intelligence (AI)

Join Microsoft as a Cloud Solution Architect specializing in AI and ML, driving customer transformation on Azure.

micro1 logo
micro1

Senior API Developer with AI and Python Expertise

Join us as a Senior API Developer to build AI-driven solutions using Python and Golang. Work remotely with top-tier companies.

Stability AI logo
Stability AI

Remote Data Engineer - Research

Join Stability AI as a Remote Data Engineer to build scalable data infrastructure for AI models.

Scale AI logo
Scale AI

Senior Platform Engineer - Scale GenAI Platform

Senior Platform Engineer needed for Scale GenAI Platform in Budapest. Focus on AI, cloud platforms, and system integration.

Scale AI logo
Scale AI

Fullstack Software Engineer, GenAI Growth

Fullstack Software Engineer role focusing on AI growth, requiring skills in back-end and front-end development, and system design.

Swooped logo
Swooped

Senior Software Engineer, AI

Join as a Senior Software Engineer, AI, to innovate AI features in a remote-friendly environment. Enhance CRM with cutting-edge AI.