About the Role
We are seeking a Lead Architect to spearhead the architecture, design, and development of our next-generation Gen AI API platform. This platform supports multiple modalities, including image, video, language, 3D, and audio. The ideal candidate will have extensive experience in architecting and building REST APIs, hosting AI/ML workflows on HPC clusters, setting up AWS infrastructure, and mentoring junior developers.
Responsibilities
- Serve as the technical lead to drive architecture, design, and development of AI/ML SaaS services.
- Set up inference on an HPC cluster for multiple modalities on a common Gen AI platform.
- Build robust application backends on AWS infrastructure that support highly available AI/ML services at high scale, efficiently utilizing GPU clusters.
- Define comprehensive API specifications and documentation.
- Deliver customer-facing services, including account management, identity, single-sign-on, subscription billing, and self-service support tools, integrating with existing internal systems where necessary.
- Collaborate with stakeholders such as the frontend team, product managers, and technical leadership to implement new features.
- Lead system architecture design and decisions, helping drive consensus.
- Manage large compute clusters for ML inference and development.
- Deliver and manage our developer and researcher productivity tools, including CI/CD pipelines for deploying new machine learning models, orchestration, continuous/progressive deployments, test environments, feature flags, and GitHub.
- Own the orchestration, deployments, middleware, and any other microservices required to meet the needs of our API customers.
Qualifications
- 10+ years of experience in building REST APIs and backend infrastructure on AWS.
- Experienced in designing and building AI/ML infrastructure and working with large GPU clusters, preferably in multiple modalities like image, video, audio, and language.
- Distributed system architecture design knowledge and experience with delivering high traffic and highly available SaaS type services.
- Well-versed in data structures, data modeling, and database management systems, billing and metering, as well as object and file storage systems.
- Experienced in mentoring other engineers and collaborating with multiple stakeholders.
- Experienced in root cause analysis and driving operational excellence initiatives of AI/ML services.
- Highly proficient in Python and Typescript.
Compensation
The salary range for this role is between $190,000 and $250,000. Total compensation also includes stock options and benefits.
Equal Employment Opportunity
We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability, or other legally protected statuses.
Benefits Extracted with AI
- Stock options
- Remote work flexibility
Similar jobs
Last update: 23 minutes ago
Senior Backend Engineer (AI)
Join Stability AI as a Senior Backend Engineer to develop REST APIs and AI/ML services for Generative AI models.
Senior Data Platform Engineer
Senior Data Platform Engineer specializing in AWS and GCP services, data pipelines, and cloud infrastructure.
AI Architect
Lead AI Architect role focusing on generative AI, computer vision, and machine learning in a transformative work environment.
Worldwide Specialist Solutions Architect - GenAI/ML
Join AWS as a Specialist Solutions Architect in GenAI/ML, leveraging AWS services to design scalable solutions.
Deep Learning Architect, Generative AI
Join AWS as a Deep Learning Architect in Rome to innovate with Generative AI and transform business opportunities.
Senior AI/ML Architect
Lead AI/ML architecture at GlobalLogic, designing scalable ML models and infrastructure. Requires 6+ years experience, Python, cloud platforms.
Cloud Solution Architect - Generative AI
Join Microsoft as a Cloud Solution Architect focusing on Generative AI, driving innovation and AI transformation on the Microsoft platform.
Lead Software Engineer - GenAI
Join Wellhub as a Lead Software Engineer in GenAI, focusing on AI development, API integration, and leadership in a remote role.
Senior Deep Learning Architect, Generative AI
Join AWS as a Senior Deep Learning Architect to innovate with Generative AI and transform business opportunities.
Senior Software Engineer, Platform
Join Thoughtful AI as a Senior Software Engineer, Platform. Lead, craft, and empower in a remote role with competitive salary and benefits.
Fullstack Developer - React, Python, AWS, Generative AI
Remote Fullstack Developer role focusing on React, Python, AWS, and Generative AI for short-term projects.
Senior Data Engineer
Join Stability AI as a Senior Data Engineer to build scalable data infrastructure for AI models. Remote work from Germany.
Deep Learning Architect, Generative AI Innovation Center
Join AWS as a Deep Learning Architect in Milan to innovate with Generative AI and transform business opportunities.
Senior Fullstack Software Engineer, GenAI Allocation
Senior Fullstack Engineer role focusing on AI applications, requiring AWS, OOP, and problem-solving skills.
AI Solutions Architect
Join FinThrive as an AI Solutions Architect to lead AI development and optimization in healthcare technology.
Deep Learning Architect, AWS Generative AI Innovation Center
Join AWS as a Deep Learning Architect to innovate with Generative AI, solving real-world problems in a fast-paced environment.
Staff Backend Engineer - Runtime Team Lead
Join webAI as a Staff Backend Engineer to lead the Runtime Team, focusing on distributed systems and high-performance engineering.
Principal Software Architect
Join Caylent as a Principal Software Architect to lead cloud-native projects, engage with clients, and drive innovation using AWS.
Cloud Solution Architect - Artificial Intelligence (AI)
Join Microsoft as a Cloud Solution Architect specializing in AI and ML, driving customer transformation on Azure.
Senior API Developer with AI and Python Expertise
Join us as a Senior API Developer to build AI-driven solutions using Python and Golang. Work remotely with top-tier companies.
Remote Data Engineer - Research
Join Stability AI as a Remote Data Engineer to build scalable data infrastructure for AI models.
Senior Platform Engineer - Scale GenAI Platform
Senior Platform Engineer needed for Scale GenAI Platform in Budapest. Focus on AI, cloud platforms, and system integration.
Fullstack Software Engineer, GenAI Growth
Fullstack Software Engineer role focusing on AI growth, requiring skills in back-end and front-end development, and system design.
Senior Software Engineer, AI
Join as a Senior Software Engineer, AI, to innovate AI features in a remote-friendly environment. Enhance CRM with cutting-edge AI.