About Stability AI
Stability AI is a community and mission-driven, open artificial intelligence company that cares deeply about real-world implications and applications. Our most considerable advances grow from our diversity in working across multiple teams and disciplines. We are unafraid to go against established norms and explore creativity. We are motivated to generate breakthrough ideas and convert them into tangible solutions. Our vibrant communities consist of experts, leaders, and partners across the globe who are developing cutting-edge open AI models for Image, Language, Audio, Video, and 3D.
Job Description
Stability AI’s Security team is looking for a Site Reliability Engineer (SRE) to help shape our cloud infrastructure. The person will closely work with IT, security, SRE and engineering teams to improve reliability across our environment. Candidates should have the initiative to build and improve a maturing cloud landscape.
Responsibilities
- Implementing and maintaining infrastructure as code using Terraform
- Supporting container orchestration platforms such as Kubernetes or ECS
- Participating in incident management and root cause analysis to improve system reliability
- Contributing to cloud security practices and resource tagging strategies
Qualifications
- Collaborating with development teams to enhance CI/CD pipelines
- Cloud security experience
- Training and working with generative models
- Background in software development or automation scripting
- Knowledge of Grafana, ELK stack, or similar tools
- Involvement in the SRE or DevOps community
Equal Employment Opportunity
We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or other legally protected statuses.
Benefits Extracted with AI
- Remote work flexibility
Similar jobs
Last update: 23 minutes ago
Senior Systems Engineer, Managed Operations
Join AWS as a Senior Systems Engineer in Berlin to lead operations for the European Sovereign Cloud, ensuring high-availability AWS services.
Site Reliability Engineer (SRE) - Hasura Cloud
Join Hasura as a Site Reliability Engineer to ensure smooth operation of Hasura Cloud systems, working remotely from India.
Senior Backend Developer (Node.js) / SRE
Join Binance as a Senior Backend Developer (Node.js) / SRE to develop monitoring systems for high-load production environments.
Full Stack Engineer with Node.js and React
Join RightCrowd as a Full Stack Engineer to develop cloud-native applications using Node.js and React. Work remotely with cutting-edge technology.
Staff Software Engineer
Join Aiven as a Staff Software Engineer to develop cloud operations platforms using open-source technologies. Hybrid work in Berlin.
Senior Cloud Site Reliability Engineer
Senior Cloud Site Reliability Engineer role focusing on enhancing cloud service reliability and efficiency.
Senior Site Reliability Engineer (SRE) - Hasura Cloud
Join Hasura as a Senior Site Reliability Engineer to maintain and scale Hasura Cloud. Remote role in the US with competitive salary and benefits.
Senior Site Reliability Engineer (SRE) - Hasura Cloud
Join Hasura as a Senior Site Reliability Engineer to maintain and enhance Hasura Cloud's reliability and performance.
Site Reliability Engineer - Enablement
Join Happening as a Site Reliability Engineer to enhance gaming operations' performance and reliability using Kubernetes, Terraform, and more.
Senior Fullstack Developer for AI-Driven Mission Technologies
Seeking a Senior Fullstack Developer for AI-driven mission technologies, focusing on Java, JavaScript, Python, and C++. Remote work available.
Remote Data Engineer - Research
Join Stability AI as a Remote Data Engineer to build scalable data infrastructure for AI models.
Senior Backend Engineer (AI)
Join Stability AI as a Senior Backend Engineer to develop REST APIs and AI/ML services for Generative AI models.
Site Reliability Engineer - Delivery: Deployments, North America
Remote Site Reliability Engineer specializing in Delivery: Deployments at GitLab, focusing on improving delivery platforms and tooling.
Senior Site Reliability Engineer
Join Algolia as a Senior Site Reliability Engineer to enhance search product reliability and scalability. Remote work available.
Senior Data Engineer
Join Stability AI as a Senior Data Engineer to build scalable data infrastructure for AI models. Remote work from Germany.
Senior Solutions Engineer
Join Reddit as a Senior Solutions Engineer in Amsterdam to support our growing advertising business with technical expertise and problem-solving skills.
Senior Software Engineer, Observability
Join OpenAI as a Senior Software Engineer in Observability, ensuring system reliability and scalability in a fast-paced environment.
Senior Cloud DevOps Engineer
Join netgo as a Senior Cloud DevOps Engineer in Berlin. Work with Kubernetes, GitOps, and more in a dynamic team environment.
Senior Data Platform Engineer
Senior Data Platform Engineer specializing in AWS and GCP services, data pipelines, and cloud infrastructure.
FullStack Engineer (Infrastructure Monitoring)
Join Coralogix as a FullStack Engineer to develop scalable solutions for Infrastructure Monitoring. Master Angular, Python, AWS, and more.
SRE Lead at IBM
Lead SRE role at IBM, overseeing system reliability, implementing best practices, and mentoring in New York.
Senior Site Reliability Engineer
Join MongoDB as a Senior Site Reliability Engineer in Berlin to design and build global cloud infrastructure, ensuring reliability and performance.
Senior Full Stack Engineer - Climate Tech - Rust & TypeScript
Join Climatiq as a Senior Full Stack Engineer to develop climate tech solutions using Rust and TypeScript. Remote work available.
Senior DevOps Engineer with Linux, Kubernetes, and GCP
Join Redcare Pharmacy as a Senior DevOps Engineer to enhance infrastructure efficiency using Linux, Kubernetes, and GCP.