Stability AI logo

Site Reliability Engineer (SRE) - Stability AI

Stability AI

About Stability AI

Stability AI is a community and mission-driven, open artificial intelligence company that cares deeply about real-world implications and applications. Our most considerable advances grow from our diversity in working across multiple teams and disciplines. We are unafraid to go against established norms and explore creativity. We are motivated to generate breakthrough ideas and convert them into tangible solutions. Our vibrant communities consist of experts, leaders, and partners across the globe who are developing cutting-edge open AI models for Image, Language, Audio, Video, and 3D.

Job Description

Stability AI’s Security team is looking for a Site Reliability Engineer (SRE) to help shape our cloud infrastructure. The person will closely work with IT, security, SRE and engineering teams to improve reliability across our environment. Candidates should have the initiative to build and improve a maturing cloud landscape.

Responsibilities

  • Implementing and maintaining infrastructure as code using Terraform
  • Supporting container orchestration platforms such as Kubernetes or ECS
  • Participating in incident management and root cause analysis to improve system reliability
  • Contributing to cloud security practices and resource tagging strategies

Qualifications

  • Collaborating with development teams to enhance CI/CD pipelines
  • Cloud security experience
  • Training and working with generative models
  • Background in software development or automation scripting
  • Knowledge of Grafana, ELK stack, or similar tools
  • Involvement in the SRE or DevOps community

Equal Employment Opportunity

We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or other legally protected statuses.

Benefits
Extracted with AI

  • Remote work flexibility

Similar jobs

Last update: 23 minutes ago

Amazon Web Services (AWS) logo
Amazon Web Services (AWS)

Senior Systems Engineer, Managed Operations

Join AWS as a Senior Systems Engineer in Berlin to lead operations for the European Sovereign Cloud, ensuring high-availability AWS services.

Hasura logo
Hasura

Site Reliability Engineer (SRE) - Hasura Cloud

Join Hasura as a Site Reliability Engineer to ensure smooth operation of Hasura Cloud systems, working remotely from India.

Binance logo
Binance

Senior Backend Developer (Node.js) / SRE

Join Binance as a Senior Backend Developer (Node.js) / SRE to develop monitoring systems for high-load production environments.

RightCrowd logo
RightCrowd

Full Stack Engineer with Node.js and React

Join RightCrowd as a Full Stack Engineer to develop cloud-native applications using Node.js and React. Work remotely with cutting-edge technology.

Aiven logo
Aiven

Staff Software Engineer

Join Aiven as a Staff Software Engineer to develop cloud operations platforms using open-source technologies. Hybrid work in Berlin.

NICE logo
NICE

Senior Cloud Site Reliability Engineer

Senior Cloud Site Reliability Engineer role focusing on enhancing cloud service reliability and efficiency.

Hasura logo
Hasura

Senior Site Reliability Engineer (SRE) - Hasura Cloud

Join Hasura as a Senior Site Reliability Engineer to maintain and scale Hasura Cloud. Remote role in the US with competitive salary and benefits.

Hasura logo
Hasura

Senior Site Reliability Engineer (SRE) - Hasura Cloud

Join Hasura as a Senior Site Reliability Engineer to maintain and enhance Hasura Cloud's reliability and performance.

Happening logo
Happening

Site Reliability Engineer - Enablement

Join Happening as a Site Reliability Engineer to enhance gaming operations' performance and reliability using Kubernetes, Terraform, and more.

yourfirm GmbH logo
yourfirm GmbH

Senior Fullstack Developer for AI-Driven Mission Technologies

Seeking a Senior Fullstack Developer for AI-driven mission technologies, focusing on Java, JavaScript, Python, and C++. Remote work available.

Stability AI logo
Stability AI

Remote Data Engineer - Research

Join Stability AI as a Remote Data Engineer to build scalable data infrastructure for AI models.

Stability AI logo
Stability AI

Senior Backend Engineer (AI)

Join Stability AI as a Senior Backend Engineer to develop REST APIs and AI/ML services for Generative AI models.

GitLab logo
GitLab

Site Reliability Engineer - Delivery: Deployments, North America

Remote Site Reliability Engineer specializing in Delivery: Deployments at GitLab, focusing on improving delivery platforms and tooling.

Algolia logo
Algolia

Senior Site Reliability Engineer

Join Algolia as a Senior Site Reliability Engineer to enhance search product reliability and scalability. Remote work available.

Stability AI logo
Stability AI

Senior Data Engineer

Join Stability AI as a Senior Data Engineer to build scalable data infrastructure for AI models. Remote work from Germany.

Reddit, Inc. logo
Reddit, Inc.

Senior Solutions Engineer

Join Reddit as a Senior Solutions Engineer in Amsterdam to support our growing advertising business with technical expertise and problem-solving skills.

OpenAI logo
OpenAI

Senior Software Engineer, Observability

Join OpenAI as a Senior Software Engineer in Observability, ensuring system reliability and scalability in a fast-paced environment.

netgo logo
netgo

Senior Cloud DevOps Engineer

Join netgo as a Senior Cloud DevOps Engineer in Berlin. Work with Kubernetes, GitOps, and more in a dynamic team environment.

Stability AI logo
Stability AI

Senior Data Platform Engineer

Senior Data Platform Engineer specializing in AWS and GCP services, data pipelines, and cloud infrastructure.

Coralogix logo
Coralogix

FullStack Engineer (Infrastructure Monitoring)

Join Coralogix as a FullStack Engineer to develop scalable solutions for Infrastructure Monitoring. Master Angular, Python, AWS, and more.

IBM logo
IBM

SRE Lead at IBM

Lead SRE role at IBM, overseeing system reliability, implementing best practices, and mentoring in New York.

MongoDB logo
MongoDB

Senior Site Reliability Engineer

Join MongoDB as a Senior Site Reliability Engineer in Berlin to design and build global cloud infrastructure, ensuring reliability and performance.

Climatiq logo
Climatiq

Senior Full Stack Engineer - Climate Tech - Rust & TypeScript

Join Climatiq as a Senior Full Stack Engineer to develop climate tech solutions using Rust and TypeScript. Remote work available.

Redcare Pharmacy logo
Redcare Pharmacy

Senior DevOps Engineer with Linux, Kubernetes, and GCP

Join Redcare Pharmacy as a Senior DevOps Engineer to enhance infrastructure efficiency using Linux, Kubernetes, and GCP.