Overview
AB Tasty is a global leader in AI-powered experience optimization solutions, empowering brands to build better experiences on their websites and apps through personalization, experimentation, recommendations, and search. Integrated into a single platform, AB Tasty offers web and API-based solutions that give companies a unified approach to creating seamless experiences for customers.
Role Summary
We are in search of a seasoned Lead Site Reliability Engineer (SRE) to steer our SRE team. This pivotal role involves leading the technical development and strategic implementation of our monitoring stack, SLI/SLO initiatives in collaboration with the product team, and enhancing the reliability and performance of our services.
Responsibilities
- Lead the technical direction for the SRE team in migrating and managing services across AWS and GCP platforms, ensuring high availability, performance, and reliability.
- Collaborate with product and technical teams to define and implement SLIs and SLOs that align with business needs.
- Design, implement, and manage monitoring solutions to ensure proactive incident management and operational excellence.
- Develop and maintain infrastructure as code using tools like Terraform, ensuring scalable and maintainable infrastructure.
- Architect and implement automation for deployment, scaling, and management of systems using scripting languages like Python or Go.
- Drive the adoption of best practices in security, compliance, and reliability across the organization.
- Foster a culture of continuous improvement by leading post-mortem analysis and implementing preventive measures to avoid the recurrence of incidents.
- Develop comprehensive documentation for system architectures, configurations, processes, and service records.
What We Offer
- Huge impact on a publicly accessible SaaS platform used by all our clients.
- No micromanaging, full ownership of your tasks.
- International reach, with a highly international audience and team.
- Continuous education with many opportunities for professional and non-professional growth.
- Unique career opportunities in a fast-growing tech industry.
- Lots of fun with team games, drinks, yoga classes, parties, and a company-wide retreat every year.
- Remote working flexibility, with a policy allowing up to 3 days of remote work per week.
- Time for introspection, with a Retreat Day after a year at AB Tasty.