NICE logo

Senior Cloud Site Reliability Engineer

NICE

Role Overview

Join NICE as a Senior Cloud Site Reliability Engineer and take on the challenge of improving the reliability and availability of our mission-critical cloud-based services. This role is pivotal in managing and enhancing the infrastructure that supports large, regionally distributed SaaS platforms.

Key Responsibilities

  • On-call Support: Provide essential support during major platform incidents to ensure rapid resolution and minimal downtime.
  • Automation and Efficiency: Identify and implement automation opportunities to enhance the efficiency and scalability of our SaaS infrastructure.
  • Observability Platform Management: Manage and maintain a Grafana-based observability platform to monitor service health across the fleet.
  • Release Engineering: Participate in the release process to ensure safe and reliable deployment of new features and updates.
  • Middleware and Tooling Development: Develop custom solutions to improve observability, reliability, and operability of the SaaS platform.
  • Collaboration with Development Teams: Work closely with developers to ensure production readiness and compliance with reliability standards.

How You'll Make an Impact

  • Enhanced Monitoring: Create dashboards and metrics for real-time observability of application performance.
  • Reliability Consulting: Provide expert advice on SRE practices to development teams, helping them enhance application reliability.
  • Incident Management: Lead efforts in data and performance analysis to pinpoint and rectify root causes of disruptions.
  • Knowledge Sharing: Mentor junior SREs and share knowledge to uplift the team's overall skill set.

What You Bring to the Team

  • Experience: At least 4 years in programming/scripting with languages like Go, Python, .Net (C#), Node.js.
  • Cloud Expertise: Proven background in managing public or private cloud environments, particularly AWS or Azure.
  • SRE/DevOps Background: Solid experience in site reliability engineering, DevOps, or related fields.
  • Certifications: Kubernetes certification is a plus.

Why Join NICE?

NICE offers a dynamic, innovative environment where you can grow professionally. Enjoy our NICE-FLEX hybrid work model, comprehensive benefits, and a culture that fosters diversity and inclusion.

About NICE

NICE Ltd. is a global leader in enterprise software solutions, empowering organizations through advanced analytics and cloud technologies. With over 8,500 employees worldwide, NICE is dedicated to excellence and innovation.

Benefits
Extracted with AI

  • Hybrid work model (2 days in-office, 3 days remote)
  • Opportunities for internal career advancement
  • Comprehensive health insurance
  • Paid vacation

Similar jobs

Last update: 23 minutes ago

Stability AI logo
Stability AI

Site Reliability Engineer (SRE) - Stability AI

Join Stability AI as a Site Reliability Engineer (SRE) to enhance cloud infrastructure and system reliability. Remote work available.

Microsoft logo
Microsoft

Senior Site Reliability Engineer

Join Microsoft as a Senior Site Reliability Engineer to design and deliver Office 365 government cloud services.

MongoDB logo
MongoDB

Senior Site Reliability Engineer

Join MongoDB as a Senior Site Reliability Engineer in Berlin to design and build global cloud infrastructure, ensuring reliability and performance.

Hasura logo
Hasura

Senior Site Reliability Engineer (SRE) - Hasura Cloud

Join Hasura as a Senior Site Reliability Engineer to maintain and scale Hasura Cloud. Remote role in the US with competitive salary and benefits.

Valtech logo
Valtech

Senior Site Reliability Engineer

Join Valtech as a Senior Site Reliability Engineer in Sofia, Bulgaria. Work with AWS, GCP, and Azure in a hybrid environment.

The Workshop logo
The Workshop

Site Reliability Engineering Manager

Lead a DevOps team in a dynamic IT environment, focusing on reliability engineering and cloud solutions.

Tint logo
Tint

Senior Site Reliability Engineer (AWS, Node.js)

Join Tint as a Senior Site Reliability Engineer to enhance AWS infrastructure efficiency and reliability. Remote role in the US.

Algolia logo
Algolia

Senior Site Reliability Engineer

Join Algolia as a Senior Site Reliability Engineer to enhance search product reliability and scalability. Remote work available.

ING logo
ING

Site Reliability Engineer

Join ING as a Site Reliability Engineer in Amsterdam. Tackle challenges in monitoring, resilience design, and lead SRE sessions.

Hasura logo
Hasura

Senior Site Reliability Engineer (SRE) - Hasura Cloud

Join Hasura as a Senior Site Reliability Engineer to maintain and enhance Hasura Cloud's reliability and performance.

Inclusively logo
Inclusively

Senior Cloud Engineer

Join as a Senior Cloud Engineer to architect and deploy cloud solutions using Azure, AWS, and GCP. Lead innovation in cloud technology.

Lightspeed Commerce logo
Lightspeed Commerce

Senior Site Reliability Expert

Join Lightspeed as a Senior Site Reliability Expert in Amsterdam. Work on cloud infrastructure, automation, and high availability systems.

CrowdStrike logo
CrowdStrike

Senior Software Engineer - Cloud Platform Reliability

Join CrowdStrike as a Senior Software Engineer focusing on cloud platform reliability and scalability in a remote-first role.

IBM logo
IBM

Senior Site Reliability Engineer

Senior Site Reliability Engineer at IBM in Cracow, skilled in AWS, Kubernetes, Linux, and Terraform.

Happening logo
Happening

Site Reliability Engineer - Enablement

Join Happening as a Site Reliability Engineer to enhance gaming operations' performance and reliability using Kubernetes, Terraform, and more.

Swift logo
Swift

Senior Site Reliability/DevOps Engineer (Hybrid)

Senior DevOps Engineer role in Manassas, VA focusing on site reliability, system analysis, and high availability systems.

Pure Storage logo
Pure Storage

Site Reliability Engineer, FlashArray

Join Pure Storage as a Site Reliability Engineer in Prague, focusing on cloud infrastructure uptime and incident response.

Hasura logo
Hasura

Site Reliability Engineer (SRE) - Hasura Cloud

Join Hasura as a Site Reliability Engineer to ensure smooth operation of Hasura Cloud systems, working remotely from India.

Adyen logo
Adyen

Senior Site Reliability Engineer

Join Adyen as a Senior Site Reliability Engineer in Amsterdam to ensure platform stability and reliability through automation and troubleshooting.

NVIDIA logo
NVIDIA

Senior Production SRE Engineer - Storage

Join NVIDIA as a Senior Production SRE Engineer - Storage, ensuring reliability of GPU cloud services with cutting-edge technologies.

Anduril Industries logo
Anduril Industries

Software Reliability Engineer

Join Anduril Industries as a Software Reliability Engineer in Seattle, WA. Develop cutting-edge software for electronic warfare systems.

IBM logo
IBM

SRE Lead at IBM

Lead SRE role at IBM, overseeing system reliability, implementing best practices, and mentoring in New York.

Adyen logo
Adyen

Senior Site Reliability Engineer - Production Platform

Join Adyen as a Senior Site Reliability Engineer in Amsterdam, focusing on automation, containerization, and distributed systems.

Google logo
Google

Senior Software Engineer, Site Reliability Engineering

Senior Software Engineer role in Site Reliability at Google, Dublin. Focus on large-scale systems and automation.