Role Overview
Join NICE as a Senior Cloud Site Reliability Engineer and take on the challenge of improving the reliability and availability of our mission-critical cloud-based services. This role is pivotal in managing and enhancing the infrastructure that supports large, regionally distributed SaaS platforms.
Key Responsibilities
- On-call Support: Provide essential support during major platform incidents to ensure rapid resolution and minimal downtime.
- Automation and Efficiency: Identify and implement automation opportunities to enhance the efficiency and scalability of our SaaS infrastructure.
- Observability Platform Management: Manage and maintain a Grafana-based observability platform to monitor service health across the fleet.
- Release Engineering: Participate in the release process to ensure safe and reliable deployment of new features and updates.
- Middleware and Tooling Development: Develop custom solutions to improve observability, reliability, and operability of the SaaS platform.
- Collaboration with Development Teams: Work closely with developers to ensure production readiness and compliance with reliability standards.
How You'll Make an Impact
- Enhanced Monitoring: Create dashboards and metrics for real-time observability of application performance.
- Reliability Consulting: Provide expert advice on SRE practices to development teams, helping them enhance application reliability.
- Incident Management: Lead efforts in data and performance analysis to pinpoint and rectify root causes of disruptions.
- Knowledge Sharing: Mentor junior SREs and share knowledge to uplift the team's overall skill set.
What You Bring to the Team
- Experience: At least 4 years in programming/scripting with languages like Go, Python, .Net (C#), Node.js.
- Cloud Expertise: Proven background in managing public or private cloud environments, particularly AWS or Azure.
- SRE/DevOps Background: Solid experience in site reliability engineering, DevOps, or related fields.
- Certifications: Kubernetes certification is a plus.
Why Join NICE?
NICE offers a dynamic, innovative environment where you can grow professionally. Enjoy our NICE-FLEX hybrid work model, comprehensive benefits, and a culture that fosters diversity and inclusion.
About NICE
NICE Ltd. is a global leader in enterprise software solutions, empowering organizations through advanced analytics and cloud technologies. With over 8,500 employees worldwide, NICE is dedicated to excellence and innovation.
Benefits Extracted with AI
- Hybrid work model (2 days in-office, 3 days remote)
- Opportunities for internal career advancement
- Comprehensive health insurance
- Paid vacation
Similar jobs
Last update: 23 minutes ago
Site Reliability Engineer (SRE) - Stability AI
Join Stability AI as a Site Reliability Engineer (SRE) to enhance cloud infrastructure and system reliability. Remote work available.
Senior Site Reliability Engineer
Join Microsoft as a Senior Site Reliability Engineer to design and deliver Office 365 government cloud services.
Senior Site Reliability Engineer
Join MongoDB as a Senior Site Reliability Engineer in Berlin to design and build global cloud infrastructure, ensuring reliability and performance.
Senior Site Reliability Engineer (SRE) - Hasura Cloud
Join Hasura as a Senior Site Reliability Engineer to maintain and scale Hasura Cloud. Remote role in the US with competitive salary and benefits.
Senior Site Reliability Engineer
Join Valtech as a Senior Site Reliability Engineer in Sofia, Bulgaria. Work with AWS, GCP, and Azure in a hybrid environment.
Site Reliability Engineering Manager
Lead a DevOps team in a dynamic IT environment, focusing on reliability engineering and cloud solutions.
Senior Site Reliability Engineer (AWS, Node.js)
Join Tint as a Senior Site Reliability Engineer to enhance AWS infrastructure efficiency and reliability. Remote role in the US.
Senior Site Reliability Engineer
Join Algolia as a Senior Site Reliability Engineer to enhance search product reliability and scalability. Remote work available.
Site Reliability Engineer
Join ING as a Site Reliability Engineer in Amsterdam. Tackle challenges in monitoring, resilience design, and lead SRE sessions.
Senior Site Reliability Engineer (SRE) - Hasura Cloud
Join Hasura as a Senior Site Reliability Engineer to maintain and enhance Hasura Cloud's reliability and performance.
Senior Cloud Engineer
Join as a Senior Cloud Engineer to architect and deploy cloud solutions using Azure, AWS, and GCP. Lead innovation in cloud technology.
Senior Site Reliability Expert
Join Lightspeed as a Senior Site Reliability Expert in Amsterdam. Work on cloud infrastructure, automation, and high availability systems.
Senior Software Engineer - Cloud Platform Reliability
Join CrowdStrike as a Senior Software Engineer focusing on cloud platform reliability and scalability in a remote-first role.
Senior Site Reliability Engineer
Senior Site Reliability Engineer at IBM in Cracow, skilled in AWS, Kubernetes, Linux, and Terraform.
Site Reliability Engineer - Enablement
Join Happening as a Site Reliability Engineer to enhance gaming operations' performance and reliability using Kubernetes, Terraform, and more.
Senior Site Reliability/DevOps Engineer (Hybrid)
Senior DevOps Engineer role in Manassas, VA focusing on site reliability, system analysis, and high availability systems.
Site Reliability Engineer, FlashArray
Join Pure Storage as a Site Reliability Engineer in Prague, focusing on cloud infrastructure uptime and incident response.
Site Reliability Engineer (SRE) - Hasura Cloud
Join Hasura as a Site Reliability Engineer to ensure smooth operation of Hasura Cloud systems, working remotely from India.
Senior Site Reliability Engineer
Join Adyen as a Senior Site Reliability Engineer in Amsterdam to ensure platform stability and reliability through automation and troubleshooting.
Senior Production SRE Engineer - Storage
Join NVIDIA as a Senior Production SRE Engineer - Storage, ensuring reliability of GPU cloud services with cutting-edge technologies.
Software Reliability Engineer
Join Anduril Industries as a Software Reliability Engineer in Seattle, WA. Develop cutting-edge software for electronic warfare systems.
SRE Lead at IBM
Lead SRE role at IBM, overseeing system reliability, implementing best practices, and mentoring in New York.
Senior Site Reliability Engineer - Production Platform
Join Adyen as a Senior Site Reliability Engineer in Amsterdam, focusing on automation, containerization, and distributed systems.
Senior Software Engineer, Site Reliability Engineering
Senior Software Engineer role in Site Reliability at Google, Dublin. Focus on large-scale systems and automation.