Introduction
IBM Technology Zone is the one stop shop for IBMers and business partners to build, show, and share solutions built on IBM technologies to facilitate opportunity progression and customer adoption.
This is a new role in TechZone that provides leadership in designing, implementing, and managing the overall observability framework for the organization. This includes setting up monitoring systems, logging tools, and tracing solutions to provide comprehensive visibility into the system's performance, health, and usage patterns. Additionally, they collaborate with various teams to integrate observability into their development workflows, ensuring that issues can be detected and resolved quickly. The Observability Lead also contributes to the continuous improvement of the observability practice by staying updated on latest trends, technologies, and best practices.
Your Role and Responsibilities
- Develop and implement comprehensive observability solutions, including monitoring, logging, tracing, and alerting systems.
- Work closely with developers, DevOps, and SRE teams to understand their requirements and ensure the observability tools meet their needs.
- Continuously improve the observability infrastructure to ensure system reliability, performance, and scalability.
- Collaborate in building and maintaining TechZone automation that deploys and provisions environments at scale.
- Participate in incident response efforts, providing valuable insights through observability data to diagnose and resolve issues quickly.
- Analyze observability data to identify trends, bottlenecks, and potential issues. Generate reports and dashboards for stakeholders.
- Evaluate and integrate new observability tools and technologies to enhance our monitoring capabilities.
- Develop and promote best practices for observability within the organization, including documentation and training for engineering teams.
Required Technical and Professional Expertise
- Experience: Proven experience in observability, monitoring, or related fields, with a strong understanding of modern observability practices.
- Technical Skills: Proficiency with observability tools such as Prometheus, Grafana, ELK stack, Jaeger, or similar. Strong scripting and automation skills (e.g., Python, Bash).
- System Knowledge: In-depth knowledge of distributed systems, microservices architecture, and cloud platforms (e.g., AWS, Azure, GCP).
- Problem-Solving: Excellent analytical and problem-solving skills, with the ability to diagnose complex issues using observability data.
- Communication: Strong communication skills, with the ability to collaborate effectively with cross-functional teams and present findings to stakeholders.
- Education: Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent practical experience.
Preferred Technical And Professional Expertise
- Preferred consideration will be given to candidates with a development background and demonstrated programming experience.
- Preference to candidates with cloud architecture experience.
Benefits Extracted with AI
- Pension plan
Similar jobs
Last update: 23 minutes ago
Lead Developer with DevOps and Functional Programming
Join Reaktor as a Lead Developer in Amsterdam, focusing on DevOps, Functional Programming, and JavaScript in a hybrid work environment.
Senior Systems Engineer, Managed Operations
Join AWS as a Senior Systems Engineer in Berlin to lead operations for the European Sovereign Cloud, ensuring high-availability AWS services.
Associate Integration Solutions Technical Lead
Join EIB as an Associate Integration Solutions Technical Lead in Luxembourg, driving seamless integration solutions with cutting-edge technologies.
Cloud Solution Engineer (IC4)
Join Oracle as a Cloud Solution Engineer to design and deploy cloud architectures, driving customer success in Amsterdam.
Cloud Engineer
Join Tibo Energy as a Cloud Engineer to drive energy transition with cloud architecture skills in a dynamic team.
Oracle Cloud Engineer
Join Albert Heijn as an Oracle Cloud Engineer to drive automation and manage cloud infrastructure in Zaandam, Netherlands.
Staff Software Engineer: Data & Infrastructure Platforms - Metrics & Alerting
Join Uber's Amsterdam team as a Staff Software Engineer focusing on Data & Infrastructure Platforms, specializing in Metrics & Alerting.
Production Engineer
Join Optiver as a Production Engineer in Amsterdam to manage live trading environments and enhance system reliability and performance.
Senior Elastic Stack as a Service (ELKaaS) DevOps Engineer
Join ING as a Senior DevOps Engineer to enhance our ELKaaS platform, leveraging Docker, Kubernetes, and Azure in a hybrid work environment.
Senior Backend Developer with TypeScript
Join Tibo Energy as a Senior Backend Developer to lead TypeScript-based solutions in energy management.
Senior Backend Developer (Node.js) / SRE
Join Binance as a Senior Backend Developer (Node.js) / SRE to develop monitoring systems for high-load production environments.
Senior Java Developer
Join our team as a Senior Java Developer to design and develop high-quality software applications in a dynamic, hybrid work environment.
Senior Full-Stack Engineer ReactJS/NodeJS
Join Gorgias as a Senior Full-Stack Engineer specializing in ReactJS and NodeJS, enhancing AI-powered ecommerce solutions.
Staff Software Engineer
Join Aiven as a Staff Software Engineer to develop cloud operations platforms using open-source technologies. Hybrid work in Berlin.
Full Stack Team Leader .Net
Lead a remote full-stack team with .NET and Angular expertise, focusing on technical leadership and hands-on development.
Senior Software Engineer: Configuration Management/Deployment
Join Uber's Amsterdam team as a Senior Software Engineer focusing on configuration management and deployment. Solve infrastructure challenges at scale.
Senior Fullstack Developer for AI-Driven Mission Technologies
Seeking a Senior Fullstack Developer for AI-driven mission technologies, focusing on Java, JavaScript, Python, and C++. Remote work available.
Senior Security Engineer
Join Atos as a Senior Security Engineer in Amstelveen, focusing on cybersecurity, network security, and vulnerability management.
Staff Software Engineer, Fullstack, Capacity & Efficiency Engineering
Join Uber as a Staff Software Engineer in Amsterdam, focusing on fullstack development and capacity efficiency engineering.
Senior IoT Engineer
Join Skytree as a Senior IoT Engineer to lead IoT projects, focusing on Azure IoT solutions, edge computing, and data pipelines.
FullStack Engineer (Infrastructure Monitoring)
Join Coralogix as a FullStack Engineer to develop scalable solutions for Infrastructure Monitoring. Master Angular, Python, AWS, and more.
Senior Software Engineer (Elasticsearch)
Join Uber as a Senior Software Engineer specializing in Elasticsearch, working on sophisticated infrastructure across data centers and cloud.
Senior Solutions Engineer
Join Reddit as a Senior Solutions Engineer in Amsterdam to support our growing advertising business with technical expertise and problem-solving skills.
Software Development Lead
Join Monument Belgium as a Software Development Lead to oversee and align development practices across Belgian locations.