Join NVIDIA as a Senior DevOps Engineer
NVIDIA is seeking a highly skilled and motivated Senior DevOps Engineer to join our Data and Application Services team. This role is pivotal in enhancing our growing services infrastructure, with a focus on our multi-tenant Kubernetes platform designed to run a variety of in-house application services.
Key Responsibilities
- Service Ownership: Collaborate with cross-functional teams to own and improve the services you build.
- Frequent Testing and Deployment: Be comfortable with frequent code testing and deployment to ensure service reliability.
- Infrastructure Automation: Continuously improve infrastructure provisioning and management using automation tools.
- Service Resiliency: Identify areas to enhance service resiliency through industry-standard practices.
- Multi-Cloud Environment: Support a globally distributed, multi-cloud hybrid environment including AWS, GCP, and on-premises solutions.
- Incident Management: Determine root causes for production-level incidents and write high-quality RCA reports.
- Operational Excellence: Ensure the highest level of uptime and Quality of Service (QoS) for internal customers.
- Service Level Objectives: Define SLOs and SLIs to represent and measure service quality.
- On-Call Rotation: Participate in the team's on-call rotation to provide 24/7 support.
Required Skills and Experience
- Experience: 7+ years operating services such as web servers, load balancers, databases, messaging systems, and storage solutions.
- Programming: 3+ years of coding/scripting in at least two high-level programming languages such as Python, Go, Ruby, or Groovy.
- Linux and Networking: Deep understanding of Linux operating systems and TCP/IP fundamentals.
- Cloud Expertise: Expertise with at least one major cloud service provider (AWS, GCP, or Azure).
- CI/CD and GitOps: Proficient in modern CI/CD techniques, GitOps, and Infrastructure as Code (IaC).
- Observability: Hands-on experience running production-quality observability stacks.
- Problem Solving: Creative problem solver with excellent debugging skills.
- Education: B.S. degree in Computer Science or related technical field (or equivalent experience).
- Communication: Detail-oriented with great communication and documentation skills.
Preferred Qualifications
- Linux Certification: Certification from a well-known vendor such as Red Hat or Oracle.
- Kubernetes: Prior experience managing large-scale Kubernetes deployments in production.
- Container Networking: Strong skills in modern container networking and storage architecture.
Compensation and Benefits
- Salary Range: €164,000 - €310,500 per year.
- Equity: Eligible for equity.
- Benefits: Comprehensive benefits package including health insurance and paid vacation.
NVIDIA is committed to fostering a diverse work environment and is proud to be an equal opportunity employer. We highly value diversity in our current and future employees and do not discriminate based on race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.
Join us in shaping the future of accelerated computing and AI. Apply today to be part of a team that is redefining the industry.