Job Overview
Hasura is seeking a Senior Site Reliability Engineer (SRE) to join our team and help maintain the reliability and performance of Hasura Cloud. This role is crucial to ensuring that our systems run smoothly and that updates are deployed without downtime.
Key Responsibilities
- Infrastructure Development: Build and maintain our infrastructure using tools like Terraform, Kubernetes, VMs, and bare metal instances.
- Scalability: Design and implement core infrastructure components to support thousands of concurrent requests.
- Cloud Expansion: Work on expanding Hasura Cloud to support multiple cloud providers.
- Deployment Process: Enhance the deployment process to ensure reliability and efficiency.
- Incident Response: Participate in a PagerDuty rotation to handle availability incidents and support service engineers.
- Systemic Issue Resolution: Use development time to address and prevent systemic issues.
- Monitoring and Alerts: Design monitoring systems that alert on symptoms rather than causes.
- Documentation and Automation: Document actions and automate repeatable tasks.
- Debugging: Troubleshoot production issues across services and stack levels.
- Infrastructure Growth Planning: Plan for the growth of Hasura Cloud's infrastructure.
Requirements
- Experience: 4+ years in a similar role.
- System Thinking: Ability to think about systems, edge cases, and failure modes.
- Linux and Unix Shell: Proficiency in navigating and using these systems.
- Infrastructure Tools: Experience with Terraform and similar tools.
- Programming Skills: Strong skills in Go and Python.
- Collaboration: Value asynchronous communication with a globally distributed team.
- Documentation: Enjoy documenting processes so that the same lessons do not have to be relearned.
- Automation: Passion for building automation and tooling.
- Cloud Providers: Experience with AWS, GCP, Azure, and their APIs.
- Monitoring Tools: Familiarity with Honeycomb, Datadog, Prometheus, and Grafana.
Nice to Have
- Hasura Experience: Familiarity with Hasura and its GraphQL APIs.
- SQL and PostgreSQL: Strong fundamentals in SQL, especially with PostgreSQL.
- Database Management: Experience in database management and scaling.
Location
This position is remote, based in India, with an option to work from our Bangalore office.
Working at Hasura
At Hasura, we empower developers to build modern apps quickly. Our team is dedicated to improving the developer experience and making our tools as user-friendly as possible.
Perks
- Remote & Hybrid Work Environment: Flexible work model with options for remote or in-office work.
- Self-care Fridays: The second Friday of every month is a day off for personal rejuvenation.
- Equipment and Learning Allowance: Budgets for necessary tools and learning opportunities.
- Donation Matching: Annual fund to match donations to global organizations.
- Flexible Timings & PTO: Freedom to set work schedules and generous paid time off.
Applying
We encourage applications even if you don't meet all the requirements. We value diverse perspectives and are open to discussing how you can contribute to our team.
Hasura is an equal opportunity employer, committed to diversity and inclusion in the workplace.