GitHub logo

Software Engineer II, Data Engineering

GitHub

Job Overview

Join GitHub as a Software Engineer II on the Copilot Metrics team, where you'll be at the forefront of data engineering. This role is pivotal in designing, developing, and maintaining efficient and reliable data pipelines. You'll collaborate with stakeholders across the company to gather business requirements, build data models, and ensure data quality and accessibility. Your expertise in Python, SQL, Airflow, and Spark will be crucial in optimizing our data infrastructure and enabling data-driven decision-making.

Responsibilities

Data Pipeline Development

  • Design, build, and maintain scalable data pipelines using Python, SQL, Airflow, and Spark.

Business Requirements Gathering

  • Collaborate with stakeholders to understand and translate business requirements into technical specifications.

Data Modeling

  • Develop and implement data models that support analytics and reporting needs, ensuring alignment with business goals.

Data Quality and Governance

  • Ensure data accuracy, consistency, and reliability by implementing robust data validation and quality checks.

Stakeholder Collaboration

  • Work with cross-functional teams, including data analysts, data scientists, and business leaders, to deliver high-quality data solutions.

Performance Optimization

  • Continuously monitor and optimize data pipelines for performance, scalability, and cost-efficiency.

Monitoring and Observability

  • Build and implement monitoring and observability metrics to ensure data quality and detect anomalies in data pipelines.

Documentation and Communication

  • Maintain clear and comprehensive documentation of data processes and effectively communicate technical concepts to non-technical stakeholders.

Qualifications

Required

  • 2+ years of experience in Software Engineering, Computer Science, or related technical discipline.
  • Proven experience maintaining production software coding in languages such as C, C++, C#, Java, JavaScript, Go, Ruby, Rust, or Python.
  • 2+ years of experience in data engineering or analytics engineering roles.
  • Strong proficiency in Python, SQL, Airflow, and Spark.
  • Extensive expertise in building and maintaining robust data pipelines and ETL processes.

Preferred

  • Familiarity with Go and Ruby.
  • Experience with cloud platforms such as AWS, GCP, or Azure.
  • Familiarity with data warehousing solutions (e.g., Snowflake, Redshift, BigQuery).
  • Knowledge of data governance and data security best practices.
  • Excellent verbal and written communication skills.
  • Proven ability to work effectively in a collaborative, cross-functional environment.

Compensation

The base salary range for this job is USD $75,000.00 - USD $198,900.00 per year. Additional benefits include annual bonuses and stock options, with opportunities for sales incentives based on revenue or utilization.

About GitHub

GitHub is the world’s leading AI-powered developer platform with 100 million developers and counting. We’re also home to the biggest open-source community on earth. At GitHub, our goal is to create the space you need to do your best work. We’re remote-first and offer competitive pay, generous learning and growth opportunities, and excellent benefits to support you, wherever you are.

Join us, and let’s change the world, together.

Benefits
Extracted with AI

  • Remote work
  • Competitive pay
  • Learning and growth opportunities
  • Annual bonus
  • Stock options
  • Diverse and inclusive environment

Similar jobs

Last update: 23 minutes ago

Remote Crew logo
Remote Crew

Senior Data Engineer

Join us as a Senior Data Engineer in Lisbon to design and maintain data infrastructure. Hybrid role with flexible benefits.

Stability AI logo
Stability AI

Remote Data Engineer - Research

Join Stability AI as a Remote Data Engineer to build scalable data infrastructure for AI models.

Messari logo
Messari

Data Engineer with Blockchain and Cloud Experience

Join Messari as a Data Engineer to design blockchain data models, build dashboards, and derive insights. Remote role with competitive benefits.

Timescale logo
Timescale

Software Engineer - Backend (Python, Go)

Join Timescale as a Software Engineer - Backend, focusing on Python and Go, in a remote role with flexible benefits.

Discord logo
Discord

Senior Software Engineer - Data Platform

Join Discord as a Senior Software Engineer on the Data Platform team, working with GCP, Airflow, and BigQuery.

SPREAD AI logo
SPREAD AI

FullStack Software Developer

Join SPREAD AI as a FullStack Software Developer to innovate in data management and engineering intelligence.

SPREAD AI logo
SPREAD AI

FullStack Software Developer

Join SPREAD AI as a FullStack Software Developer in Berlin. Work with JavaScript, Python, Go, and more in a hybrid setup.

Airbnb logo
Airbnb

Senior Backend Engineer, Media Foundation

Join Airbnb as a Senior Backend Engineer to build and operate next-gen Media infrastructure. Remote eligible within the US.

Intuit logo
Intuit

Software Engineer 2 - Platform Security

Join Intuit as a Software Engineer 2 in Platform Security, focusing on cloud infrastructure and security best practices.

GitHub logo
GitHub

Corporate Solutions Engineer

Join GitHub as a Corporate Solutions Engineer in Europe, providing pre- and post-sales support for EMEA customers.

Intuit logo
Intuit

Software Engineer 2

Join Intuit as a Software Engineer 2 in New York to drive security best practices and innovate in cloud technologies.

Refuel logo
Refuel

Software Engineer - Platform

Join Refuel as a Software Engineer - Platform to design and develop critical features using Python, AWS, and LLMs in a hybrid work environment.

Cedar logo
Cedar

Software Engineer II (Client Integrations)

Join Cedar as a Software Engineer II to build client integrations in a remote role, enhancing healthcare technology.

GovWell logo
GovWell

Founding Data Engineer

Join GovWell as a Founding Data Engineer to build scalable data infrastructure for modernizing government services.

PlushCare logo
PlushCare

Data Engineer II

Join Accolade as a Data Engineer II in Prague. Design and maintain cloud-native data infrastructure using AWS and modern technologies.

FutureHouse logo
FutureHouse

Software Engineer - Member of Technical Staff

Join FutureHouse as a Software Engineer to innovate AI systems for scientific research in San Francisco.

The Home Depot logo
The Home Depot

Remote Software Engineer II

Join The Home Depot as a Remote Software Engineer II, focusing on front-end development, microservices, and cloud computing.

SSi People logo
SSi People

Senior Machine Learning Engineer

Join as a Senior Machine Learning Engineer to design and deploy advanced ML solutions using Python, Spark, and cloud platforms. Remote work opportunity.

OfferFit logo
OfferFit

Machine Learning Engineer

Join OfferFit as a Machine Learning Engineer to design and scale AI platforms. Work remotely with a focus on Python, MLOps, and data science.

Hayden AI logo
Hayden AI

Senior Software Engineer, Backend

Join Hayden AI as a Senior Backend Engineer to build scalable cloud services using AWS, Python, and Go.

Argon AI (YC W24) logo
Argon AI (YC W24)

Senior Full Stack Software Engineer

Join Argon AI as a Senior Full Stack Software Engineer to build AI solutions for biopharma in NYC. Work with Python, PostgreSQL, and more.

MarketWise logo
MarketWise

AI/ML Data Engineer

Join MarketWise as an AI/ML Data Engineer to develop data pipelines and ETL processes using Python and cloud platforms.

Sanoma Learning logo
Sanoma Learning

Data Engineer with ETL and PySpark Experience

Join Sanoma Learning as a Data Engineer, focusing on ETL, PySpark, and data warehousing in a dynamic educational environment.

Keelvar logo
Keelvar

Staff Engineer - Python, Cloud, Distributed Systems

Join Keelvar as a Staff Engineer to lead design and architecture in a remote role, focusing on Python, cloud, and distributed systems.