Software Engineer II, Data Engineering
GitHub
Job Overview
Join GitHub as a Software Engineer II on the Copilot Metrics team, where you'll be at the forefront of data engineering. This role is pivotal in designing, developing, and maintaining efficient and reliable data pipelines. You'll collaborate with stakeholders across the company to gather business requirements, build data models, and ensure data quality and accessibility. Your expertise in Python, SQL, Airflow, and Spark will be crucial in optimizing our data infrastructure and enabling data-driven decision-making.
Responsibilities
Data Pipeline Development
- Design, build, and maintain scalable data pipelines using Python, SQL, Airflow, and Spark.
Business Requirements Gathering
- Collaborate with stakeholders to understand and translate business requirements into technical specifications.
Data Modeling
- Develop and implement data models that support analytics and reporting needs, ensuring alignment with business goals.
Data Quality and Governance
- Ensure data accuracy, consistency, and reliability by implementing robust data validation and quality checks.
Stakeholder Collaboration
- Work with cross-functional teams, including data analysts, data scientists, and business leaders, to deliver high-quality data solutions.
Performance Optimization
- Continuously monitor and optimize data pipelines for performance, scalability, and cost-efficiency.
Monitoring and Observability
- Build and implement monitoring and observability metrics to ensure data quality and detect anomalies in data pipelines.
Documentation and Communication
- Maintain clear and comprehensive documentation of data processes and effectively communicate technical concepts to non-technical stakeholders.
Qualifications
Required
- 2+ years of experience in Software Engineering, Computer Science, or a related technical discipline.
- Proven experience maintaining production software written in languages such as C, C++, C#, Java, JavaScript, Go, Ruby, Rust, or Python.
- 2+ years of experience in data engineering or analytics engineering roles.
- Strong proficiency in Python, SQL, Airflow, and Spark.
- Extensive expertise in building and maintaining robust data pipelines and ETL processes.
Preferred
- Familiarity with Go and Ruby.
- Experience with cloud platforms such as AWS, GCP, or Azure.
- Familiarity with data warehousing solutions (e.g., Snowflake, Redshift, BigQuery).
- Knowledge of data governance and data security best practices.
- Excellent verbal and written communication skills.
- Proven ability to work effectively in a collaborative, cross-functional environment.
Compensation
The base salary range for this job is USD $75,000.00 - USD $198,900.00 per year. Additional benefits include annual bonuses and stock options, with opportunities for sales incentives based on revenue or utilization.
About GitHub
GitHub is the world’s leading AI-powered developer platform with 100 million developers and counting. We’re also home to the biggest open-source community on earth. At GitHub, our goal is to create the space you need to do your best work. We’re remote-first and offer competitive pay, generous learning and growth opportunities, and excellent benefits to support you, wherever you are.
Join us, and let’s change the world, together.
Benefits (extracted with AI)
- Remote work
- Competitive pay
- Learning and growth opportunities
- Annual bonus
- Stock options
- Diverse and inclusive environment