Be Part of Building the Future
Dremio is the unified lakehouse platform for self-service analytics and AI, serving hundreds of global enterprises. Customers rely on Dremio for cloud, hybrid, and on-prem lakehouses to power their data mesh, data warehouse migration, data virtualization, and unified data access use cases. Based on open source technologies, including Apache Iceberg and Apache Arrow, Dremio provides an open lakehouse architecture enabling the fastest time to insight and platform flexibility at a fraction of the cost.
About The Role
We are seeking a talented and motivated Senior Software Engineer to join our Datalake team within the Query Engine organization. In this role, you will focus on enhancing our query engine with a particular emphasis on the Iceberg table format and efficient scans of various file formats. This is an exciting opportunity to contribute to cutting-edge technology in the big data ecosystem.
What You’ll Be Doing
- Designing and implementing features for Dremio’s query engine with a focus on the Iceberg table format.
- Optimizing file scan operations for various file formats, including Parquet, Avro, and others.
- Collaborating with members of the query planning and query execution teams to ensure seamless integration of features across the code stack.
- Working with and contributing to open-source projects like Apache Iceberg, Parquet, and Arrow.
- Maintaining and enhancing compliance with the Iceberg table format specification.
- Conducting performance tuning and benchmarking to enhance query execution speed.
- Understanding and reasoning about concurrency and parallelization to deliver scalability and performance in a multithreaded and distributed environment.
- Participating in code reviews and providing constructive feedback to peers.
What We’re Looking For
- B.S., M.S. or PhD in Computer Science or in a related technical field.
- 5+ years of software engineering experience, with a focus on database systems, query execution, or related fields.
- Strong programming skills in an object-oriented language such as Java or C++.
- Understanding of database internals, query planning, distributed systems, concurrency control, data replication, and storage systems.
- Familiarity with cloud object stores, such as AWS S3, ADLS, or GCS.
- Experience with Apache Iceberg, Parquet, AVRO, and/or Delta.
- Interested and motivated to be part of a fast-moving startup with a fun and accomplished team.
- Desire to learn: You can stand your ground as well as be mentored by your teammates.
Bonus Points
- Experience working with and contributing to open source projects.
What We Value
At Dremio, we hold ourselves to high standards when it comes to People, Thinking, and Action. Our Gnarlies (that's what we call our employees) communicate with clarity, drive accountability, and are respectful towards each other. We confront brutal facts and focus on results while operating with a sense of urgency and building a "flywheel". People who like to jump in and drive momentum will thrive in our #GnarlyLife.
Dremio is an equal opportunity employer supporting workforce diversity. We do not discriminate on the basis of race, religion, color, national origin, gender identity, sexual orientation, age, marital status, protected veteran status, disability status, or any other unlawful factor.
Dremio is committed to providing any necessary accommodations for individuals with disabilities within our application and interview process.
Benefits Extracted with AI
- Equal opportunity employer
- Workforce diversity
- Accommodations for disabilities
Similar jobs
Last update: 23 minutes ago
Senior Software Engineer - Polaris & Data Lake Catalog
Join Snowflake as a Senior Software Engineer to build and evolve our open data lake ecosystem with Java, Scala, and C++.
Senior Software Engineer - App Foundation (Database)
Join Snowflake as a Senior Software Engineer focusing on database systems, enhancing backend services for Snowsight.
Senior Data Engineer (Java/Scala)
Join smartclip as a Senior Data Engineer to design scalable big data solutions using Java, Scala, and Spark. Remote work available.
Staff Software Engineer, Data Infrastructure
Senior Data Infrastructure Engineer at Airbnb, focusing on data engineering tools and frameworks, remote eligible.
Senior Software Development Engineer - Aurora Limitless Database
Join AWS as a Senior Software Development Engineer to innovate in cloud database services with Aurora Limitless Database.
Lead Data Engineer – Data Platform
Lead Data Engineer role in Berlin, focusing on data platform scalability and efficiency, with skills in Kubernetes, Scala, and Apache Spark.
Graduate Software Development Engineer – Redshift Query Processing
Join AWS as a Graduate Software Development Engineer in Berlin, focusing on Redshift Query Processing. Develop cutting-edge cloud data solutions.
Senior Backend/Data Engineer
Join Zalando as a Senior Backend/Data Engineer in Berlin to enhance our audience-building platform using AWS, Java, Scala, and SQL.
Senior Software Engineer, Database Engine and Semantic Data Modeling
Senior Software Engineer role focusing on database engine and semantic data modeling, remote position.
Senior Software Engineer, Data Engineering
Join Grammarly as a Senior Software Engineer in Data Engineering, focusing on building data pipelines and infrastructure.
Software Development Engineer - High-Performance Query Processing
Join Amazon Redshift as a Software Development Engineer focusing on high-performance query processing. Work on cutting-edge distributed data processing algorithms.
Senior Data Engineer
Join celver AG as a Senior Data Engineer to design and build Smart Data/Analytics platforms. Work with Python, SQL, and more in a dynamic environment.
Software Engineer - MLOps
Join Dataiku as a Software Engineer in Berlin, focusing on MLOps features and capabilities. Enhance ML model automation and interfaces.
Software Development Engineer
Join Adobe as a Software Development Engineer in San Jose, CA, focusing on high-performance segmentation engines and query optimization.
Senior Software Engineer - Data Pipeline Team
Senior Software Engineer for Data Pipeline team, remote work, expertise in Python, NoSQL, Big Data technologies.
Staff Software Engineer - Backend
Staff Software Engineer - Backend role at Databricks, focusing on Java, Scala, and cloud technologies in Seattle, WA.
Senior Software Engineer - Polaris & Data Lake Catalog
Join Snowflake as a Senior Software Engineer to build and evolve our open data lake ecosystem with Polaris.
Senior Software Engineer - Data Platform
Join Discord as a Senior Software Engineer on the Data Platform team, working with GCP, Airflow, and BigQuery.
Staff Software Engineer
Join Aiven as a Staff Software Engineer to develop cloud operations platforms using open-source technologies. Hybrid work in Berlin.
Lead Data Engineer with GCP Expertise
Lead Data Engineer role in Berlin, focusing on GCP, BigQuery, and data pipelines.
Senior Data Engineer
Senior Data Engineer with expertise in Scala, Java, Spark, and Big Data technologies. Based in Berlin, Germany.
Software Engineer - AI & Machine Learning
Join Dataiku as a Software Engineer in AI & Machine Learning, working with Java, Scala, and Angular in a remote role.
Senior Software Engineer, Platform
Senior Software Engineer role focusing on platform development with skills in TypeScript, Apache Airflow, and distributed systems.
Senior Data Platform Engineer
Senior Data Platform Engineer needed for Blip, focusing on Big Data management and cloud solutions. Expertise in SQL, Python, Spark, and cloud platforms required.