Remote Data Engineer - Research

Stability AI

About the Role

We are seeking a talented Data Engineer to join our Data team at Stability AI. This role is pivotal to improving the efficiency of our models at scale. You will work closely with a multidisciplinary team of research scientists and machine learning engineers to build and maintain the data infrastructure that supports the training of all Stability AI models. This position is remote and based in Germany.

Responsibilities

  • Data Preparation: Clean, normalize, and preprocess data in a scalable, parallelizable manner, ensuring its quality before ingestion into our machine learning model training pipelines.
  • Infrastructure Development: Design, implement, and maintain scalable data infrastructure for generative AI.
  • Tool Development: Develop tools to search and serve data at scale.
  • Collaboration: Work with multiple research teams to understand and meet their data requirements.
  • Pipeline Management: Develop and manage data processing pipelines to support machine learning teams.
  • Data Quality: Maintain and improve data quality and integrity across various databases and data stores.
  • Data Management: Manage and organize large-scale unstructured data, including image, text, audio, video, and 3D.

Qualifications

  • Proven experience with large-scale distributed workloads.
  • Experience with large-scale data loading for machine learning training runs.
  • Proficiency in cloud storage and file systems, with a preference for AWS (S3).
  • Strong experience with Python.
  • Expertise in database, data lake, and data warehouse technologies such as Redshift, BigQuery, and Snowflake.
  • Experience working on machine learning projects, with some knowledge of deep learning and computer vision.
  • Excellent teamwork and communication skills, especially in a distributed international team setting.
  • Attention to detail and the ability to document processes and solutions effectively.

Equal Employment Opportunity

Stability AI is an equal opportunity employer. We do not discriminate based on race, religion, national origin, gender, sexual orientation, age, veteran status, disability, or other legally protected statuses.

Join us in building the foundation to activate humanity’s potential through open AI tools and solutions.

Benefits

  • Remote work
  • Equal Employment Opportunity
