Stability AI logo

Senior Data Engineer

Stability AI

About the Role

We are seeking a talented Senior Data Engineer to join our Data team at Stability AI. This role is pivotal in building and maintaining the data infrastructure that supports the training of our AI models. You will work remotely from Germany, collaborating with a multidisciplinary team of research scientists and machine learning engineers to enhance and scale our model efficiency.

Responsibilities

  • Data Preparation: Clean, normalize, and preprocess data in a scalable, parallelizable manner to prepare it for ingestion into our machine learning model training pipelines, ensuring data quality.
  • Infrastructure Development: Design, implement, and maintain scalable data infrastructure for generative AI.
  • Tool Development: Develop tools to search and serve data at scale.
  • Collaboration: Work with cross-functional teams to understand and meet their data requirements.
  • Pipeline Management: Develop and manage data processing pipelines to support machine learning teams.
  • Data Quality: Maintain and improve data quality and integrity across various databases and data stores.
  • Data Management: Manage and organize large-scale unstructured data, including image, text, audio, video, and 3D.

Qualifications

  • Proven experience with large-scale distributed workloads.
  • Experience with large-scale data loading for machine learning training runs.
  • Proficiency in cloud storage and file systems, with a preference for AWS (S3).
  • Strong experience with Python.
  • Expertise in large-scale data processing and software development for unstructured data.
  • Knowledge of database, data lake, and data warehouse technologies such as Redshift, BigQuery, and Snowflake.
  • Experience with machine learning projects, ideally with some deep learning or computer vision knowledge.
  • Excellent teamwork and communication skills, especially in a distributed international team setting.
  • Attention to detail and ability to document processes and solutions effectively.

Equal Employment Opportunity

Stability AI is an equal opportunity employer. We do not discriminate based on race, religion, national origin, gender, sexual orientation, age, veteran status, disability, or other legally protected statuses.

Join us in building open AI tools that empower humanity to reach its full potential.

Benefits
Extracted with AI

  • Remote work
  • Equal opportunity employer

Similar jobs

Last update: 23 minutes ago

Stability AI logo
Stability AI

Remote Data Engineer - Research

Join Stability AI as a Remote Data Engineer to build scalable data infrastructure for AI models.

Stability AI logo
Stability AI

Senior Data Platform Engineer

Senior Data Platform Engineer specializing in AWS and GCP services, data pipelines, and cloud infrastructure.

Stability AI logo
Stability AI

Senior Backend Engineer (AI)

Join Stability AI as a Senior Backend Engineer to develop REST APIs and AI/ML services for Generative AI models.

Wunderflats logo
Wunderflats

Senior Data Engineer (f/m/d)

Senior Data Engineer needed in Berlin. Expertise in Python, SQL, Data Modeling, and ETL required. Hybrid work policy.

Algolia logo
Algolia

Senior Data Engineer

Join Algolia as a Senior Data Engineer to design and scale data pipelines using Python, Airflow, and AWS technologies.

Pruna AI logo
Pruna AI

MLOps Engineer

Join Pruna AI as an MLOps Engineer to optimize machine learning infrastructure and enhance AI operations remotely.

Stability AI logo
Stability AI

Site Reliability Engineer (SRE) - Stability AI

Join Stability AI as a Site Reliability Engineer (SRE) to enhance cloud infrastructure and system reliability. Remote work available.

smartclip logo
smartclip

Senior Data Engineer (Java/Scala)

Join smartclip as a Senior Data Engineer to design scalable big data solutions using Java, Scala, and Spark. Remote work available.

Zalando logo
Zalando

Senior Backend/Data Engineer

Join Zalando as a Senior Backend/Data Engineer in Berlin to enhance our audience-building platform using AWS, Java, Scala, and SQL.

Simon Kucher logo
Simon Kucher

Senior Data Engineer

Join Simon-Kucher as a Senior Data Engineer in Berlin. Design scalable data architectures and drive digital transformation.

OpenAI logo
OpenAI

Senior Data Engineer - Real Estate and Workplace

Senior Data Engineer for Real Estate and Workplace at OpenAI, skilled in ETL, Apache Spark, and Airflow.

Riverty logo
Riverty

Senior Machine Learning Engineer

Senior Machine Learning Engineer role focusing on AI, ML model deployment, and cloud solutions in Berlin.

Etribes logo
Etribes

Data Engineer

Join Etribes as a Data Engineer in Hamburg. Work on data pipelines, analytics, and cloud platforms. Flexible work, training, and benefits offered.

Grammarly logo
Grammarly

Senior Software Engineer, Data Engineering

Join Grammarly as a Senior Software Engineer in Data Engineering, focusing on building data pipelines and infrastructure.

Taxfix logo
Taxfix

Senior Data Engineer

Join Taxfix as a Senior Data Engineer in Berlin to build scalable data platforms for ML and analytics.

Aiven logo
Aiven

Staff Software Engineer

Join Aiven as a Staff Software Engineer to develop cloud operations platforms using open-source technologies. Hybrid work in Berlin.

Fullstory logo
Fullstory

Senior Data Engineer

Senior Data Engineer role focusing on ETL, Python, and Big Data in a remote setting with comprehensive benefits.

Almedia logo
Almedia

Lead Data Engineer with GCP Expertise

Lead Data Engineer role in Berlin, focusing on GCP, BigQuery, and data pipelines.

Riverty logo
Riverty

Senior Data Engineer

Senior Data Engineer with expertise in Scala, Java, Spark, and Big Data technologies. Based in Berlin, Germany.

Heyflow logo
Heyflow

Senior Data Engineer

Join Heyflow as a Senior Data Engineer to transform data into insights using GCP, Python, and SQL in a hybrid work environment.

celver AG logo
celver AG

Senior Data Engineer

Join celver AG as a Senior Data Engineer to design and build Smart Data/Analytics platforms. Work with Python, SQL, and more in a dynamic environment.

OfferFit logo
OfferFit

Machine Learning Engineer

Join OfferFit as a Machine Learning Engineer to design and scale AI platforms. Work remotely with a focus on Python, MLOps, and data science.

Scale AI logo
Scale AI

Senior Software Engineer, Machine Learning Infrastructure

Join Scale AI as a Senior Software Engineer in Machine Learning Infrastructure, focusing on backend system design and ML Infrastructure.

Computer Futures logo
Computer Futures

Cloud Data Engineer

Seeking a Cloud Data Engineer with expertise in AWS, Python, and CI/CD for a hybrid role in Hannover. Join our dynamic team!