Stability AI logo

Remote Data Engineer - Research

Stability AI

About the Role

We are seeking a talented Data Engineer to join our Data team at Stability AI. This role is pivotal in enhancing and scaling the efficiency of our models. You will work closely with a multidisciplinary team of research scientists and machine learning engineers to build and maintain the data infrastructure that supports the training of all Stability AI models. This position is remote and based in Germany.

Responsibilities

  • Data Preparation: Clean, normalize, and preprocess data in a scalable, parallelizable manner to prepare it for ingestion into our machine learning model training pipelines, ensuring data quality.
  • Infrastructure Development: Design, implement, and maintain scalable data infrastructure for generative AI.
  • Tool Development: Develop tools to search and serve data at scale.
  • Collaboration: Work with multiple research teams to understand and meet their data requirements.
  • Pipeline Management: Develop and manage data processing pipelines to support machine learning teams.
  • Data Quality: Maintain and improve data quality and integrity across various databases and data stores.
  • Data Management: Manage and organize large-scale unstructured data, including image, text, audio, video, and 3D.

Qualifications

  • Proven experience with large-scale distributed workloads.
  • Experience with large-scale data loading for machine learning training runs.
  • Proficiency in cloud storage and file systems, with a preference for AWS (S3).
  • Strong experience with Python.
  • Expertise in database, data lake, and data warehouse technologies such as Redshift, BigQuery, and Snowflake.
  • Experience working on machine learning projects, with some knowledge of deep learning and computer vision.
  • Excellent teamwork and communication skills, especially in a distributed international team setting.
  • Attention to detail and the ability to document processes and solutions effectively.

Equal Employment Opportunity

Stability AI is an equal opportunity employer. We do not discriminate based on race, religion, national origin, gender, sexual orientation, age, veteran status, disability, or other legally protected statuses.

Join us in building the foundation to activate humanity’s potential through open AI tools and solutions.

Benefits
Extracted with AI

  • Remote work
  • Equal Employment Opportunity

Similar jobs

Last update: 23 minutes ago

Stability AI logo
Stability AI

Senior Data Engineer

Join Stability AI as a Senior Data Engineer to build scalable data infrastructure for AI models. Remote work from Germany.

Stability AI logo
Stability AI

Senior Backend Engineer (AI)

Join Stability AI as a Senior Backend Engineer to develop REST APIs and AI/ML services for Generative AI models.

OfferFit logo
OfferFit

Machine Learning Engineer

Join OfferFit as a Machine Learning Engineer to design and scale AI platforms. Work remotely with a focus on Python, MLOps, and data science.

Remote Crew logo
Remote Crew

Senior Data Engineer

Join us as a Senior Data Engineer in Lisbon to design and maintain data infrastructure. Hybrid role with flexible benefits.

SPREAD AI logo
SPREAD AI

FullStack Software Developer

Join SPREAD AI as a FullStack Software Developer to innovate in data management and engineering intelligence.

micro1 logo
micro1

Machine Learning Engineer with AI/ML Experience

Join us as a Machine Learning Engineer to develop AI/ML models and applications. Work remotely with top-tier companies.

SPREAD AI logo
SPREAD AI

FullStack Software Developer

Join SPREAD AI as a FullStack Software Developer in Berlin. Work with JavaScript, Python, Go, and more in a hybrid setup.

Keboola logo
Keboola

Senior AI Engineer - Backend

Join Keboola as a Senior AI Engineer to enhance AI features, develop models, and collaborate on innovative projects in Prague.

Messari logo
Messari

Data Engineer with Blockchain and Cloud Experience

Join Messari as a Data Engineer to design blockchain data models, build dashboards, and derive insights. Remote role with competitive benefits.

MarketWise logo
MarketWise

AI/ML Data Engineer

Join MarketWise as an AI/ML Data Engineer to develop data pipelines and ETL processes using Python and cloud platforms.

OUTFITTERY logo
OUTFITTERY

Software Engineer - Machine Learning

Join OUTFITTERY as a Software Engineer in Machine Learning, focusing on AI solutions for fashion. Remote work and flexible hours offered.

SSi People logo
SSi People

Senior Machine Learning Engineer

Join as a Senior Machine Learning Engineer to design and deploy advanced ML solutions using Python, Spark, and cloud platforms. Remote work opportunity.

Poppi Technologies logo
Poppi Technologies

Data Engineer with AWS, Java, and Python

Join Poppi Technologies as a Data Engineer in Valenzano, Italy. Work with AWS, Java, and Python to drive AI in finance.

Compliance & Risks logo
Compliance & Risks

Head of Data Science

Lead our Data Science team in Ireland, driving AI-powered compliance solutions. Remote work, diverse workplace, and growth opportunities.

Leonardo.Ai logo
Leonardo.Ai

Mid-Level AI Researcher

Join Leonardo.Ai as a Mid-Level AI Researcher to develop and refine AI models, focusing on model training and optimization.

DwellFi  logo
DwellFi

AI Solutions Software Engineer

Join DwellFi as an AI Solutions Software Engineer to develop innovative AI solutions using LangChain or Llama.

CHAI: AI Platform logo
CHAI: AI Platform

Senior ML Infrastructure Engineer

Join CHAI: AI Platform as a Senior ML Infrastructure Engineer to build and scale ML systems in Palo Alto.

ClimateAi logo
ClimateAi

Applied AI Scientist

Join ClimateAi as an Applied AI Scientist to develop AI solutions for climate resilience. Work with diverse teams in a hybrid environment.

Helm.ai logo
Helm.ai

Remote Software Engineer - Machine Learning and Cloud Infrastructure

Join Helm.ai as a Remote Software Engineer to develop ML tools, build cloud infrastructure, and work on AI technology.

PlushCare logo
PlushCare

Data Engineer II

Join Accolade as a Data Engineer II in Prague. Design and maintain cloud-native data infrastructure using AWS and modern technologies.

Keelvar logo
Keelvar

Staff Engineer - Python, Cloud, Distributed Systems

Join Keelvar as a Staff Engineer to lead design and architecture in a remote role, focusing on Python, cloud, and distributed systems.

Standard AI logo
Standard AI

Senior Software Engineer, Backend

Join Standard AI as a Senior Backend Engineer to design scalable microservices and APIs. Remote role with competitive salary and benefits.

Thoughtworks logo
Thoughtworks

Senior Data Scientist (Contractor)

Join Thoughtworks as a Senior Data Scientist (Contractor) to solve complex business problems using data science and machine learning.

Twitch logo
Twitch

Data Scientist

Join Twitch as a Data Scientist to drive insights and analytics in a remote role. Leverage SQL, Python, and data visualization skills.