About the Role
We are seeking a talented Senior Data Engineer to join our Data team at Stability AI. This role is pivotal in building and maintaining the data infrastructure that supports the training of our AI models. You will work remotely from Germany, collaborating with a multidisciplinary team of research scientists and machine learning engineers to enhance and scale our model efficiency.
Responsibilities
- Data Preparation: Clean, normalize, and preprocess data in a scalable, parallelizable manner to prepare it for ingestion into our machine learning model training pipelines, ensuring data quality.
- Infrastructure Development: Design, implement, and maintain scalable data infrastructure for generative AI.
- Tool Development: Develop tools to search and serve data at scale.
- Collaboration: Work with cross-functional teams to understand and meet their data requirements.
- Pipeline Management: Develop and manage data processing pipelines to support machine learning teams.
- Data Quality: Maintain and improve data quality and integrity across various databases and data stores.
- Data Management: Manage and organize large-scale unstructured data, including image, text, audio, video, and 3D.
Qualifications
- Proven experience with large-scale distributed workloads.
- Experience with large-scale data loading for machine learning training runs.
- Proficiency in cloud storage and file systems, with a preference for AWS (S3).
- Strong experience with Python.
- Expertise in large-scale data processing and software development for unstructured data.
- Knowledge of database, data lake, and data warehouse technologies such as Redshift, BigQuery, and Snowflake.
- Experience with machine learning projects, ideally with some deep learning or computer vision knowledge.
- Excellent teamwork and communication skills, especially in a distributed international team setting.
- Attention to detail and ability to document processes and solutions effectively.
Equal Employment Opportunity
Stability AI is an equal opportunity employer. We do not discriminate based on race, religion, national origin, gender, sexual orientation, age, veteran status, disability, or other legally protected statuses.
Join us in building open AI tools that empower humanity to reach its full potential.
Benefits Extracted with AI
- Remote work
- Equal opportunity employer
Similar jobs
Last update: 23 minutes ago
Remote Data Engineer - Research
Join Stability AI as a Remote Data Engineer to build scalable data infrastructure for AI models.
Senior Data Platform Engineer
Senior Data Platform Engineer specializing in AWS and GCP services, data pipelines, and cloud infrastructure.
Senior Backend Engineer (AI)
Join Stability AI as a Senior Backend Engineer to develop REST APIs and AI/ML services for Generative AI models.
Senior Data Engineer (f/m/d)
Senior Data Engineer needed in Berlin. Expertise in Python, SQL, Data Modeling, and ETL required. Hybrid work policy.
Senior Data Engineer
Join Algolia as a Senior Data Engineer to design and scale data pipelines using Python, Airflow, and AWS technologies.
MLOps Engineer
Join Pruna AI as an MLOps Engineer to optimize machine learning infrastructure and enhance AI operations remotely.
Site Reliability Engineer (SRE) - Stability AI
Join Stability AI as a Site Reliability Engineer (SRE) to enhance cloud infrastructure and system reliability. Remote work available.
Senior Data Engineer (Java/Scala)
Join smartclip as a Senior Data Engineer to design scalable big data solutions using Java, Scala, and Spark. Remote work available.
Senior Backend/Data Engineer
Join Zalando as a Senior Backend/Data Engineer in Berlin to enhance our audience-building platform using AWS, Java, Scala, and SQL.
Senior Data Engineer
Join Simon-Kucher as a Senior Data Engineer in Berlin. Design scalable data architectures and drive digital transformation.
Senior Data Engineer - Real Estate and Workplace
Senior Data Engineer for Real Estate and Workplace at OpenAI, skilled in ETL, Apache Spark, and Airflow.
Senior Machine Learning Engineer
Senior Machine Learning Engineer role focusing on AI, ML model deployment, and cloud solutions in Berlin.
Data Engineer
Join Etribes as a Data Engineer in Hamburg. Work on data pipelines, analytics, and cloud platforms. Flexible work, training, and benefits offered.
Senior Software Engineer, Data Engineering
Join Grammarly as a Senior Software Engineer in Data Engineering, focusing on building data pipelines and infrastructure.
Senior Data Engineer
Join Taxfix as a Senior Data Engineer in Berlin to build scalable data platforms for ML and analytics.
Staff Software Engineer
Join Aiven as a Staff Software Engineer to develop cloud operations platforms using open-source technologies. Hybrid work in Berlin.
Senior Data Engineer
Senior Data Engineer role focusing on ETL, Python, and Big Data in a remote setting with comprehensive benefits.
Lead Data Engineer with GCP Expertise
Lead Data Engineer role in Berlin, focusing on GCP, BigQuery, and data pipelines.
Senior Data Engineer
Senior Data Engineer with expertise in Scala, Java, Spark, and Big Data technologies. Based in Berlin, Germany.
Senior Data Engineer
Join Heyflow as a Senior Data Engineer to transform data into insights using GCP, Python, and SQL in a hybrid work environment.
Senior Data Engineer
Join celver AG as a Senior Data Engineer to design and build Smart Data/Analytics platforms. Work with Python, SQL, and more in a dynamic environment.
Machine Learning Engineer
Join OfferFit as a Machine Learning Engineer to design and scale AI platforms. Work remotely with a focus on Python, MLOps, and data science.
Senior Software Engineer, Machine Learning Infrastructure
Join Scale AI as a Senior Software Engineer in Machine Learning Infrastructure, focusing on backend system design and ML Infrastructure.
Cloud Data Engineer
Seeking a Cloud Data Engineer with expertise in AWS, Python, and CI/CD for a hybrid role in Hannover. Join our dynamic team!