Senior Data Engineer
Stability AIAbout the Role
We are seeking a talented Senior Data Engineer to join our Data team at Stability AI. This role is pivotal in building and maintaining the data infrastructure that supports the training of our AI models. You will work remotely from Germany, collaborating with a multidisciplinary team of research scientists and machine learning engineers to enhance and scale our model efficiency.
Responsibilities
- Data Preparation: Clean, normalize, and preprocess data in a scalable, parallelizable manner to prepare it for ingestion into our machine learning model training pipelines, ensuring data quality.
- Infrastructure Development: Design, implement, and maintain scalable data infrastructure for generative AI.
- Tool Development: Develop tools to search and serve data at scale.
- Collaboration: Work with cross-functional teams to understand and meet their data requirements.
- Pipeline Management: Develop and manage data processing pipelines to support machine learning teams.
- Data Quality: Maintain and improve data quality and integrity across various databases and data stores.
- Data Management: Manage and organize large-scale unstructured data, including image, text, audio, video, and 3D.
Qualifications
- Proven experience with large-scale distributed workloads.
- Experience with large-scale data loading for machine learning training runs.
- Proficiency in cloud storage and file systems, with a preference for AWS (S3).
- Strong experience with Python.
- Expertise in large-scale data processing and software development for unstructured data.
- Knowledge of database, data lake, and data warehouse technologies such as Redshift, BigQuery, and Snowflake.
- Experience with machine learning projects, ideally with some deep learning or computer vision knowledge.
- Excellent teamwork and communication skills, especially in a distributed international team setting.
- Attention to detail and ability to document processes and solutions effectively.
Equal Employment Opportunity
Stability AI is an equal opportunity employer. We do not discriminate based on race, religion, national origin, gender, sexual orientation, age, veteran status, disability, or other legally protected statuses.
Join us in building open AI tools that empower humanity to reach its full potential.
Benefits Extracted with AI
- Remote work
- Equal opportunity employer
Similar jobs
Last update: 23 minutes ago
Remote Data Engineer - Research
Join Stability AI as a Remote Data Engineer to build scalable data infrastructure for AI models.
Senior Backend Engineer (AI)
Join Stability AI as a Senior Backend Engineer to develop REST APIs and AI/ML services for Generative AI models.
FullStack Software Developer
Join SPREAD AI as a FullStack Software Developer in Berlin. Work with JavaScript, Python, Go, and more in a hybrid setup.
Senior AI Engineer - Backend
Join Keboola as a Senior AI Engineer to enhance AI features, develop models, and collaborate on innovative projects in Prague.
FullStack Software Developer
Join SPREAD AI as a FullStack Software Developer to innovate in data management and engineering intelligence.
Senior Data Engineer
Join us as a Senior Data Engineer in Lisbon to design and maintain data infrastructure. Hybrid role with flexible benefits.
Senior ML Infrastructure Engineer
Join CHAI: AI Platform as a Senior ML Infrastructure Engineer to build and scale ML systems in Palo Alto.
Machine Learning Engineer
Join OfferFit as a Machine Learning Engineer to design and scale AI platforms. Work remotely with a focus on Python, MLOps, and data science.
Senior Software Engineer, Backend
Join Standard AI as a Senior Backend Engineer to design scalable microservices and APIs. Remote role with competitive salary and benefits.
Senior Fullstack Software Engineer, GenAI Horizontal Task Tooling
Join Scale AI as a Senior Fullstack Software Engineer to build web-based applications for AI data annotation.
Software Engineer - Machine Learning
Join OUTFITTERY as a Software Engineer in Machine Learning, focusing on AI solutions for fashion. Remote work and flexible hours offered.
Machine Learning Engineer with AI/ML Experience
Join us as a Machine Learning Engineer to develop AI/ML models and applications. Work remotely with top-tier companies.
AI/ML Data Engineer
Join MarketWise as an AI/ML Data Engineer to develop data pipelines and ETL processes using Python and cloud platforms.
Senior Machine Learning Engineer
Join as a Senior Machine Learning Engineer to design and deploy advanced ML solutions using Python, Spark, and cloud platforms. Remote work opportunity.
Lead Architect - Gen AI API Platform
Lead Architect for Gen AI API platform, focusing on AWS, REST APIs, and AI/ML infrastructure. Remote role with competitive salary.
Senior Data Engineer with Azure Expertise
Join Eliq as a Senior Data Engineer to enhance our Azure-based data platform and drive the energy transition.
Senior Machine Learning Engineer
Join Atypon as a Senior ML Engineer to develop AI solutions in NLP, deep learning, and MLOps. Remote position in Athens.
Senior Distributed Systems Engineer
Join webAI as a Senior Distributed Systems Engineer to design and maintain scalable systems using Python, Kubernetes, and more.
Senior AI/ML Engineer
Join 3Pillar as a Senior AI/ML Engineer to develop innovative AI solutions in a remote, global team.
Data Engineer with AWS, Java, and Python
Join Poppi Technologies as a Data Engineer in Valenzano, Italy. Work with AWS, Java, and Python to drive AI in finance.
Senior Machine Learning Engineer
Join Echo Analytics as a Senior Machine Learning Engineer in Paris. Leverage ML to drive data modeling and design intelligent data flows.
Data Engineer II
Join Accolade as a Data Engineer II in Prague. Design and maintain cloud-native data infrastructure using AWS and modern technologies.
AI Solutions Software Engineer
Join DwellFi as an AI Solutions Software Engineer to develop innovative AI solutions using LangChain or Llama.
Staff Engineer - Python, Cloud, Distributed Systems
Join Keelvar as a Staff Engineer to lead design and architecture in a remote role, focusing on Python, cloud, and distributed systems.