Remote Data Engineer - Research
Stability AI
About the Role
We are seeking a talented Data Engineer to join our Data team at Stability AI. This role is pivotal in enhancing and scaling the efficiency of our models. You will work closely with a multidisciplinary team of research scientists and machine learning engineers to build and maintain the data infrastructure that supports the training of all Stability AI models. This position is remote and based in Germany.
Responsibilities
- Data Preparation: Clean, normalize, and preprocess data in a scalable, parallelizable manner to prepare it for ingestion into our machine learning model training pipelines, ensuring data quality.
- Infrastructure Development: Design, implement, and maintain scalable data infrastructure for generative AI.
- Tool Development: Develop tools to search and serve data at scale.
- Collaboration: Work with multiple research teams to understand and meet their data requirements.
- Pipeline Management: Develop and manage data processing pipelines to support machine learning teams.
- Data Quality: Maintain and improve data quality and integrity across various databases and data stores.
- Data Management: Manage and organize large-scale unstructured data, including image, text, audio, video, and 3D.
Qualifications
- Proven experience with large-scale distributed workloads.
- Experience with large-scale data loading for machine learning training runs.
- Proficiency in cloud storage and file systems, with a preference for AWS (S3).
- Strong experience with Python.
- Expertise in database, data lake, and data warehouse technologies such as Redshift, BigQuery, and Snowflake.
- Experience working on machine learning projects, with some knowledge of deep learning and computer vision.
- Excellent teamwork and communication skills, especially in a distributed international team setting.
- Attention to detail and the ability to document processes and solutions effectively.
Equal Employment Opportunity
Stability AI is an equal opportunity employer. We do not discriminate based on race, religion, national origin, gender, sexual orientation, age, veteran status, disability, or other legally protected statuses.
Join us in building the foundation to activate humanity’s potential through open AI tools and solutions.