About Cantina
Cantina, founded by Sean Parker, is a pioneering social platform featuring the most advanced AI character creator. Our platform allows users to build, share, and interact with AI bots and friends directly within Cantina or across the internet. These bots are lifelike, social creatures capable of interacting wherever humans go online. Whether recreating yourself using powerful AI, imagining someone new, or choosing from thousands of existing characters, Cantina offers a new media type for creators to share infinitely scalable and personalized content experiences, combined with seamless group chat across voice, video, and text.
About the Role
As a Senior Machine Learning Engineer on the Data team, you will focus on two key areas: large-scale data collection and refined dataset creation. You will expertly manage diverse data sources, build robust infrastructure for petabyte-scale data handling, and craft comprehensive training datasets through experimentation and evaluation. This role is central to our AI development process, requiring meticulous attention to detail and the ability to make impactful design decisions.
Responsibilities
- Identify and collect data at scale for different types of speech models.
- Work with large amounts of audio data and different types of speech models, including automatic speech recognition, speaker diarization, and audio classifiers.
- Design and implement robust infrastructure for efficient petabyte-scale data collection, processing, and streaming across multiple clouds.
- Develop ETL tools for managing vast amounts of data.
- Develop innovative solutions for data-related issues in AI model training and implement rigorous data validation processes.
- Ensure data integrity and relevance throughout the pipeline.
- Stay updated with the latest state-of-the-art audio models for speech processing, such as pyannote, parakeet, and whisper.
- Contribute to Cantina’s open-source ML projects and associated communities.
Qualifications
- 5+ years of experience in data platform engineering or a relevant field.
- Experience building large-scale data processing pipelines with tools like PySpark, Beam, or Flink.
- Familiarity with Machine Learning and NLP, with a willingness to learn more on the job.
- Proven track record of adapting to new domains and a desire to use data to improve products.
- Experience as an ML engineer, Data Scientist, or in a similar role.
- Experience with cloud platforms like AWS or Azure, or tools such as Kubernetes and Terraform.
- Passionate about Conversational AI or large language models.
Location
We have offices located in Sunnyvale, CA, San Francisco, CA, and Brooklyn, NY. While we focus on individuals near our office hubs, we offer fully remote and hybrid employment opportunities.
Compensation
In compliance with Pay Transparency Laws, the base salary range for this role is between $175,000 - $250,000 for those located in the San Francisco Bay Area, New York City, and Seattle, WA. When determining compensation, a number of factors will be considered, including skills, experience, job scope, location, and competitive compensation market data.
Application Process
Please submit your resume, cover letter, and any relevant portfolio or publications demonstrating your research contributions in AI.
Benefits Extracted with AI
- Health Care — 99% of premiums for medical, vision, dental are fully paid
- One Medical membership
- Monthly Stipend — $500/month
- 15 PTO days per year
- 9 sick days
- 13 paid company holidays
- Offices closed for winter break
- 401(K) — Eligible to participate on day one
- Parental Leave & Fertility Support
- Competitive Salary & Equity
- Lunch and snacks provided for in-office employees
- WFH equipment provided for full-time hybrid/remote employees
Similar jobs
Last update: 23 minutes ago
Expert Machine Learning Engineer
Join Dataroots as an Expert Machine Learning Engineer to design and deliver AI-powered solutions, focusing on machine learning models.
Senior Backend Engineer C++
Join DeepL as a Senior Backend Engineer C++ to design and maintain scalable backend services using C++ and AI technologies.
Staff Software Engineer
Join Aiven as a Staff Software Engineer to develop cloud operations platforms using open-source technologies. Hybrid work in Berlin.
Cloud Data Engineer
Seeking a Cloud Data Engineer with expertise in AWS, Python, and CI/CD for a hybrid role in Hannover. Join our dynamic team!
AI Engineer
Join BCG X as an AI Engineer in Milan, Italy. Develop AI solutions, partner with clients, and drive innovation in a dynamic environment.
Staff Software Engineer, Data Platform
Join Personio as a Staff Software Engineer in Berlin to build scalable data platforms using Kafka, Kubernetes, and AWS. Drive innovation and excellence.
Principal AI Engineer
Join Cere Network as a Principal AI Engineer to drive AI innovation in Web3. Requires 10+ years in AI/ML, NLP, and software development.
Senior Software Engineer - C#/.NET
Join TrueLayer as a Senior Software Engineer in Milan, working with C#, .NET, AWS, and Kubernetes to build scalable systems.
Senior Software Engineer - AWS, Python, Ruby on Rails
Join HeyJobs as a Senior Software Engineer to design scalable systems using AWS, Python, and Ruby on Rails in a dynamic team.
Senior AI Engineer
Join Poggio as a Senior AI Engineer to innovate AI systems for enterprise sales, focusing on AI capabilities and system performance.
AI Solutions Software Engineer
Join DwellFi as an AI Solutions Software Engineer to develop innovative AI solutions using LangChain or Llama. Remote position in Palo Alto, CA.
Senior Backend Engineer - Java, Rust, Go
Join Together AI as a Senior Backend Engineer in Amsterdam. Work with Java, Rust, and Go to build scalable backend systems.
Senior Software Engineer - Data Platform
Join Nubank as a Senior Software Engineer to build and maintain core data infrastructure, ensuring reliable and scalable data flow.
Senior Software Engineer - Python, Django, Angular
Join Ilkari as a Senior Software Engineer to lead development in Python, Django, and Angular, creating scalable solutions in a hybrid work environment.
Senior Software Engineer (Node.js & TypeScript)
Join n8n as a Senior Software Engineer to build AI applications using Node.js and TypeScript. Remote role within Europe.
Senior Product Engineer [Rust & Typescript]
Join Attio as a Senior Product Engineer working with Rust & TypeScript to build innovative CRM features. Remote work available.
Software Engineer II - Developer Experience
Join Elastic as a Software Engineer II in Developer Experience, focusing on test frameworks for Kibana. Remote work, competitive benefits.
Machine Learning Engineer
Join MoonPay as a Machine Learning Engineer to build and maintain ML infrastructure, collaborating with data scientists and cross-functional teams.
Senior Full Stack Engineer (Java, React, MySQL)
Join LILT as a Senior Full Stack Engineer, working with Java, React, and MySQL to drive AI translation solutions. Remote with future hybrid work.
Senior Software Engineer - LLM
Join Snowflake as a Senior Software Engineer to build scalable machine learning platforms with LLMs, leveraging Python and TensorFlow.
Software Engineer - Cloud Applications and Python
Join Topicus as a Software Engineer in Arnhem to develop cloud applications using Python, REST APIs, and ETL processes for healthcare data services.
Senior Software Engineer - Python, Apache Kafka
Join Aiven as a Senior Software Engineer in Berlin, focusing on Python and Apache Kafka in a hybrid work environment.
Senior Cloud DevOps Engineer
Join netgo as a Senior Cloud DevOps Engineer in Berlin. Work with Kubernetes, GitOps, and more in a dynamic team environment.
Senior Software Engineer - LLM
Join Snowflake as a Senior Software Engineer to build scalable machine learning platforms with LLMs, leveraging Python and TensorFlow.