About Cantina
Cantina, founded by Sean Parker, is a pioneering social platform featuring the most advanced AI character creator. Our platform allows users to build, share, and interact with AI bots and friends directly within Cantina or across the internet. These bots are lifelike, social creatures capable of interacting wherever humans go online. Whether recreating yourself using powerful AI, imagining someone new, or choosing from thousands of existing characters, Cantina offers a new media type for creators to share infinitely scalable and personalized content experiences, combined with seamless group chat across voice, video, and text.
About the Role
As a Senior Machine Learning Engineer on the Data team, you will focus on two key areas: large-scale data collection and refined dataset creation. You will expertly manage diverse data sources, build robust infrastructure for petabyte-scale data handling, and craft comprehensive training datasets through experimentation and evaluation. This role is central to our AI development process, requiring meticulous attention to detail and the ability to make impactful design decisions.
Responsibilities
- Identify and collect data at scale for different types of speech models.
- Work with large amounts of audio data and different types of speech models, including automatic speech recognition, speaker diarization, and audio classifiers.
- Design and implement robust infrastructure for efficient petabyte-scale data collection, processing, and streaming across multiple clouds.
- Develop ETL tools for managing vast amounts of data.
- Develop innovative solutions for data-related issues in AI model training and implement rigorous data validation processes.
- Ensure data integrity and relevance throughout the pipeline.
- Stay updated with the latest state-of-the-art audio models for speech processing, such as pyannote, parakeet, and whisper.
- Contribute to Cantina’s open-source ML projects and associated communities.
Qualifications
- 5+ years of experience in data platform engineering or a relevant field.
- Experience building large-scale data processing pipelines with tools like PySpark, Beam, or Flink.
- Familiarity with Machine Learning and NLP, with a willingness to learn more on the job.
- Proven track record of adapting to new domains and a desire to use data to improve products.
- Experience as an ML engineer, Data Scientist, or in a similar role.
- Experience with cloud platforms like AWS or Azure, or tools such as Kubernetes and Terraform.
- Passionate about Conversational AI or large language models.
Location
We have offices located in Sunnyvale, CA, San Francisco, CA, and Brooklyn, NY. While we focus on individuals near our office hubs, we offer fully remote and hybrid employment opportunities.
Compensation
In compliance with Pay Transparency Laws, the base salary range for this role is between $175,000 - $250,000 for those located in the San Francisco Bay Area, New York City, and Seattle, WA. When determining compensation, a number of factors will be considered, including skills, experience, job scope, location, and competitive compensation market data.
Application Process
Please submit your resume, cover letter, and any relevant portfolio or publications demonstrating your research contributions in AI.
Benefits Extracted with AI
- Health Care — 99% of premiums for medical, vision, dental are fully paid
- One Medical membership
- Monthly Stipend — $500/month
- 15 PTO days per year
- 9 sick days
- 13 paid company holidays
- Offices closed for winter break
- 401(K) — Eligible to participate on day one
- Parental Leave & Fertility Support
- Competitive Salary & Equity
- Lunch and snacks provided for in-office employees
- WFH equipment provided for full-time hybrid/remote employees
Similar jobs
Last update: 23 minutes ago
Senior C++ Computer Vision Engineer
Join a cutting-edge AI-DeepTech startup in Berlin as a Senior C++ Computer Vision Engineer. Work on world-class on-device AI technology.
Expert Machine Learning Engineer
Join Dataroots as an Expert Machine Learning Engineer to design and deliver AI-powered solutions, focusing on machine learning models.
Senior Fullstack Developer for AI-Driven Mission Technologies
Seeking a Senior Fullstack Developer for AI-driven mission technologies, focusing on Java, JavaScript, Python, and C++. Remote work available.
Data Engineer with ETL and SQL Expertise
Join Holland Casino as a Data Engineer to build and maintain data infrastructure for the Online Casino, focusing on ETL, SQL, and cloud solutions.
Full Stack Developer with AI and API Expertise
Join Catalyze Group as a Full Stack Developer to build AI-powered grant-writing tools. Work with React, Django, and more in Amsterdam.
Senior Backend/Data Engineer
Join Zalando as a Senior Backend/Data Engineer in Berlin to enhance our audience-building platform using AWS, Java, Scala, and SQL.
Cofounder - Full Stack Developer/Data Scientist for Climatech Startup
Join Carbon13 as a cofounder in climate tech, leveraging AI, data science, and software development to combat climate change.
Senior Data Engineer
Join i4talent as a Senior Data Engineer to lead cloud transitions and data projects. Enjoy a fun work environment with great benefits.
Senior Backend Engineer C++
Join DeepL as a Senior Backend Engineer C++ to design and maintain scalable backend services using C++ and AI technologies.
Staff Software Engineer
Join Aiven as a Staff Software Engineer to develop cloud operations platforms using open-source technologies. Hybrid work in Berlin.
Cloud Data Engineer
Seeking a Cloud Data Engineer with expertise in AWS, Python, and CI/CD for a hybrid role in Hannover. Join our dynamic team!
LLM Backend Developer
Join Persona as a LLM Backend Developer, work remotely, and develop AI-driven backend systems for top startups.
AI Engineer
Join BCG X as an AI Engineer in Milan, Italy. Develop AI solutions, partner with clients, and drive innovation in a dynamic environment.
Staff Software Engineer, Data Platform
Join Personio as a Staff Software Engineer in Berlin to build scalable data platforms using Kafka, Kubernetes, and AWS. Drive innovation and excellence.
Senior Solutions Engineer
Join Reddit as a Senior Solutions Engineer in Amsterdam to support our growing advertising business with technical expertise and problem-solving skills.
Senior Full-Stack Engineer ReactJS/NodeJS
Join Gorgias as a Senior Full-Stack Engineer specializing in ReactJS and NodeJS, enhancing AI-powered ecommerce solutions.
Backend Software Engineer - Privacy Technology
Join Zalando as a Backend Software Engineer in Privacy Technology, focusing on data protection and privacy automation services.
Senior Backend Engineer - Payments
Join Instapro Group as a Senior Backend Engineer in Berlin, focusing on PHP and payment systems in a hybrid work environment.
Senior Software Engineer - C#/.NET
Join TrueLayer as a Senior Software Engineer in Milan, working with C#, .NET, AWS, and Kubernetes to build scalable systems.
Senior IoT Engineer
Join Skytree as a Senior IoT Engineer to lead IoT projects, focusing on Azure IoT solutions, edge computing, and data pipelines.
Senior Full-Stack Engineer - TypeScript, React, Node.js
Join us as a Senior Full-Stack Engineer to develop a super app for medical professionals using TypeScript, React, and Node.js.
Senior Backend Engineer - PHP, Symfony, Laravel
Join Instapro Group as a Senior Backend Engineer, working with PHP, Symfony, and Laravel in a hybrid environment.
Senior Software Engineer - Python, Django, Angular
Join Ilkari as a Senior Software Engineer to lead development in Python, Django, and Angular, creating scalable solutions in a hybrid work environment.
Software Engineer - Cloud Applications and Python
Join Topicus as a Software Engineer in Arnhem to develop cloud applications using Python, REST APIs, and ETL processes for healthcare data services.