Cantina logo

Senior Machine Learning Engineer - Data

Cantina

About Cantina

Cantina, founded by Sean Parker, is a pioneering social platform featuring the most advanced AI character creator. Our platform allows users to build, share, and interact with AI bots and friends directly within Cantina or across the internet. These bots are lifelike, social creatures capable of interacting wherever humans go online. Whether recreating yourself using powerful AI, imagining someone new, or choosing from thousands of existing characters, Cantina offers a new media type for creators to share infinitely scalable and personalized content experiences, combined with seamless group chat across voice, video, and text.

About the Role

As a Senior Machine Learning Engineer on the Data team, you will focus on two key areas: large-scale data collection and refined dataset creation. You will expertly manage diverse data sources, build robust infrastructure for petabyte-scale data handling, and craft comprehensive training datasets through experimentation and evaluation. This role is central to our AI development process, requiring meticulous attention to detail and the ability to make impactful design decisions.

Responsibilities

  • Identify and collect data at scale for different types of speech models.
  • Work with large amounts of audio data and different types of speech models, including automatic speech recognition, speaker diarization, and audio classifiers.
  • Design and implement robust infrastructure for efficient petabyte-scale data collection, processing, and streaming across multiple clouds.
  • Develop ETL tools for managing vast amounts of data.
  • Develop innovative solutions for data-related issues in AI model training and implement rigorous data validation processes.
  • Ensure data integrity and relevance throughout the pipeline.
  • Stay updated with the latest state-of-the-art audio models for speech processing, such as pyannote, parakeet, and whisper.
  • Contribute to Cantina’s open-source ML projects and associated communities.

Qualifications

  • 5+ years of experience in data platform engineering or a relevant field.
  • Experience building large-scale data processing pipelines with tools like PySpark, Beam, or Flink.
  • Familiarity with Machine Learning and NLP, with a willingness to learn more on the job.
  • Proven track record of adapting to new domains and a desire to use data to improve products.
  • Experience as an ML engineer, Data Scientist, or in a similar role.
  • Experience with cloud platforms like AWS or Azure, or tools such as Kubernetes and Terraform.
  • Passionate about Conversational AI or large language models.

Location

We have offices located in Sunnyvale, CA, San Francisco, CA, and Brooklyn, NY. While we focus on individuals near our office hubs, we offer fully remote and hybrid employment opportunities.

Compensation

In compliance with Pay Transparency Laws, the base salary range for this role is between $175,000 - $250,000 for those located in the San Francisco Bay Area, New York City, and Seattle, WA. When determining compensation, a number of factors will be considered, including skills, experience, job scope, location, and competitive compensation market data.

Application Process

Please submit your resume, cover letter, and any relevant portfolio or publications demonstrating your research contributions in AI.

Benefits
Extracted with AI

  • Health Care — 99% of premiums for medical, vision, dental are fully paid
  • One Medical membership
  • Monthly Stipend — $500/month
  • 15 PTO days per year
  • 9 sick days
  • 13 paid company holidays
  • Offices closed for winter break
  • 401(K) — Eligible to participate on day one
  • Parental Leave & Fertility Support
  • Competitive Salary & Equity
  • Lunch and snacks provided for in-office employees
  • WFH equipment provided for full-time hybrid/remote employees

Similar jobs

Last update: 23 minutes ago

Cantina logo
Cantina

Senior Machine Learning Engineer

Join Cantina as a Senior Machine Learning Engineer to design and maintain ML infrastructure, optimize performance, and integrate models.

Cantina logo
Cantina

Senior Machine Learning Engineer, Post-Training

Join Cantina as a Senior Machine Learning Engineer to develop advanced AI models and shape human-bot interactions.

Cantina logo
Cantina

Senior Machine Learning Engineer

Senior ML Engineer at Cantina, designing AI models for a social platform. Skills in AI, ML, NLP, Python. Remote options available.

Cantina logo
Cantina

Senior Machine Learning Engineer - Images

Join Cantina as a Senior Machine Learning Engineer to design and improve AI models for image generation.

Cantina logo
Cantina

Research Scientist - AI and Computer Vision

Join Cantina as a Research Scientist to advance AI-driven social platforms with cutting-edge video and image generation models.

dataroots logo
dataroots

Expert Machine Learning Engineer

Join Dataroots as an Expert Machine Learning Engineer to design and deliver AI-powered solutions, focusing on machine learning models.

FoodLabs logo
FoodLabs

Senior C++ Computer Vision Engineer

Join a cutting-edge AI-DeepTech startup in Berlin as a Senior C++ Computer Vision Engineer. Work on world-class on-device AI technology.

yourfirm GmbH logo
yourfirm GmbH

Senior Fullstack Developer for AI-Driven Mission Technologies

Seeking a Senior Fullstack Developer for AI-driven mission technologies, focusing on Java, JavaScript, Python, and C++. Remote work available.

Cantina logo
Cantina

Senior Mobile Gaming Engineer

Join Cantina as a Senior Mobile Gaming Engineer to design AI-embedded mobile games. Work with iOS, Android, and web technologies.

Cantina logo
Cantina

Senior Mobile Gaming Engineer

Join Cantina as a Senior Mobile Gaming Engineer to design and build AI-embedded mobile-first gaming platforms.

Holland Casino logo
Holland Casino

Data Engineer with ETL and SQL Expertise

Join Holland Casino as a Data Engineer to build and maintain data infrastructure for the Online Casino, focusing on ETL, SQL, and cloud solutions.

Cantina logo
Cantina

Senior Backend Engineer (Go)

Senior Backend Engineer specializing in Go, involved in building and maintaining complex systems with a focus on reliability and scalability.

Catalyze Group logo
Catalyze Group

Full Stack Developer with AI and API Expertise

Join Catalyze Group as a Full Stack Developer to build AI-powered grant-writing tools. Work with React, Django, and more in Amsterdam.

Zalando logo
Zalando

Senior Backend/Data Engineer

Join Zalando as a Senior Backend/Data Engineer in Berlin to enhance our audience-building platform using AWS, Java, Scala, and SQL.

i4talent detachering logo
i4talent detachering

Senior Data Engineer

Join i4talent as a Senior Data Engineer to lead cloud transitions and data projects. Enjoy a fun work environment with great benefits.

DeepL logo
DeepL

Senior Backend Engineer C++

Join DeepL as a Senior Backend Engineer C++ to design and maintain scalable backend services using C++ and AI technologies.

Aiven logo
Aiven

Staff Software Engineer

Join Aiven as a Staff Software Engineer to develop cloud operations platforms using open-source technologies. Hybrid work in Berlin.

Cantina logo
Cantina

Senior Media Software Engineer (Real-Time)

Senior Media Software Engineer needed for AI-driven real-time media platform, skilled in C/C++, WebRTC, and mobile development.

Computer Futures logo
Computer Futures

Cloud Data Engineer

Seeking a Cloud Data Engineer with expertise in AWS, Python, and CI/CD for a hybrid role in Hannover. Join our dynamic team!

Persona logo
Persona

LLM Backend Developer

Join Persona as a LLM Backend Developer, work remotely, and develop AI-driven backend systems for top startups.

Poggio logo
Poggio

Senior AI Engineer

Join Poggio as a Senior AI Engineer to innovate AI systems for enterprise sales, focusing on AI capabilities and system performance.

Alt logo
Alt

Senior Machine Learning Engineer

Senior Machine Learning Engineer role focusing on data problems, algorithm development, and model production in San Francisco.

Snowflake logo
Snowflake

Senior Software Engineer - LLM

Join Snowflake as a Senior Software Engineer to build scalable machine learning platforms with LLMs, leveraging Python and TensorFlow.

Carbon13 logo
Carbon13

Cofounder - Full Stack Developer/Data Scientist for Climatech Startup

Join Carbon13 as a cofounder in climate tech, leveraging AI, data science, and software development to combat climate change.