Cantina logo

Senior Machine Learning Engineer, Post-Training

Cantina

About Cantina

Cantina is a pioneering social platform with the most advanced AI character creator. Our platform allows users to build, share, and interact with AI bots and friends directly in the Cantina or across the internet. Cantina bots are lifelike, social creatures capable of interacting wherever humans go online. Whether you want to recreate yourself using powerful AI, imagine someone new, or choose from thousands of existing characters, Cantina offers a new media type that provides creators with infinitely scalable and personalized content experiences combined with seamless group chat across voice, video, and text.

About The Role

As a Senior Machine Learning Engineer, Post-Training, you will be part of the team responsible for developing our powerful pretrained language models into intelligent, engaging, and aligned products. You will work across teams and our technical stack to improve our model performance and training methods, including data, compute, and algorithms. This role will allow you to shape the conversational experience of humans and bots.

Responsibilities

  • Run evaluation and analysis of pre-trained models against certain performance and alignment issues.
  • Address these issues with additional training and fine-tuning techniques like DPO or RLHF.
  • Create datasets to address these issues.
  • Develop algorithms to improve performance metrics.
  • Contribute to Cantina’s open-source ML projects.

Requirements

  • 5+ years of experience building production-grade LLMs or Speech & Audio machine learning models in industry and/or academic research settings.
  • Experience with data processing, analysis, and curation.
  • Strong understanding of modern machine learning techniques (DPO, RLHF, transformers, etc).
  • Track record of exceptional research or creative applied ML projects.
  • Experience with product experimentation and A/B testing.
  • Experience training large models in a distributed setting.
  • Familiarity with ML deployment and orchestration (Kubernetes, Docker, cloud).

Location

We have offices located in Sunnyvale, CA, San Francisco, CA, and Brooklyn, NY. While we have a strong focus on individuals near our office hubs, we offer fully remote and hybrid employment opportunities.

Pay Equity

In compliance with Pay Transparency Laws, the base salary range for this role is between $175,000 - $250,000 for those located in the San Francisco Bay Area, New York City, and Seattle, WA. When determining compensation, a number of factors will be considered, including skills, experience, job scope, location, and competitive compensation market data.

Application Process

Please submit your resume, cover letter, and any relevant portfolio or publications demonstrating your research contributions in AI.

Benefits Summary

  • Health Care — 99% of premiums for medical, vision, dental are fully paid for by Cantina, plus One Medical membership.
  • Monthly Stipend — $500/month to use on whatever you’d like!
  • Rest and Recharge — 15 PTO days per year, 9 sick days, 13 paid company holidays, and offices closed for winter break (Christmas Eve to New Years Day)!
  • 401(K) — Eligible to participate on day one of employment.
  • Parental Leave & Fertility Support
  • Competitive Salary & Equity
  • Lunch and snacks provided for in-office employees.
  • WFH equipment provided for full-time hybrid/remote employees.

Benefits
Extracted with AI

  • Health Care — 99% of premiums for medical, vision, dental are fully paid for by Cantina, plus One Medical membership.
  • Monthly Stipend — $500/month to use on whatever you’d like!
  • Rest and Recharge — 15 PTO days per year, 9 sick days, 13 paid company holidays, and offices closed for winter break (Christmas Eve to New Years Day)!
  • 401(K) — Eligible to participate on day one of employment.
  • Parental Leave & Fertility Support
  • Competitive Salary & Equity
  • Lunch and snacks provided for in-office employees.
  • WFH equipment provided for full-time hybrid/remote employees.

Similar jobs

Last update: 23 minutes ago

dataroots logo
dataroots

Expert Machine Learning Engineer

Join Dataroots as an Expert Machine Learning Engineer to design and deliver AI-powered solutions, focusing on machine learning models.

BCG X logo
BCG X

AI Engineer

Join BCG X as an AI Engineer in Milan, Italy. Develop AI solutions, partner with clients, and drive innovation in a dynamic environment.

DeepL logo
DeepL

Senior Backend Engineer C++

Join DeepL as a Senior Backend Engineer C++ to design and maintain scalable backend services using C++ and AI technologies.

Cere Network logo
Cere Network

Principal AI Engineer

Join Cere Network as a Principal AI Engineer to drive AI innovation in Web3. Requires 10+ years in AI/ML, NLP, and software development.

Aiven logo
Aiven

Staff Software Engineer

Join Aiven as a Staff Software Engineer to develop cloud operations platforms using open-source technologies. Hybrid work in Berlin.

Together AI logo
Together AI

Senior Backend Engineer - Java, Rust, Go

Join Together AI as a Senior Backend Engineer in Amsterdam. Work with Java, Rust, and Go to build scalable backend systems.

Poggio logo
Poggio

Senior AI Engineer

Join Poggio as a Senior AI Engineer to innovate AI systems for enterprise sales, focusing on AI capabilities and system performance.

Personio logo
Personio

Staff Software Engineer, Data Platform

Join Personio as a Staff Software Engineer in Berlin to build scalable data platforms using Kafka, Kubernetes, and AWS. Drive innovation and excellence.

n8n logo
n8n

Senior Software Engineer (Node.js & TypeScript)

Join n8n as a Senior Software Engineer to build AI applications using Node.js and TypeScript. Remote role within Europe.

TrueLayer logo
TrueLayer

Senior Software Engineer - C#/.NET

Join TrueLayer as a Senior Software Engineer in Milan, working with C#, .NET, AWS, and Kubernetes to build scalable systems.

Nebius AI logo
Nebius AI

Senior Software Engineer (C++)

Join Nebius as a Senior Software Engineer (C++) to develop reliable cloud services in a hybrid work environment.

HeyJobs logo
HeyJobs

Senior Software Engineer - AWS, Python, Ruby on Rails

Join HeyJobs as a Senior Software Engineer to design scalable systems using AWS, Python, and Ruby on Rails in a dynamic team.

DwellFi  logo
DwellFi

AI Solutions Software Engineer

Join DwellFi as an AI Solutions Software Engineer to develop innovative AI solutions using LangChain or Llama. Remote position in Palo Alto, CA.

Computer Futures logo
Computer Futures

Cloud Data Engineer

Seeking a Cloud Data Engineer with expertise in AWS, Python, and CI/CD for a hybrid role in Hannover. Join our dynamic team!

Ilkari logo
Ilkari

Senior Software Engineer - Python, Django, Angular

Join Ilkari as a Senior Software Engineer to lead development in Python, Django, and Angular, creating scalable solutions in a hybrid work environment.

Attio logo
Attio

Senior Product Engineer [Rust & Typescript]

Join Attio as a Senior Product Engineer working with Rust & TypeScript to build innovative CRM features. Remote work available.

Motius logo
Motius

Senior Backend Developer

Join Motius as a Senior Backend Developer to work on cutting-edge R&D projects using AWS, Docker, GraphQL, and more in a hybrid work environment.

Nebius AI logo
Nebius AI

Senior Backend Engineer (Go)

Join Nebius as a Senior Backend Engineer (Go) to develop fault-tolerant cloud services in a hybrid work environment.

Zendesk logo
Zendesk

Senior Backend Engineer (Zendesk AI Agents)

Join Zendesk as a Senior Backend Engineer to develop AI-driven chatbots using TypeScript, MongoDB, and microservices architecture.

Aiven logo
Aiven

Senior Software Engineer - Python, Apache Kafka

Join Aiven as a Senior Software Engineer in Berlin, focusing on Python and Apache Kafka in a hybrid work environment.

Labelbox logo
Labelbox

Full-Stack Engineer with Angular and React.js

Join Labelbox as a Full-Stack Engineer to develop scalable systems using Angular, React.js, and GraphQL. Work remotely in a dynamic AI-driven environment.

Applied Intuition logo
Applied Intuition

Software Engineer - Autonomous Driving

Join Applied Intuition as a Software Engineer in Munich to tackle autonomous driving challenges with top ADAS/AV programs.

Huawei Nederland logo
Huawei Nederland

Senior ASR / TTS Researcher

Join Huawei's research center in Amsterdam as a Senior ASR/TTS Researcher, focusing on speech synthesis and AI.

MoonPay logo
MoonPay

Machine Learning Engineer

Join MoonPay as a Machine Learning Engineer to build and maintain ML infrastructure, collaborating with data scientists and cross-functional teams.