Senior Machine Learning Engineer, Post-Training

Cantina

About Cantina

Cantina is a pioneering social platform with the most advanced AI character creator. Our platform allows users to build, share, and interact with AI bots and friends directly in the Cantina or across the internet. Cantina bots are lifelike, social creatures capable of interacting wherever humans go online. Whether you want to recreate yourself using powerful AI, imagine someone new, or choose from thousands of existing characters, Cantina offers a new media type that provides creators with infinitely scalable and personalized content experiences combined with seamless group chat across voice, video, and text.

About The Role

As a Senior Machine Learning Engineer, Post-Training, you will be part of the team responsible for developing our powerful pretrained language models into intelligent, engaging, and aligned products. You will work across teams and our technical stack to improve our model performance and training methods, including data, compute, and algorithms. This role will allow you to shape the conversational experience of humans and bots.

Responsibilities

  • Evaluate and analyze pretrained models to identify performance and alignment issues.
  • Address these issues with additional training and fine-tuning techniques like DPO or RLHF.
  • Create datasets to address these issues.
  • Develop algorithms to improve performance metrics.
  • Contribute to Cantina’s open-source ML projects.

Requirements

  • 5+ years of experience building production-grade LLMs or Speech & Audio machine learning models in industry and/or academic research settings.
  • Experience with data processing, analysis, and curation.
  • Strong understanding of modern machine learning techniques (DPO, RLHF, transformers, etc.).
  • Track record of exceptional research or creative applied ML projects.
  • Experience with product experimentation and A/B testing.
  • Experience training large models in a distributed setting.
  • Familiarity with ML deployment and orchestration (Kubernetes, Docker, cloud).

Location

We have offices in Sunnyvale, CA; San Francisco, CA; and Brooklyn, NY. While we have a strong focus on candidates near our office hubs, we also offer fully remote and hybrid employment opportunities.

Pay Equity

In compliance with pay transparency laws, the base salary range for this role is $175,000 to $250,000 for those located in the San Francisco Bay Area, New York City, and Seattle, WA. Compensation is determined by a number of factors, including skills, experience, job scope, location, and competitive compensation market data.

Application Process

Please submit your resume, cover letter, and any relevant portfolio or publications demonstrating your research contributions in AI.

Benefits Summary

  • Health Care: Cantina pays 99% of premiums for medical, vision, and dental, plus One Medical membership.
  • Monthly Stipend: $500/month to use on whatever you’d like!
  • Rest and Recharge: 15 PTO days per year, 9 sick days, 13 paid company holidays, and offices closed for winter break (Christmas Eve to New Year’s Day)!
  • 401(K): Eligible to participate on day one of employment.
  • Parental Leave & Fertility Support
  • Competitive Salary & Equity
  • Lunch and snacks provided for in-office employees.
  • WFH equipment provided for full-time hybrid/remote employees.
