About Cantina
Cantina is a pioneering social platform with the most advanced AI character creator. Our platform allows users to build, share, and interact with AI bots and friends directly in the Cantina or across the internet. Cantina bots are lifelike, social creatures capable of interacting wherever humans go online. Whether you want to recreate yourself using powerful AI, imagine someone new, or choose from thousands of existing characters, Cantina offers a new media type that provides creators with infinitely scalable and personalized content experiences combined with seamless group chat across voice, video, and text.
About The Role
As a Senior Machine Learning Engineer, Post-Training, you will be part of the team responsible for developing our powerful pretrained language models into intelligent, engaging, and aligned products. You will work across teams and our technical stack to improve our model performance and training methods, including data, compute, and algorithms. This role will allow you to shape the conversational experience of humans and bots.
Responsibilities
- Run evaluation and analysis of pre-trained models against certain performance and alignment issues.
- Address these issues with additional training and fine-tuning techniques like DPO or RLHF.
- Create datasets to address these issues.
- Develop algorithms to improve performance metrics.
- Contribute to Cantina’s open-source ML projects.
Requirements
- 5+ years of experience building production-grade LLMs or Speech & Audio machine learning models in industry and/or academic research settings.
- Experience with data processing, analysis, and curation.
- Strong understanding of modern machine learning techniques (DPO, RLHF, transformers, etc).
- Track record of exceptional research or creative applied ML projects.
- Experience with product experimentation and A/B testing.
- Experience training large models in a distributed setting.
- Familiarity with ML deployment and orchestration (Kubernetes, Docker, cloud).
Location
We have offices located in Sunnyvale, CA, San Francisco, CA, and Brooklyn, NY. While we have a strong focus on individuals near our office hubs, we offer fully remote and hybrid employment opportunities.
Pay Equity
In compliance with Pay Transparency Laws, the base salary range for this role is between $175,000 - $250,000 for those located in the San Francisco Bay Area, New York City, and Seattle, WA. When determining compensation, a number of factors will be considered, including skills, experience, job scope, location, and competitive compensation market data.
Application Process
Please submit your resume, cover letter, and any relevant portfolio or publications demonstrating your research contributions in AI.
Benefits Summary
- Health Care — 99% of premiums for medical, vision, dental are fully paid for by Cantina, plus One Medical membership.
- Monthly Stipend — $500/month to use on whatever you’d like!
- Rest and Recharge — 15 PTO days per year, 9 sick days, 13 paid company holidays, and offices closed for winter break (Christmas Eve to New Years Day)!
- 401(K) — Eligible to participate on day one of employment.
- Parental Leave & Fertility Support
- Competitive Salary & Equity
- Lunch and snacks provided for in-office employees.
- WFH equipment provided for full-time hybrid/remote employees.
Benefits Extracted with AI
- Health Care — 99% of premiums for medical, vision, dental are fully paid for by Cantina, plus One Medical membership.
- Monthly Stipend — $500/month to use on whatever you’d like!
- Rest and Recharge — 15 PTO days per year, 9 sick days, 13 paid company holidays, and offices closed for winter break (Christmas Eve to New Years Day)!
- 401(K) — Eligible to participate on day one of employment.
- Parental Leave & Fertility Support
- Competitive Salary & Equity
- Lunch and snacks provided for in-office employees.
- WFH equipment provided for full-time hybrid/remote employees.
Similar jobs
Last update: 23 minutes ago
Senior Machine Learning Engineer
Join Cantina as a Senior Machine Learning Engineer to design and maintain ML infrastructure, optimize performance, and integrate models.
Senior Machine Learning Engineer
Senior ML Engineer at Cantina, designing AI models for a social platform. Skills in AI, ML, NLP, Python. Remote options available.
Senior Machine Learning Engineer - Data
Join Cantina as a Senior Machine Learning Engineer focusing on data collection and AI development.
Senior Machine Learning Engineer - Images
Join Cantina as a Senior Machine Learning Engineer to design and improve AI models for image generation.
Research Scientist - AI and Computer Vision
Join Cantina as a Research Scientist to advance AI-driven social platforms with cutting-edge video and image generation models.
Senior C++ Computer Vision Engineer
Join a cutting-edge AI-DeepTech startup in Berlin as a Senior C++ Computer Vision Engineer. Work on world-class on-device AI technology.
Senior Mobile Gaming Engineer
Join Cantina as a Senior Mobile Gaming Engineer to design AI-embedded mobile games. Work with iOS, Android, and web technologies.
Expert Machine Learning Engineer
Join Dataroots as an Expert Machine Learning Engineer to design and deliver AI-powered solutions, focusing on machine learning models.
Senior Backend Engineer (Go)
Senior Backend Engineer specializing in Go, involved in building and maintaining complex systems with a focus on reliability and scalability.
Senior Mobile Gaming Engineer
Join Cantina as a Senior Mobile Gaming Engineer to design and build AI-embedded mobile-first gaming platforms.
Senior Fullstack Developer for AI-Driven Mission Technologies
Seeking a Senior Fullstack Developer for AI-driven mission technologies, focusing on Java, JavaScript, Python, and C++. Remote work available.
Senior Media Software Engineer (Real-Time)
Senior Media Software Engineer needed for AI-driven real-time media platform, skilled in C/C++, WebRTC, and mobile development.
Full Stack Developer with AI and API Expertise
Join Catalyze Group as a Full Stack Developer to build AI-powered grant-writing tools. Work with React, Django, and more in Amsterdam.
LLM Backend Developer
Join Persona as a LLM Backend Developer, work remotely, and develop AI-driven backend systems for top startups.
AI Engineer
Join BCG X as an AI Engineer in Milan, Italy. Develop AI solutions, partner with clients, and drive innovation in a dynamic environment.
Senior AI Engineer
Join Poggio as a Senior AI Engineer to innovate AI systems for enterprise sales, focusing on AI capabilities and system performance.
Senior Backend Engineer C++
Join DeepL as a Senior Backend Engineer C++ to design and maintain scalable backend services using C++ and AI technologies.
Staff Software Engineer
Join Aiven as a Staff Software Engineer to develop cloud operations platforms using open-source technologies. Hybrid work in Berlin.
Senior Full-Stack Engineer ReactJS/NodeJS
Join Gorgias as a Senior Full-Stack Engineer specializing in ReactJS and NodeJS, enhancing AI-powered ecommerce solutions.
Senior AI Engineer
Join Poggio as a Senior AI Engineer to revolutionize sales with AI. Work remotely, leverage LLMs, and enhance AI systems.
Senior AI/ML Engineer
Join Coda as a Senior AI/ML Engineer to develop cutting-edge AI solutions using Python, NLP, and ML frameworks. Remote work available.
Cofounder - Full Stack Developer/Data Scientist for Climatech Startup
Join Carbon13 as a cofounder in climate tech, leveraging AI, data science, and software development to combat climate change.
Senior Machine Learning Systems Engineer
Senior ML Systems Engineer role focusing on developing AI training systems with competitive benefits in Seattle.
Principal AI Engineer
Join Cere Network as a Principal AI Engineer to drive AI innovation in Web3. Requires 10+ years in AI/ML, NLP, and software development.