Perplexity logo

AI Inference Engineer

Perplexity

Job Overview

We are seeking an AI Inference Engineer to join our dynamic team at Perplexity in San Francisco. This role involves working on large-scale deployment of machine learning models for real-time inference, focusing on both internal and external applications.

Responsibilities

  • Develop APIs for AI inference to be used by a diverse range of customers.
  • Benchmark and address bottlenecks in our inference stack.
  • Improve the reliability and observability of our systems and respond to system outages.
  • Explore novel research and implement LLM inference optimizations.

Qualifications

  • Experience with ML systems and deep learning frameworks such as PyTorch, TensorFlow, and ONNX.
  • Familiarity with common LLM architectures and inference optimization techniques like continuous batching and quantization.
  • Optional: Understanding of GPU architectures or experience with GPU kernel programming using CUDA.

Company Growth and Opportunities

Perplexity has experienced tremendous growth, amassing 10 million monthly active users and serving over 500 million queries globally. With significant funding and a valuation over $1 billion, we offer substantial opportunities for career advancement and impact.

Compensation and Benefits

  • Salary Range: $190,000 - $240,000 annually.
  • Equity: Equity is part of the total compensation package.
  • Benefits: Comprehensive health, dental, and vision insurance for you and your dependents, including a 401(k) plan.

Join us at Perplexity, where your work in AI inference will help drive the future of real-time, large-scale machine learning applications.

Benefits
Extracted with AI

  • Comprehensive health, dental, and vision insurance
  • 401(k) plan

Similar jobs

Last update: 23 minutes ago

FoodLabs logo
FoodLabs

Senior C++ Computer Vision Engineer

Join a cutting-edge AI-DeepTech startup in Berlin as a Senior C++ Computer Vision Engineer. Work on world-class on-device AI technology.

Perplexity logo
Perplexity

AI Research Engineer

Join Perplexity as an AI Research Engineer to innovate AI-powered search solutions using LLMs in San Francisco.

DeepL logo
DeepL

Senior Backend Engineer C++

Join DeepL as a Senior Backend Engineer C++ to design and maintain scalable backend services using C++ and AI technologies.

Perplexity logo
Perplexity

AI Research Engineer - LLM Training

Join Perplexity as an AI Research Engineer to enhance LLMs using AI, ML, and NLP in San Francisco.

yourfirm GmbH logo
yourfirm GmbH

Senior Fullstack Developer for AI-Driven Mission Technologies

Seeking a Senior Fullstack Developer for AI-driven mission technologies, focusing on Java, JavaScript, Python, and C++. Remote work available.

Persona logo
Persona

LLM Backend Developer

Join Persona as a LLM Backend Developer, work remotely, and develop AI-driven backend systems for top startups.

BCG X logo
BCG X

AI Engineer

Join BCG X as an AI Engineer in Milan, Italy. Develop AI solutions, partner with clients, and drive innovation in a dynamic environment.

Huawei Nederland logo
Huawei Nederland

Information Retrieval Algorithm Engineer

Join Huawei as an Information Retrieval Algorithm Engineer to develop cutting-edge AI technologies in Amsterdam.

Reddit, Inc. logo
Reddit, Inc.

Senior Solutions Engineer

Join Reddit as a Senior Solutions Engineer in Amsterdam to support our growing advertising business with technical expertise and problem-solving skills.

dataroots logo
dataroots

Expert Machine Learning Engineer

Join Dataroots as an Expert Machine Learning Engineer to design and deliver AI-powered solutions, focusing on machine learning models.

Darktrace logo
Darktrace

Solutions Engineer

Join Darktrace as a Solutions Engineer in Amsterdam, providing technical pre-sales and post-sales support in a hybrid work environment.

Aiven logo
Aiven

Staff Software Engineer

Join Aiven as a Staff Software Engineer to develop cloud operations platforms using open-source technologies. Hybrid work in Berlin.

Reaktor logo
Reaktor

Lead Developer with DevOps and Functional Programming

Join Reaktor as a Lead Developer in Amsterdam, focusing on DevOps, Functional Programming, and JavaScript in a hybrid work environment.

Catalyze Group logo
Catalyze Group

Full Stack Developer with AI and API Expertise

Join Catalyze Group as a Full Stack Developer to build AI-powered grant-writing tools. Work with React, Django, and more in Amsterdam.

Nebius AI logo
Nebius AI

Senior Software Engineer (C++)

Join Nebius as a Senior Software Engineer (C++) to develop reliable cloud services in a hybrid work environment.

Instapro Group logo
Instapro Group

Senior Backend Engineer - Payments

Join Instapro Group as a Senior Backend Engineer in Berlin, focusing on PHP and payment systems in a hybrid work environment.

Cere Network logo
Cere Network

Principal AI Engineer

Join Cere Network as a Principal AI Engineer to drive AI innovation in Web3. Requires 10+ years in AI/ML, NLP, and software development.

AnyDesk Software logo
AnyDesk Software

Salesforce Software Engineer

Join AnyDesk as a Salesforce Software Engineer to develop and maintain internal business systems in a dynamic, remote-friendly environment.

Personio logo
Personio

Staff Software Engineer, Data Platform

Join Personio as a Staff Software Engineer in Berlin to build scalable data platforms using Kafka, Kubernetes, and AWS. Drive innovation and excellence.

Instapro Group logo
Instapro Group

Senior Backend Engineer - PHP, Symfony, Laravel

Join Instapro Group as a Senior Backend Engineer, working with PHP, Symfony, and Laravel in a hybrid environment.

Aiven logo
Aiven

Senior Software Engineer - Python, Apache Kafka

Join Aiven as a Senior Software Engineer in Berlin, focusing on Python and Apache Kafka in a hybrid work environment.

Gorgias logo
Gorgias

Senior Full-Stack Engineer ReactJS/NodeJS

Join Gorgias as a Senior Full-Stack Engineer specializing in ReactJS and NodeJS, enhancing AI-powered ecommerce solutions.

Pleo logo
Pleo

Senior Backend Engineer (Kotlin, Microservices)

Join Pleo as a Senior Backend Engineer, focusing on Kotlin and microservices in Berlin. Drive innovation in a dynamic, collaborative environment.

Computer Futures logo
Computer Futures

Cloud Data Engineer

Seeking a Cloud Data Engineer with expertise in AWS, Python, and CI/CD for a hybrid role in Hannover. Join our dynamic team!