Job Overview
We are seeking an AI Inference Engineer to join our dynamic team at Perplexity in San Francisco. This role involves working on large-scale deployment of machine learning models for real-time inference, focusing on both internal and external applications.
Responsibilities
- Develop APIs for AI inference to be used by a diverse range of customers.
- Benchmark and address bottlenecks in our inference stack.
- Improve the reliability and observability of our systems and respond to system outages.
- Explore novel research and implement LLM inference optimizations.
Qualifications
- Experience with ML systems and deep learning frameworks such as PyTorch, TensorFlow, and ONNX.
- Familiarity with common LLM architectures and inference optimization techniques like continuous batching and quantization.
- Optional: Understanding of GPU architectures or experience with GPU kernel programming using CUDA.
Company Growth and Opportunities
Perplexity has experienced tremendous growth, amassing 10 million monthly active users and serving over 500 million queries globally. With significant funding and a valuation over $1 billion, we offer substantial opportunities for career advancement and impact.
Compensation and Benefits
- Salary Range: $190,000 - $240,000 annually.
- Equity: Equity is part of the total compensation package.
- Benefits: Comprehensive health, dental, and vision insurance for you and your dependents, including a 401(k) plan.
Join us at Perplexity, where your work in AI inference will help drive the future of real-time, large-scale machine learning applications.
Benefits Extracted with AI
- Comprehensive health, dental, and vision insurance
- 401(k) plan
Similar jobs
Last update: 23 minutes ago
AI Research Engineer
Join Perplexity as an AI Research Engineer to innovate AI-powered search solutions using LLMs in San Francisco.
AI Research Engineer - LLM Training
Join Perplexity as an AI Research Engineer to enhance LLMs using AI, ML, and NLP in San Francisco.
Senior Backend Engineer C++
Join DeepL as a Senior Backend Engineer C++ to design and maintain scalable backend services using C++ and AI technologies.
Principal AI Engineer
Join Cere Network as a Principal AI Engineer to drive AI innovation in Web3. Requires 10+ years in AI/ML, NLP, and software development.
Software Engineer - Autonomous Driving
Join Applied Intuition as a Software Engineer in Munich to tackle autonomous driving challenges with top ADAS/AV programs.
Senior AI Engineer
Join Poggio as a Senior AI Engineer to innovate AI systems for enterprise sales, focusing on AI capabilities and system performance.
AI Engineer
Join BCG X as an AI Engineer in Milan, Italy. Develop AI solutions, partner with clients, and drive innovation in a dynamic environment.
Senior Software Engineer (C++)
Join Nebius as a Senior Software Engineer (C++) to develop reliable cloud services in a hybrid work environment.
Information Retrieval Algorithm Engineer
Join Huawei as an Information Retrieval Algorithm Engineer to develop cutting-edge AI technologies in Amsterdam.
AI Solutions Software Engineer
Join DwellFi as an AI Solutions Software Engineer to develop innovative AI solutions using LangChain or Llama. Remote position in Palo Alto, CA.
Expert Machine Learning Engineer
Join Dataroots as an Expert Machine Learning Engineer to design and deliver AI-powered solutions, focusing on machine learning models.
Staff Software Engineer
Join Aiven as a Staff Software Engineer to develop cloud operations platforms using open-source technologies. Hybrid work in Berlin.
Senior Backend Engineer (Go)
Join Nebius as a Senior Backend Engineer (Go) to develop fault-tolerant cloud services in a hybrid work environment.
Senior Backend Engineer - Java, Rust, Go
Join Together AI as a Senior Backend Engineer in Amsterdam. Work with Java, Rust, and Go to build scalable backend systems.
Machine Learning Engineer - LLM Fine-tuning and Performance
Join NVIDIA as a Machine Learning Engineer specializing in LLM fine-tuning and performance optimization. Work with cutting-edge ML technologies.
Staff Software Engineer, Data Platform
Join Personio as a Staff Software Engineer in Berlin to build scalable data platforms using Kafka, Kubernetes, and AWS. Drive innovation and excellence.
Senior Software Engineer - Python, Apache Kafka
Join Aiven as a Senior Software Engineer in Berlin, focusing on Python and Apache Kafka in a hybrid work environment.
Cloud Data Engineer
Seeking a Cloud Data Engineer with expertise in AWS, Python, and CI/CD for a hybrid role in Hannover. Join our dynamic team!
Frontend Engineer, AI
Join Aleph as a Frontend Engineer focusing on AI to develop innovative features using React.js and AI technologies in a remote role.
Software Engineer II - Developer Experience
Join Elastic as a Software Engineer II in Developer Experience, focusing on test frameworks for Kibana. Remote work, competitive benefits.
Back End Engineer with Node.js and TypeScript
Join LegalFly as a Back End Engineer to revolutionize legal AI with Node.js and TypeScript in a hybrid role in Ghent.
Senior Software Engineer - LLM
Join Bonfy.AI as a Senior Software Engineer to develop and optimize scalable machine learning models using Python, TensorFlow, and cloud platforms.
Front-end Angular Engineer
Join Zivver as a Front-end Angular Engineer to shape the future of secure web applications. Work with Angular, TypeScript, and more in Amsterdam.
Data Engineer with Azure and PySpark
Join BeFrank as a Data Engineer to build and enhance our data platform using Azure and PySpark. Hybrid work in Amsterdam.