Senior Software Engineer, AI Inference

Company Overview

Deepgram is a foundational AI company on a mission to transform human-machine interaction using natural language. We give any developer access to the fastest, most powerful voice AI models including speech-to-text, text-to-speech, and spoken language understanding with just an API call. From transcription to sentiment analysis to voice synthesis, Deepgram is the preferred partner for builders of voice AI applications.

Opportunity

We are seeking a backend engineer focused on AI inference to join the team powering Deepgram’s core speech inference APIs. You’ll implement and optimize inference code, experiment with cutting-edge technologies, and develop, maintain, and deploy the stack of services behind our blazing-fast, massive-throughput inference system. This role blends work on backend services and systems with domain specialty in neural networks and GPU programming. Our team owns the applications that serve api.deepgram.com and empowers builders of innovative speech products by focusing on a world-class combination of reliability, efficiency, and latency.

What You’ll Do

Implement inference for novel model architectures developed by Deepgram’s trailblazing research team
Develop, test, and deploy application code for massive-scale production services
Debug complex system issues that include networking, scheduling, and high-performance computing interactions
Build tooling for internal analysis and benchmarking to identify opportunities for efficiency improvements
Experiment with optimization techniques for ML workloads on NVIDIA GPUs and ship the key wins to prod

You’ll Love This Role If You

Think of yourself as a generalist who also enjoys learning deeply in specific areas, going from debugging a customer issue one day to designing an algorithm the next
Like sipping piña coladas and getting caught in the rain
Enjoy taking ownership of features from early collaborations with researchers through testing in production
Love getting nitty-gritty with profilers, hardware architectures, and inference algorithms
Want to work within the context of a humble, collaborative team that collectively owns mission-critical production services

It’s Important to Us That You Have

The ability to work collaboratively in a fast-paced environment and adapt to changing priorities
Proven industry experience building and shipping production services
Strong proficiency in a lower-level language like C, C++, or Rust
Experience slicing large projects or initiatives into smaller experiments or incremental improvements
Expertise in an ML framework like Torch or TensorFlow
Experience with GPU programming using tools like CUDA or libraries like cuDNN, cuBLAS, etc.

It Would Be Great If You Also Had

Extensive professional experience with Rust and C++
Experience optimizing ML workloads in production
Familiarity with GPU hardware architecture and its impact on inference pipelines

Benefits

Remote work flexibility
