Deepgram logo

Senior Software Engineer, AI Inference

Deepgram

Company Overview

Deepgram is a foundational AI company on a mission to transform human-machine interaction using natural language. We give any developer access to the fastest, most powerful voice AI models including speech-to-text, text-to-speech, and spoken language understanding with just an API call. From transcription to sentiment analysis to voice synthesis, Deepgram is the preferred partner for builders of voice AI applications.

Opportunity

We are seeking a backend engineer focused on AI inference to join the team powering Deepgram’s core speech inference APIs. You’ll implement and optimize inference code, experiment with cutting-edge technologies, and develop, maintain, and deploy the stack of services behind our blazing-fast, massive-throughput inference system. This role blends work on backend services and systems with domain specialty in neural networks and GPU programming. Our team owns the applications that serve api.deepgram.com and empowers builders of innovative speech products by focusing on a world-class combination of reliability, efficiency, and latency.

What You’ll Do

  • Implement inference for novel model architectures developed by Deepgram’s trailblazing research team
  • Develop, test, and deploy application code for massive-scale production services
  • Debug complex system issues that include networking, scheduling, and high-performance computing interactions
  • Build tooling for internal analysis and benchmarking to identify opportunities for efficiency improvements
  • Experiment with optimization techniques for ML workloads on NVIDIA GPUs and ship the key wins to prod

You’ll Love This Role If You

  • Think of yourself as a generalist while enjoying learning deeply in specific areas, causing you to go from debugging a customer issue one day to designing an algorithm the next
  • Like sipping piña coladas and getting caught in the rain
  • Enjoy taking ownership of features from early collaborations with researchers through testing in production
  • Love getting nitty-gritty with profilers, hardware architectures, and inference algorithms
  • Want to work within the context of a humble, collaborative team that collectively owns mission-critical production services

It’s Important to Us That You Have

  • The ability to work collaboratively in a fast-paced environment and adapt to changing priorities
  • Proven industry experience building and shipping production services
  • Strong confidence in a lower-level language like C, C++, or Rust
  • Experience slicing large projects or initiatives into smaller experiments or incremental improvements
  • Expertise in a ML framework like Torch or Tensorflow
  • Experience with GPU programming using tools like CUDA or libraries like cuDNN, cuBLAS, etc.

It Would Be Great If You Also Had

  • Extensive professional experience with Rust and C++
  • Experience optimizing ML workloads in production
  • Familiarity with GPU hardware architecture and its impact on inference pipelines

Benefits
Extracted with AI

  • Remote work flexibility

Similar jobs

Last update: 23 minutes ago

Deepgram logo
Deepgram

Software Engineer, Front End

Join Deepgram as a Front End Software Engineer to build innovative voice AI applications using Elm and React.

Groq logo
Groq

Senior/Principal Software Engineer

Join Groq as a Senior/Principal Software Engineer to design and develop scalable software for AI inference technology.

Groq logo
Groq

Software Engineer, Developer Experience

Join Groq as a Software Engineer to enhance APIs for fast AI applications, focusing on optimization and high-performance computing.

DigitalOcean logo
DigitalOcean

Senior Software Engineer (AI/ML)

Join DigitalOcean as a Senior Software Engineer (AI/ML) to build AI/ML features using TypeScript, React, and GraphQL. Remote role with competitive benefits.

Standard AI logo
Standard AI

Senior Software Engineer, Rust

Senior Software Engineer specializing in Rust, remote role in Italy, focusing on high-performance systems and cloud services.

Hamming AI logo
Hamming AI

Senior Backend Engineer

Join Hamming AI as a Senior Backend Engineer to scale AI voice agent systems. Work with Next.JS, TypeScript, Python, and more.

Amazon logo
Amazon

Senior Software Engineer - Generative AI, AGI Inference Engine

Join Amazon as a Senior Software Engineer to advance Generative AI capabilities, focusing on high-performance inference.

DeepL logo
DeepL

Senior Backend Engineer C++

Join DeepL as a Senior Backend Engineer C++ to design and maintain scalable backend services using C++ and AI technologies.

Perplexity logo
Perplexity

AI Inference Engineer

Join Perplexity as an AI Inference Engineer in San Francisco, focusing on large-scale ML model deployments and system reliability.

Play logo
Play

Senior Software Engineer - Generative AI Voice Technology

Senior Software Engineer role in Generative AI Voice Technology, pioneering large voice models with competitive benefits.

Iris logo
Iris

Senior AI Engineer

Senior AI Engineer role focusing on Go, TypeScript, Python in a remote-first, innovative AI platform.

Essential AI logo
Essential AI

Senior Backend Engineer, Forward Deployed

Senior Backend Engineer role focusing on infrastructure, customer solutions, and deployment in San Francisco.

AssemblyAI logo
AssemblyAI

Software Engineer - Developer Foundations

Join AssemblyAI as a Software Engineer to build scalable backend systems and developer tools. Work remotely with cutting-edge AI technology.

DigitalOcean logo
DigitalOcean

Senior Software Engineer (AI/ML)

Join DigitalOcean as a Senior Software Engineer (AI/ML) to build AI/ML features using TypeScript, React, and GraphQL. Remote role with competitive benefits.

Discord logo
Discord

Senior Software Engineer - Media Infrastructure

Join Discord as a Senior Software Engineer in Media Infrastructure, focusing on backend development, Python, Rust, and media management.

Together AI logo
Together AI

Senior Backend Engineer - Java, Rust, Go

Join Together AI as a Senior Backend Engineer in Amsterdam. Work with Java, Rust, and Go to build scalable backend systems.

Amazon logo
Amazon

Senior Software Engineer - Generative AI

Join Amazon as a Senior Software Engineer in Generative AI, focusing on high-performance inference capabilities.

Discord logo
Discord

Senior Software Engineer, Machine Learning Platform

Join Discord as a Senior Software Engineer in Machine Learning Platform, working remotely with AWS, Generative AI, and real-time data.

Meaningful Gigs logo
Meaningful Gigs

Senior Software Engineer - Backend (Founding Member)

Senior Backend Software Engineer role focusing on innovative AI-driven tools, remote work, competitive benefits.

Sourcegraph logo
Sourcegraph

Senior Machine Learning Engineer

Join Sourcegraph as a Senior ML Engineer to revolutionize code intelligence with AI and NLP.

CentML logo
CentML

Senior Software Engineer - LLM Inference

Join CentML as a Senior Software Engineer focusing on LLM Inference, leveraging AI, ML, and GPU technologies.

Scale AI logo
Scale AI

Senior Software Engineer, Machine Learning Infrastructure

Join Scale AI as a Senior Software Engineer in Machine Learning Infrastructure, focusing on backend system design and ML Infrastructure.

Speak logo
Speak

Senior Backend Engineer

Senior Backend Engineer at Speak, focusing on AI-driven language learning applications.

InstaDeep logo
InstaDeep

Senior Machine Learning Engineer - Scaling and Performance Optimization

Join InstaDeep as a Senior Machine Learning Engineer focusing on scaling and performance optimization in Paris. Work with cutting-edge AI technologies.