Deepgram logo

Senior Software Engineer, AI Inference

Deepgram

Company Overview

Deepgram is a foundational AI company on a mission to transform human-machine interaction using natural language. We give any developer access to the fastest, most powerful voice AI models including speech-to-text, text-to-speech, and spoken language understanding with just an API call. From transcription to sentiment analysis to voice synthesis, Deepgram is the preferred partner for builders of voice AI applications.

Opportunity

We are seeking a backend engineer focused on AI inference to join the team powering Deepgram’s core speech inference APIs. You’ll implement and optimize inference code, experiment with cutting-edge technologies, and develop, maintain, and deploy the stack of services behind our blazing-fast, massive-throughput inference system. This role blends work on backend services and systems with domain specialty in neural networks and GPU programming. Our team owns the applications that serve api.deepgram.com and empowers builders of innovative speech products by focusing on a world-class combination of reliability, efficiency, and latency.

What You’ll Do

  • Implement inference for novel model architectures developed by Deepgram’s trailblazing research team
  • Develop, test, and deploy application code for massive-scale production services
  • Debug complex system issues that include networking, scheduling, and high-performance computing interactions
  • Build tooling for internal analysis and benchmarking to identify opportunities for efficiency improvements
  • Experiment with optimization techniques for ML workloads on NVIDIA GPUs and ship the key wins to prod

You’ll Love This Role If You

  • Think of yourself as a generalist while enjoying learning deeply in specific areas, causing you to go from debugging a customer issue one day to designing an algorithm the next
  • Like sipping piña coladas and getting caught in the rain
  • Enjoy taking ownership of features from early collaborations with researchers through testing in production
  • Love getting nitty-gritty with profilers, hardware architectures, and inference algorithms
  • Want to work within the context of a humble, collaborative team that collectively owns mission-critical production services

It’s Important to Us That You Have

  • The ability to work collaboratively in a fast-paced environment and adapt to changing priorities
  • Proven industry experience building and shipping production services
  • Strong confidence in a lower-level language like C, C++, or Rust
  • Experience slicing large projects or initiatives into smaller experiments or incremental improvements
  • Expertise in a ML framework like Torch or Tensorflow
  • Experience with GPU programming using tools like CUDA or libraries like cuDNN, cuBLAS, etc.

It Would Be Great If You Also Had

  • Extensive professional experience with Rust and C++
  • Experience optimizing ML workloads in production
  • Familiarity with GPU hardware architecture and its impact on inference pipelines

Benefits
Extracted with AI

  • Remote work flexibility

Similar jobs

Last update: 23 minutes ago

DeepL logo
DeepL

Senior Backend Engineer C++

Join DeepL as a Senior Backend Engineer C++ to design and maintain scalable backend services using C++ and AI technologies.

FoodLabs logo
FoodLabs

Senior C++ Computer Vision Engineer

Join a cutting-edge AI-DeepTech startup in Berlin as a Senior C++ Computer Vision Engineer. Work on world-class on-device AI technology.

yourfirm GmbH logo
yourfirm GmbH

Senior Fullstack Developer for AI-Driven Mission Technologies

Seeking a Senior Fullstack Developer for AI-driven mission technologies, focusing on Java, JavaScript, Python, and C++. Remote work available.

Together AI logo
Together AI

Senior Backend Engineer - Java, Rust, Go

Join Together AI as a Senior Backend Engineer in Amsterdam. Work with Java, Rust, and Go to build scalable backend systems.

Grammarly logo
Grammarly

Entry Level Back-End Software Engineer (Java)

Join Grammarly as an Entry Level Back-End Software Engineer in Berlin. Work with Java, AWS, and more in a hybrid environment.

Climatiq logo
Climatiq

Senior Full Stack Engineer - Climate Tech - Rust & TypeScript

Join Climatiq as a Senior Full Stack Engineer to develop climate tech solutions using Rust and TypeScript. Remote work available.

Gorgias logo
Gorgias

Senior Full-Stack Engineer ReactJS/NodeJS

Join Gorgias as a Senior Full-Stack Engineer specializing in ReactJS and NodeJS, enhancing AI-powered ecommerce solutions.

Persona logo
Persona

LLM Backend Developer

Join Persona as a LLM Backend Developer, work remotely, and develop AI-driven backend systems for top startups.

Instapro Group logo
Instapro Group

Senior Backend Engineer - Payments

Join Instapro Group as a Senior Backend Engineer in Berlin, focusing on PHP and payment systems in a hybrid work environment.

Uber logo
Uber

Staff Software Engineer - Backend

Join Uber as a Staff Software Engineer - Backend, focusing on membership systems. Work with Java, Python, C++, and more in Amsterdam.

Sysdig logo
Sysdig

Senior Software Engineer - Backend Development

Join Sysdig as a Senior Software Engineer to develop scalable backend services using Go, RESTful APIs, and microservices in a hybrid work environment.

Uber logo
Uber

Staff Software Engineer - Backend

Join Uber as a Staff Software Engineer - Backend to develop and enhance solutions for millions of members globally.

Aiven logo
Aiven

Staff Software Engineer

Join Aiven as a Staff Software Engineer to develop cloud operations platforms using open-source technologies. Hybrid work in Berlin.

Catalyze Group logo
Catalyze Group

Full Stack Developer with AI and API Expertise

Join Catalyze Group as a Full Stack Developer to build AI-powered grant-writing tools. Work with React, Django, and more in Amsterdam.

ST Engineering iDirect logo
ST Engineering iDirect

Senior Software Engineer - Satellite Communications

Join ST Engineering iDirect as a Senior Software Engineer to develop satellite communication products using C++, Linux, and Agile methodologies.

GitLab logo
GitLab

Intermediate Backend Engineer - Ruby on Rails, PostgreSQL

Join GitLab as an Intermediate Backend Engineer to enhance CI platform performance using Ruby on Rails and PostgreSQL in a remote role.

Nebius AI logo
Nebius AI

Senior Backend Engineer (Go)

Join Nebius as a Senior Backend Engineer (Go) to develop fault-tolerant cloud services in a hybrid work environment.

Instapro Group logo
Instapro Group

Senior Backend Engineer - PHP, Symfony, Laravel

Join Instapro Group as a Senior Backend Engineer, working with PHP, Symfony, and Laravel in a hybrid environment.

Uber logo
Uber

Staff Software Engineer, Fullstack, Capacity & Efficiency Engineering

Join Uber as a Staff Software Engineer in Amsterdam, focusing on fullstack development and capacity efficiency engineering.

Neon logo
Neon

Software Engineer, Storage (Rust, PostgreSQL)

Join Neon as a Software Engineer, Storage. Work with Rust and PostgreSQL to build scalable, reliable cloud-native database services.

Carbon13 logo
Carbon13

Cofounder - Full Stack Developer/Data Scientist for Climatech Startup

Join Carbon13 as a cofounder in climate tech, leveraging AI, data science, and software development to combat climate change.

Reaktor logo
Reaktor

Lead Developer with DevOps and Functional Programming

Join Reaktor as a Lead Developer in Amsterdam, focusing on DevOps, Functional Programming, and JavaScript in a hybrid work environment.

LEGALFLY logo
LEGALFLY

Back End Engineer with Node.js and TypeScript

Join LegalFly as a Back End Engineer to revolutionize legal AI with Node.js and TypeScript in a hybrid role in Ghent.

Binance logo
Binance

Senior Backend Developer (Node.js) / SRE

Join Binance as a Senior Backend Developer (Node.js) / SRE to develop monitoring systems for high-load production environments.