Deepgram logo

Senior Software Engineer, AI Inference

Deepgram

Company Overview

Deepgram is a foundational AI company on a mission to transform human-machine interaction using natural language. We give any developer access to the fastest, most powerful voice AI models including speech-to-text, text-to-speech, and spoken language understanding with just an API call. From transcription to sentiment analysis to voice synthesis, Deepgram is the preferred partner for builders of voice AI applications.

Opportunity

We are seeking a backend engineer focused on AI inference to join the team powering Deepgram’s core speech inference APIs. You’ll implement and optimize inference code, experiment with cutting-edge technologies, and develop, maintain, and deploy the stack of services behind our blazing-fast, massive-throughput inference system. This role blends work on backend services and systems with domain specialty in neural networks and GPU programming. Our team owns the applications that serve api.deepgram.com and empowers builders of innovative speech products by focusing on a world-class combination of reliability, efficiency, and latency.

What You’ll Do

  • Implement inference for novel model architectures developed by Deepgram’s trailblazing research team
  • Develop, test, and deploy application code for massive-scale production services
  • Debug complex system issues that include networking, scheduling, and high-performance computing interactions
  • Build tooling for internal analysis and benchmarking to identify opportunities for efficiency improvements
  • Experiment with optimization techniques for ML workloads on NVIDIA GPUs and ship the key wins to prod

You’ll Love This Role If You

  • Think of yourself as a generalist while enjoying learning deeply in specific areas, causing you to go from debugging a customer issue one day to designing an algorithm the next
  • Like sipping piña coladas and getting caught in the rain
  • Enjoy taking ownership of features from early collaborations with researchers through testing in production
  • Love getting nitty-gritty with profilers, hardware architectures, and inference algorithms
  • Want to work within the context of a humble, collaborative team that collectively owns mission-critical production services

It’s Important to Us That You Have

  • The ability to work collaboratively in a fast-paced environment and adapt to changing priorities
  • Proven industry experience building and shipping production services
  • Strong confidence in a lower-level language like C, C++, or Rust
  • Experience slicing large projects or initiatives into smaller experiments or incremental improvements
  • Expertise in a ML framework like Torch or Tensorflow
  • Experience with GPU programming using tools like CUDA or libraries like cuDNN, cuBLAS, etc.

It Would Be Great If You Also Had

  • Extensive professional experience with Rust and C++
  • Experience optimizing ML workloads in production
  • Familiarity with GPU hardware architecture and its impact on inference pipelines

Benefits
Extracted with AI

  • Remote work flexibility

Similar jobs

Last update: 23 minutes ago

DeepL logo
DeepL

Senior Backend Engineer C++

Join DeepL as a Senior Backend Engineer C++ to design and maintain scalable backend services using C++ and AI technologies.

Together AI logo
Together AI

Senior Backend Engineer - Java, Rust, Go

Join Together AI as a Senior Backend Engineer in Amsterdam. Work with Java, Rust, and Go to build scalable backend systems.

FoodLabs logo
FoodLabs

Senior C++ Computer Vision Engineer

Join a cutting-edge AI-DeepTech startup in Berlin as a Senior C++ Computer Vision Engineer. Work on world-class on-device AI technology.

yourfirm GmbH logo
yourfirm GmbH

Senior Fullstack Developer for AI-Driven Mission Technologies

Seeking a Senior Fullstack Developer for AI-driven mission technologies, focusing on Java, JavaScript, Python, and C++. Remote work available.

Grammarly logo
Grammarly

Entry Level Back-End Software Engineer (Java)

Join Grammarly as an Entry Level Back-End Software Engineer in Berlin. Work with Java, AWS, and more in a hybrid environment.

Sysdig logo
Sysdig

Senior Software Engineer - Backend Development

Join Sysdig as a Senior Software Engineer to develop scalable backend services using Go, RESTful APIs, and microservices in a hybrid work environment.

Climatiq logo
Climatiq

Senior Full Stack Engineer - Climate Tech - Rust & TypeScript

Join Climatiq as a Senior Full Stack Engineer to develop climate tech solutions using Rust and TypeScript. Remote work available.

Persona logo
Persona

LLM Backend Developer

Join Persona as a LLM Backend Developer, work remotely, and develop AI-driven backend systems for top startups.

Nebius AI logo
Nebius AI

Senior Backend Engineer (Go)

Join Nebius as a Senior Backend Engineer (Go) to develop fault-tolerant cloud services in a hybrid work environment.

Aiven logo
Aiven

Staff Software Engineer

Join Aiven as a Staff Software Engineer to develop cloud operations platforms using open-source technologies. Hybrid work in Berlin.

Sysdig logo
Sysdig

Senior Software Engineer - .NET and Go

Join Sysdig as a Senior Software Engineer in Berlin, focusing on .NET and Go for cloud security solutions.

Catalyze Group logo
Catalyze Group

Full Stack Developer with AI and API Expertise

Join Catalyze Group as a Full Stack Developer to build AI-powered grant-writing tools. Work with React, Django, and more in Amsterdam.

Attio logo
Attio

Senior Product Engineer [Rust & Typescript]

Join Attio as a Senior Product Engineer working with Rust & TypeScript to build innovative CRM features. Remote work available.

Instapro Group logo
Instapro Group

Senior Backend Engineer - Payments

Join Instapro Group as a Senior Backend Engineer in Berlin, focusing on PHP and payment systems in a hybrid work environment.

Uber logo
Uber

Staff Software Engineer - Backend

Join Uber as a Staff Software Engineer - Backend, focusing on membership systems. Work with Java, Python, C++, and more in Amsterdam.

Gorgias logo
Gorgias

Senior Full-Stack Engineer ReactJS/NodeJS

Join Gorgias as a Senior Full-Stack Engineer specializing in ReactJS and NodeJS, enhancing AI-powered ecommerce solutions.

Neon logo
Neon

Software Engineer, Storage (Rust, PostgreSQL)

Join Neon as a Software Engineer, Storage. Work with Rust and PostgreSQL to build scalable, reliable cloud-native database services.

GitLab logo
GitLab

Intermediate Backend Engineer - Ruby on Rails, PostgreSQL

Join GitLab as an Intermediate Backend Engineer to enhance CI platform performance using Ruby on Rails and PostgreSQL in a remote role.

Uber logo
Uber

Staff Software Engineer - Backend

Join Uber as a Staff Software Engineer - Backend to develop and enhance solutions for millions of members globally.

LEGALFLY logo
LEGALFLY

Back End Engineer with Node.js and TypeScript

Join LegalFly as a Back End Engineer to revolutionize legal AI with Node.js and TypeScript in a hybrid role in Ghent.

Nebius AI logo
Nebius AI

Senior Software Engineer (C++)

Join Nebius as a Senior Software Engineer (C++) to develop reliable cloud services in a hybrid work environment.

dataroots logo
dataroots

Expert Machine Learning Engineer

Join Dataroots as an Expert Machine Learning Engineer to design and deliver AI-powered solutions, focusing on machine learning models.

Uber logo
Uber

Software Engineer - Backend - Membership

Join Uber as a Backend Software Engineer to develop member-first experiences in a collaborative team, impacting global foundations.

DOWN Dating & Social Apps logo
DOWN Dating & Social Apps

Senior Backend Developer (Ruby on Rails)

Join our team as a Senior Backend Developer specializing in Ruby on Rails for a remote position with a social apps company.