Company Overview
Deepgram is a foundational AI company on a mission to transform human-machine interaction using natural language. We give any developer access to the fastest, most powerful voice AI models including speech-to-text, text-to-speech, and spoken language understanding with just an API call. From transcription to sentiment analysis to voice synthesis, Deepgram is the preferred partner for builders of voice AI applications.
Opportunity
We are seeking a backend engineer focused on AI inference to join the team powering Deepgram’s core speech inference APIs. You’ll implement and optimize inference code, experiment with cutting-edge technologies, and develop, maintain, and deploy the stack of services behind our blazing-fast, massive-throughput inference system. This role blends work on backend services and systems with domain specialty in neural networks and GPU programming. Our team owns the applications that serve api.deepgram.com and empowers builders of innovative speech products by focusing on a world-class combination of reliability, efficiency, and latency.
What You’ll Do
- Implement inference for novel model architectures developed by Deepgram’s trailblazing research team
- Develop, test, and deploy application code for massive-scale production services
- Debug complex system issues that include networking, scheduling, and high-performance computing interactions
- Build tooling for internal analysis and benchmarking to identify opportunities for efficiency improvements
- Experiment with optimization techniques for ML workloads on NVIDIA GPUs and ship the key wins to prod
You’ll Love This Role If You
- Think of yourself as a generalist while enjoying learning deeply in specific areas, causing you to go from debugging a customer issue one day to designing an algorithm the next
- Like sipping piña coladas and getting caught in the rain
- Enjoy taking ownership of features from early collaborations with researchers through testing in production
- Love getting nitty-gritty with profilers, hardware architectures, and inference algorithms
- Want to work within the context of a humble, collaborative team that collectively owns mission-critical production services
It’s Important to Us That You Have
- The ability to work collaboratively in a fast-paced environment and adapt to changing priorities
- Proven industry experience building and shipping production services
- Strong confidence in a lower-level language like C, C++, or Rust
- Experience slicing large projects or initiatives into smaller experiments or incremental improvements
- Expertise in a ML framework like Torch or Tensorflow
- Experience with GPU programming using tools like CUDA or libraries like cuDNN, cuBLAS, etc.
It Would Be Great If You Also Had
- Extensive professional experience with Rust and C++
- Experience optimizing ML workloads in production
- Familiarity with GPU hardware architecture and its impact on inference pipelines
Benefits Extracted with AI
- Remote work flexibility
Similar jobs
Last update: 23 minutes ago
Senior Backend Engineer C++
Join DeepL as a Senior Backend Engineer C++ to design and maintain scalable backend services using C++ and AI technologies.
Senior Backend Engineer - Java, Rust, Go
Join Together AI as a Senior Backend Engineer in Amsterdam. Work with Java, Rust, and Go to build scalable backend systems.
Senior C++ Computer Vision Engineer
Join a cutting-edge AI-DeepTech startup in Berlin as a Senior C++ Computer Vision Engineer. Work on world-class on-device AI technology.
Senior Fullstack Developer for AI-Driven Mission Technologies
Seeking a Senior Fullstack Developer for AI-driven mission technologies, focusing on Java, JavaScript, Python, and C++. Remote work available.
Entry Level Back-End Software Engineer (Java)
Join Grammarly as an Entry Level Back-End Software Engineer in Berlin. Work with Java, AWS, and more in a hybrid environment.
Senior Software Engineer - Backend Development
Join Sysdig as a Senior Software Engineer to develop scalable backend services using Go, RESTful APIs, and microservices in a hybrid work environment.
Senior Full Stack Engineer - Climate Tech - Rust & TypeScript
Join Climatiq as a Senior Full Stack Engineer to develop climate tech solutions using Rust and TypeScript. Remote work available.
LLM Backend Developer
Join Persona as a LLM Backend Developer, work remotely, and develop AI-driven backend systems for top startups.
Senior Backend Engineer (Go)
Join Nebius as a Senior Backend Engineer (Go) to develop fault-tolerant cloud services in a hybrid work environment.
Staff Software Engineer
Join Aiven as a Staff Software Engineer to develop cloud operations platforms using open-source technologies. Hybrid work in Berlin.
Senior Software Engineer - .NET and Go
Join Sysdig as a Senior Software Engineer in Berlin, focusing on .NET and Go for cloud security solutions.
Full Stack Developer with AI and API Expertise
Join Catalyze Group as a Full Stack Developer to build AI-powered grant-writing tools. Work with React, Django, and more in Amsterdam.
Senior Product Engineer [Rust & Typescript]
Join Attio as a Senior Product Engineer working with Rust & TypeScript to build innovative CRM features. Remote work available.
Senior Backend Engineer - Payments
Join Instapro Group as a Senior Backend Engineer in Berlin, focusing on PHP and payment systems in a hybrid work environment.
Staff Software Engineer - Backend
Join Uber as a Staff Software Engineer - Backend, focusing on membership systems. Work with Java, Python, C++, and more in Amsterdam.
Senior Full-Stack Engineer ReactJS/NodeJS
Join Gorgias as a Senior Full-Stack Engineer specializing in ReactJS and NodeJS, enhancing AI-powered ecommerce solutions.
Software Engineer, Storage (Rust, PostgreSQL)
Join Neon as a Software Engineer, Storage. Work with Rust and PostgreSQL to build scalable, reliable cloud-native database services.
Intermediate Backend Engineer - Ruby on Rails, PostgreSQL
Join GitLab as an Intermediate Backend Engineer to enhance CI platform performance using Ruby on Rails and PostgreSQL in a remote role.
Staff Software Engineer - Backend
Join Uber as a Staff Software Engineer - Backend to develop and enhance solutions for millions of members globally.
Back End Engineer with Node.js and TypeScript
Join LegalFly as a Back End Engineer to revolutionize legal AI with Node.js and TypeScript in a hybrid role in Ghent.
Senior Software Engineer (C++)
Join Nebius as a Senior Software Engineer (C++) to develop reliable cloud services in a hybrid work environment.
Expert Machine Learning Engineer
Join Dataroots as an Expert Machine Learning Engineer to design and deliver AI-powered solutions, focusing on machine learning models.
Software Engineer - Backend - Membership
Join Uber as a Backend Software Engineer to develop member-first experiences in a collaborative team, impacting global foundations.
Senior Backend Developer (Ruby on Rails)
Join our team as a Senior Backend Developer specializing in Ruby on Rails for a remote position with a social apps company.