Company Overview
Deepgram is a foundational AI company on a mission to transform human-machine interaction using natural language. We give any developer access to the fastest, most powerful voice AI models including speech-to-text, text-to-speech, and spoken language understanding with just an API call. From transcription to sentiment analysis to voice synthesis, Deepgram is the preferred partner for builders of voice AI applications.
Opportunity
We are seeking a backend engineer focused on AI inference to join the team powering Deepgram’s core speech inference APIs. You’ll implement and optimize inference code, experiment with cutting-edge technologies, and develop, maintain, and deploy the stack of services behind our blazing-fast, massive-throughput inference system. This role blends work on backend services and systems with domain specialty in neural networks and GPU programming. Our team owns the applications that serve api.deepgram.com and empowers builders of innovative speech products by focusing on a world-class combination of reliability, efficiency, and latency.
What You’ll Do
- Implement inference for novel model architectures developed by Deepgram’s trailblazing research team
- Develop, test, and deploy application code for massive-scale production services
- Debug complex system issues that include networking, scheduling, and high-performance computing interactions
- Build tooling for internal analysis and benchmarking to identify opportunities for efficiency improvements
- Experiment with optimization techniques for ML workloads on NVIDIA GPUs and ship the key wins to prod
You’ll Love This Role If You
- Think of yourself as a generalist while enjoying learning deeply in specific areas, causing you to go from debugging a customer issue one day to designing an algorithm the next
- Like sipping piña coladas and getting caught in the rain
- Enjoy taking ownership of features from early collaborations with researchers through testing in production
- Love getting nitty-gritty with profilers, hardware architectures, and inference algorithms
- Want to work within the context of a humble, collaborative team that collectively owns mission-critical production services
It’s Important to Us That You Have
- The ability to work collaboratively in a fast-paced environment and adapt to changing priorities
- Proven industry experience building and shipping production services
- Strong confidence in a lower-level language like C, C++, or Rust
- Experience slicing large projects or initiatives into smaller experiments or incremental improvements
- Expertise in a ML framework like Torch or Tensorflow
- Experience with GPU programming using tools like CUDA or libraries like cuDNN, cuBLAS, etc.
It Would Be Great If You Also Had
- Extensive professional experience with Rust and C++
- Experience optimizing ML workloads in production
- Familiarity with GPU hardware architecture and its impact on inference pipelines
Benefits Extracted with AI
- Remote work flexibility
Similar jobs
Last update: 23 minutes ago
Software Engineer, Front End
Join Deepgram as a Front End Software Engineer to build innovative voice AI applications using Elm and React.
Senior/Principal Software Engineer
Join Groq as a Senior/Principal Software Engineer to design and develop scalable software for AI inference technology.
Software Engineer, Developer Experience
Join Groq as a Software Engineer to enhance APIs for fast AI applications, focusing on optimization and high-performance computing.
Senior Software Engineer (AI/ML)
Join DigitalOcean as a Senior Software Engineer (AI/ML) to build AI/ML features using TypeScript, React, and GraphQL. Remote role with competitive benefits.
Senior Software Engineer, Rust
Senior Software Engineer specializing in Rust, remote role in Italy, focusing on high-performance systems and cloud services.
Senior Backend Engineer
Join Hamming AI as a Senior Backend Engineer to scale AI voice agent systems. Work with Next.JS, TypeScript, Python, and more.
Senior Software Engineer - Generative AI, AGI Inference Engine
Join Amazon as a Senior Software Engineer to advance Generative AI capabilities, focusing on high-performance inference.
Senior Backend Engineer C++
Join DeepL as a Senior Backend Engineer C++ to design and maintain scalable backend services using C++ and AI technologies.
AI Inference Engineer
Join Perplexity as an AI Inference Engineer in San Francisco, focusing on large-scale ML model deployments and system reliability.
Senior Software Engineer - Generative AI Voice Technology
Senior Software Engineer role in Generative AI Voice Technology, pioneering large voice models with competitive benefits.
Senior AI Engineer
Senior AI Engineer role focusing on Go, TypeScript, Python in a remote-first, innovative AI platform.
Senior Backend Engineer, Forward Deployed
Senior Backend Engineer role focusing on infrastructure, customer solutions, and deployment in San Francisco.
Software Engineer - Developer Foundations
Join AssemblyAI as a Software Engineer to build scalable backend systems and developer tools. Work remotely with cutting-edge AI technology.
Senior Software Engineer (AI/ML)
Join DigitalOcean as a Senior Software Engineer (AI/ML) to build AI/ML features using TypeScript, React, and GraphQL. Remote role with competitive benefits.
Senior Software Engineer - Media Infrastructure
Join Discord as a Senior Software Engineer in Media Infrastructure, focusing on backend development, Python, Rust, and media management.
Senior Backend Engineer - Java, Rust, Go
Join Together AI as a Senior Backend Engineer in Amsterdam. Work with Java, Rust, and Go to build scalable backend systems.
Senior Software Engineer - Generative AI
Join Amazon as a Senior Software Engineer in Generative AI, focusing on high-performance inference capabilities.
Senior Software Engineer, Machine Learning Platform
Join Discord as a Senior Software Engineer in Machine Learning Platform, working remotely with AWS, Generative AI, and real-time data.
Senior Software Engineer - Backend (Founding Member)
Senior Backend Software Engineer role focusing on innovative AI-driven tools, remote work, competitive benefits.
Senior Machine Learning Engineer
Join Sourcegraph as a Senior ML Engineer to revolutionize code intelligence with AI and NLP.
Senior Software Engineer - LLM Inference
Join CentML as a Senior Software Engineer focusing on LLM Inference, leveraging AI, ML, and GPU technologies.
Senior Software Engineer, Machine Learning Infrastructure
Join Scale AI as a Senior Software Engineer in Machine Learning Infrastructure, focusing on backend system design and ML Infrastructure.
Senior Backend Engineer
Senior Backend Engineer at Speak, focusing on AI-driven language learning applications.
Senior Machine Learning Engineer - Scaling and Performance Optimization
Join InstaDeep as a Senior Machine Learning Engineer focusing on scaling and performance optimization in Paris. Work with cutting-edge AI technologies.