Vultr logo

AI/ML Infrastructure Engineer

Vultr

Join Vultr as an AI/ML Infrastructure Engineer

Who We Are

Vultr is on a mission to make high-performance cloud computing easy to use, affordable, and locally accessible for businesses and developers around the world. With 32 cloud data center locations globally, Vultr has served over 1.5 million customers across 185 countries with flexible, scalable, global Cloud Compute, Cloud GPU, Bare Metal, and Cloud Storage solutions. Founded by David Aninowsky and completely bootstrapped, Vultr has become the world’s largest privately-held cloud computing company without ever raising equity financing.

Why Vultr

Simply put, Vultr is committed to providing businesses worldwide with the best price-to-performance of any cloud computing platform. Our global reach of data centers and strategic new partnerships provide the foundation to maximize the impact of our existing services, new product improvements, and releases, which in turn, is a catalyst for your own success. Vultr is taking flight, and this is your opportunity to leave your mark on the future of Cloud Infrastructure!

Vultr Cares

  • A 100% remote work environment + a company-wide virtual get together
  • 401(k) plan that matches 100% up to 4% with immediate vesting
  • Professional Development Reimbursement of $2,500 each year
  • 11 Holidays + Paid Time Off Accrual + Rollover Plan + take off your birthday!
  • Commitment matters to Vultr! Increased PTO at 3 year anniversary + 1 month sabbatical at 5 year anniversary + Anniversary Bonus each year
  • $500 first year remote office setup + $400 each year following for new equipment
  • Monthly internet reimbursement up to $75
  • $50 per month for a gym membership

The Role

The Engineering team is a central pillar of our growth strategy, and we are looking for an AI/ML Infrastructure Engineer to help build and support our bare metal and GPU-based product offerings. You and your team will have ownership over the setup and provisioning of our GPU-based and bare metal systems and help drive engineering and operational excellence around our infrastructure. Our team’s mission is to provide a fast, performant, and stable infrastructure for all of our customers.

What To Expect

  • Developing and maintaining infrastructure in bare metal and containerized environments
  • Work directly with our networking team to build scalable and supportable GPU clusters
  • Ensure excellent customer experience by ensuring consistent and reliable provisioning of GPU infrastructure
  • Build and maintain test automation of GPU-based products to ensure fast and reliable provisioning
  • Implement and maintain GPU-based solutions to meet the needs of diverse applications and computational workloads
  • Conduct in-depth benchmarking, performance testing, and troubleshooting of GPU systems to identify and resolve any hardware or software limitations
  • Working with vendors to get all supported drivers and packages
  • Working with vendors on any bugs, performance-related issues, hardware problems, and reference architectures
  • Address any hardware, software, or performance issues promptly, coordinating with vendors, technical support, and internal teams as required

Our Ideal Candidate Will Have

  • Hands-on experience working with current, high-performance GPUs, primarily NVIDIA products (e.g. NVLink, Infiniband, GRID drivers, vGPU and NVAIE)
  • In-depth, hands-on experience working with and automating bare metal internals including BIOS, BMC, firmware, NICs, Redfish/IPMI, PCIe
  • Experience with rail optimization across multiple clusters and architectures
  • Experience with Linux, package management and device drivers
  • Experience with commercial firmware
  • Experience with Python, Bash, and PHP
  • Experience with Machine Learning software

Compensation

$120,000 - $150,000

Vultr is committed to an inclusive workforce where diversity is celebrated and supported. All employment decisions at Vultr are based on business needs, job requirements, and individual qualifications.

Vultr regards the lawful and correct use of personal information as important to the accomplishment of our objectives, to the success of our operations and to maintaining confidence between those with whom we deal and ourselves. As such the use of various key privacy controls enables Vultr’s treatment of personal information to meet current regulatory guidelines and laws.

Workforce members have the right under US state law where and when applicable and certain other privacy and data protection laws, as applicable, to: fair and equal treatment, knowing what personal data we gather and retain, for what purpose, and the ability to access and/or delete such data. You also have the right to opt out of communications from Vultr and approved third- parties at any time.

Benefits
Extracted with AI

  • 401(k)
  • Professional Development Reimbursement
  • Paid Time Off
  • Remote Work Environment
  • Internet Reimbursement
  • Gym Membership

Similar jobs

Last update: 23 minutes ago

Skild AI logo
Skild AI

Software Engineer, AI Training and Infrastructure

Join Skild AI as a Software Engineer to develop AI training infrastructure. Work with cutting-edge technologies in a dynamic team.

Keysight Technologies logo
Keysight Technologies

Machine Learning/AI Engineer

Join Keysight Technologies as a Machine Learning/AI Engineer to develop and optimize AI/ML models for EDA applications.

Voltai logo
Voltai

Software Engineer - AI Training Data

Join Voltai as a Software Engineer to build and optimize AI training data systems, focusing on semiconductor datasets.

Intel Corporation logo
Intel Corporation

Cloud Solution Engineer - GPU/Gaudi AI Accelerator

Join Intel as a Cloud Solution Engineer focusing on GPU/Gaudi AI Accelerator technologies for AI-driven applications.

GlobalLogic logo
GlobalLogic

Senior Machine Learning/Generative AI Engineer

Join GlobalLogic as a Senior ML/GenAI Engineer to develop and optimize AI chatbots using LLMs. Remote work available.

Lyra Health logo
Lyra Health

Senior AI/ML Infrastructure Engineer

Join Lyra Health as a Senior AI/ML Infrastructure Engineer to build scalable ML infrastructure. Work remotely with cutting-edge technologies.

Voltai logo
Voltai

Full Stack Engineer with JavaScript, React, and Python

Join Voltai as a Full Stack Engineer to build AI-driven web applications using JavaScript, React, and Python.

micro1 logo
micro1

Senior API Developer with AI and Python Expertise

Join us as a Senior API Developer to build AI-driven solutions using Python and Golang. Work remotely with top-tier companies.

Accrete AI logo
Accrete AI

Backend Engineer with Machine Learning Focus

Join Accrete AI as a Backend Engineer with a focus on machine learning, building scalable AI solutions.

G-P logo
G-P

Senior Full Stack Developer (AI Domain)

Join G-P as a Senior Full Stack Developer in the AI domain, focusing on frontend, backend, and cloud infrastructure.

Workiva logo
Workiva

Senior Software Engineer - Frontend with AI/ML Focus

Senior Software Engineer role focused on frontend development with AI/ML, using Dart and TypeScript, offering remote work.

Nexla logo
Nexla

Full Stack AI Engineer

Join Nexla as a Full Stack AI Engineer to develop AI-powered applications, enhance platform capabilities, and work with cutting-edge technologies.

Vanta logo
Vanta

Senior Software Engineer, AI Platform

Join Vanta as a Senior Software Engineer, AI Platform, to shape AI offerings and improve ML systems.

Ampere logo
Ampere

Senior Applied AI Model Researcher

Join Ampere as a Senior Applied AI Model Researcher to lead AI model development and optimization in a remote role.

DwellFi  logo
DwellFi

AI Solutions Software Engineer

Join DwellFi as an AI Solutions Software Engineer to develop innovative AI solutions using LangChain or Llama.

Resolve AI logo
Resolve AI

AI Engineer with LLM Expertise

Join Resolve AI as an AI Engineer in San Francisco to build AI-powered workflows with LLM expertise.

Vectra AI logo
Vectra AI

Senior Software Engineer - Python and Cloud

Join Vectra AI as a Senior Software Engineer in Dublin, focusing on Python, cloud, and cybersecurity.

Aviatrix logo
Aviatrix

Software Engineer (MTS) - Observability

Join Aviatrix as a Software Engineer (MTS) in Observability, focusing on network monitoring and cloud technologies.

Accrete AI logo
Accrete AI

Backend Engineer with Machine Learning Focus (Early Career)

Join Accrete AI as a Backend Engineer focusing on ML, working with Python, REST APIs, and cloud platforms. Early career role in New York.

Vectra AI logo
Vectra AI

Senior Software Engineer - Python and Cloud

Join Vectra AI as a Senior Software Engineer in Dublin, focusing on Python, cloud, and cybersecurity.

webAI logo
webAI

Staff Backend Engineer - Runtime Team Lead

Join webAI as a Staff Backend Engineer to lead the Runtime Team, focusing on distributed systems and high-performance engineering.

DwellFi  logo
DwellFi

AI Solutions Software Engineer

Join DwellFi as an AI Solutions Software Engineer to develop innovative AI solutions using LangChain or Llama.

micro1 logo
micro1

Machine Learning Engineer with AI/ML Experience

Join us as a Machine Learning Engineer to develop AI/ML models and applications. Work remotely with top-tier companies.

BIP logo
BIP

AI Engineer

Join BIP as an AI Engineer in Milan, leveraging AI, ML, and data science to create scalable solutions.