AI/ML Infrastructure Engineer

Join Vultr as an AI/ML Infrastructure Engineer

Who We Are

Vultr is on a mission to make high-performance cloud computing easy to use, affordable, and locally accessible for businesses and developers around the world. With 32 cloud data center locations globally, Vultr has served over 1.5 million customers across 185 countries with flexible, scalable, global Cloud Compute, Cloud GPU, Bare Metal, and Cloud Storage solutions. Founded by David Aninowsky and completely bootstrapped, Vultr has become the world’s largest privately-held cloud computing company without ever raising equity financing.

Why Vultr

Simply put, Vultr is committed to providing businesses worldwide with the best price-to-performance of any cloud computing platform. Our global reach of data centers and strategic new partnerships provide the foundation to maximize the impact of our existing services, new product improvements, and releases, which in turn, is a catalyst for your own success. Vultr is taking flight, and this is your opportunity to leave your mark on the future of Cloud Infrastructure!

Vultr Cares

A 100% remote work environment + a company-wide virtual get together
401(k) plan that matches 100% up to 4% with immediate vesting
Professional Development Reimbursement of $2,500 each year
11 Holidays + Paid Time Off Accrual + Rollover Plan + take off your birthday!
Commitment matters to Vultr! Increased PTO at 3 year anniversary + 1 month sabbatical at 5 year anniversary + Anniversary Bonus each year
$500 first year remote office setup + $400 each year following for new equipment
Monthly internet reimbursement up to $75
$50 per month for a gym membership

The Role

The Engineering team is a central pillar of our growth strategy, and we are looking for an AI/ML Infrastructure Engineer to help build and support our bare metal and GPU-based product offerings. You and your team will have ownership over the setup and provisioning of our GPU-based and bare metal systems and help drive engineering and operational excellence around our infrastructure. Our team’s mission is to provide a fast, performant, and stable infrastructure for all of our customers.

What To Expect

Developing and maintaining infrastructure in bare metal and containerized environments
Work directly with our networking team to build scalable and supportable GPU clusters
Ensure excellent customer experience by ensuring consistent and reliable provisioning of GPU infrastructure
Build and maintain test automation of GPU-based products to ensure fast and reliable provisioning
Implement and maintain GPU-based solutions to meet the needs of diverse applications and computational workloads
Conduct in-depth benchmarking, performance testing, and troubleshooting of GPU systems to identify and resolve any hardware or software limitations
Working with vendors to get all supported drivers and packages
Working with vendors on any bugs, performance-related issues, hardware problems, and reference architectures
Address any hardware, software, or performance issues promptly, coordinating with vendors, technical support, and internal teams as required

Our Ideal Candidate Will Have

Hands-on experience working with current, high-performance GPUs, primarily NVIDIA products (e.g. NVLink, Infiniband, GRID drivers, vGPU and NVAIE)
In-depth, hands-on experience working with and automating bare metal internals including BIOS, BMC, firmware, NICs, Redfish/IPMI, PCIe
Experience with rail optimization across multiple clusters and architectures
Experience with Linux, package management and device drivers
Experience with commercial firmware
Experience with Python, Bash, and PHP
Experience with Machine Learning software

Compensation

$120,000 - $150,000

Vultr is committed to an inclusive workforce where diversity is celebrated and supported. All employment decisions at Vultr are based on business needs, job requirements, and individual qualifications.

Vultr regards the lawful and correct use of personal information as important to the accomplishment of our objectives, to the success of our operations and to maintaining confidence between those with whom we deal and ourselves. As such the use of various key privacy controls enables Vultr’s treatment of personal information to meet current regulatory guidelines and laws.

Workforce members have the right under US state law where and when applicable and certain other privacy and data protection laws, as applicable, to: fair and equal treatment, knowing what personal data we gather and retain, for what purpose, and the ability to access and/or delete such data. You also have the right to opt out of communications from Vultr and approved third- parties at any time.

Benefits
Extracted with AI

401(k)
Professional Development Reimbursement
Paid Time Off
Remote Work Environment
Internet Reimbursement
Gym Membership

Similar jobs

Last update: 23 minutes ago

Helm.ai

Remote Software Engineer - Machine Learning and Cloud Infrastructure

Join Helm.ai as a Remote Software Engineer to develop ML tools, build cloud infrastructure, and work on AI technology.

9 months ago

Frontend Engineer with React and CSS Frameworks

Join Vultr as a Frontend Engineer to build marketing collateral using React and CSS frameworks.

9 months ago

Frontend Engineer with React.js and Bootstrap

Join Vultr as a Frontend Engineer to build and enhance our public-facing website using React.js and Bootstrap.

9 months ago

Software Engineer (AI/ML Production Engineering)

Join WP Engine as a Software Engineer focusing on AI/ML Production Engineering. Enhance our platform with machine learning and AIOps.

10 months ago

Senior Software Engineer (AI/ML)

Join DigitalOcean as a Senior Software Engineer (AI/ML) to build AI/ML features using TypeScript, React, and GraphQL. Remote role with competitive benefits.

a year ago

Machine Learning Engineer with AI/ML Experience

Join us as a Machine Learning Engineer to develop AI/ML models and applications. Work remotely with top-tier companies.

9 months ago

AI Engineer with Full-Stack Development Skills

Join Vizcom as an AI Engineer to develop cutting-edge AI models and integrate them into our design platform. Remote work, full-stack skills required.

9 months ago

Senior Software Engineer - Infrastructure

Join Voxel as a Senior Software Engineer - Infrastructure to build cloud infrastructure and distributed systems for AI-driven workplace safety.

10 months ago

Software Engineer - AI

Join Uplimit as a Software Engineer - AI to build innovative AI-driven learning solutions. Work on cutting-edge projects in a hybrid environment.

9 months ago

Cloud Solution Engineer - GPU/Gaudi AI Accelerator

Join Intel as a Cloud Solution Engineer focusing on GPU/Gaudi AI Accelerator technologies for AI-driven applications.

8 months ago

AI Infrastructure Engineer

Join as an AI Infrastructure Engineer to design, deploy, and maintain cloud solutions for AI workloads. Work with Fortune 500 companies globally.

a year ago

Software Engineer, AI Training and Infrastructure

Join Skild AI as a Software Engineer to develop AI training infrastructure. Work with cutting-edge technologies in a dynamic team.

8 months ago

Senior AI Engineer with Focus on Large Language Models

Join Vendr as a Senior AI Engineer focusing on LLMs, leveraging tools like OpenAI, Anthropic, and Hugging Face.

10 months ago

Senior Software Engineer (AI/ML)

Join DigitalOcean as a Senior Software Engineer (AI/ML) to build AI/ML features using TypeScript, React, and GraphQL. Remote role with competitive benefits.

a year ago

Senior Software Engineer, AI

Join as a Senior Software Engineer, AI, to innovate AI features in a remote-friendly environment. Enhance CRM with cutting-edge AI.

10 months ago

AMD

AI/ML Software Engineer

Join AMD as an AI/ML Software Engineer to lead next-gen architecture development in a remote role. Strong C++, Python, and ML framework skills required.

10 months ago