NVIDIA logo

Machine Learning Engineer - LLM Fine-tuning and Performance

NVIDIA

Job Overview

As a Machine Learning Engineer (LLM Fine-tuning & Performance Specialist), you will play a crucial role in enhancing the accuracy and performance of fine-tuned Large Language Models (LLMs) for real-world applications. This groundbreaking opportunity allows you to work with cutting-edge ML technologies, collaborating closely with partners to drive innovation and ensure smooth integration and deployment of ML solutions. Your expertise will be essential in automating ML workflows and optimizing performance, making a lasting impact in the field of AI.

Key Responsibilities

  • Develop, implement, and manage processes for optimizing LLMs using domain-specific data, improving the model's capability to complete structured tasks.
  • Develop and implement techniques to measure LLM performance, defining and monitoring metrics such as recall, F1, perplexity, BLEU, ROUGE, etc.
  • Utilize tools like ONNX and TensorRT for optimizing model inference on specialized hardware.
  • Collaborate with ISVs and IHVs to understand their performance requirements and ensure successful model integration.
  • Use C++ to improve ML model performance, specifically in performance-critical systems, and provide technical mentorship to junior engineers.

Qualifications

  • 8+ years of validated experience in system software or related field.
  • M.S. or higher degree in Computer Science/Data Science/Engineering or equivalent experience.
  • Deep understanding of transformer architectures and large language models like GPT, BERT, T5, or similar.
  • Validated hands-on experience with fine-tuning LLMs for specific tasks and improving model performance using libraries like PyTorch.
  • Strong ability to assess and optimize model performance using relevant metrics and evaluation techniques.
  • Proficiency in crafting and automating ML workflows using tools such as Kubeflow, MLflow, or Airflow.
  • Excellent problem-solving skills, especially in debugging and improving LLM accuracy for real-world applications.
  • Proficiency in Python and knowledge of C++ for optimizing performance and developing system-level integrations.
  • Strong interpersonal skills for effective collaboration with internal teams and external partners.

Preferred Qualifications

  • Experience with LLM-based function and tool calling systems.
  • Understanding of distributed training for LLM fine-tuning and cloud platforms like Nvidia's NVCF.
  • Familiarity with hardware acceleration for ML workloads, including GPU and specialized hardware optimizations.

Compensation

The base salary range is $180,000 - $339,250 USD annually. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. You will also be eligible for equity and benefits.

Diversity and Inclusion

NVIDIA is committed to fostering a diverse work environment and is proud to be an equal opportunity employer. We highly value diversity in our current and future employees and do not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.

Application Process

NVIDIA accepts applications on an ongoing basis. Join us in shaping the future of AI and computing.

Job ID: JR1990034

Benefits
Extracted with AI

  • Equity
  • Diverse work environment

Similar jobs

Last update: 23 minutes ago

FoodLabs logo
FoodLabs

Senior C++ Computer Vision Engineer

Join a cutting-edge AI-DeepTech startup in Berlin as a Senior C++ Computer Vision Engineer. Work on world-class on-device AI technology.

Persona logo
Persona

LLM Backend Developer

Join Persona as a LLM Backend Developer, work remotely, and develop AI-driven backend systems for top startups.

dataroots logo
dataroots

Expert Machine Learning Engineer

Join Dataroots as an Expert Machine Learning Engineer to design and deliver AI-powered solutions, focusing on machine learning models.

DeepL logo
DeepL

Senior Backend Engineer C++

Join DeepL as a Senior Backend Engineer C++ to design and maintain scalable backend services using C++ and AI technologies.

Huawei Nederland logo
Huawei Nederland

Information Retrieval Algorithm Engineer

Join Huawei as an Information Retrieval Algorithm Engineer to develop cutting-edge AI technologies in Amsterdam.

yourfirm GmbH logo
yourfirm GmbH

Senior Fullstack Developer for AI-Driven Mission Technologies

Seeking a Senior Fullstack Developer for AI-driven mission technologies, focusing on Java, JavaScript, Python, and C++. Remote work available.

BCG X logo
BCG X

AI Engineer

Join BCG X as an AI Engineer in Milan, Italy. Develop AI solutions, partner with clients, and drive innovation in a dynamic environment.

Skytree logo
Skytree

Senior IoT Engineer

Join Skytree as a Senior IoT Engineer to lead IoT projects, focusing on Azure IoT solutions, edge computing, and data pipelines.

Huawei Nederland logo
Huawei Nederland

Senior ASR / TTS Researcher

Join Huawei's research center in Amsterdam as a Senior ASR/TTS Researcher, focusing on speech synthesis and AI.

Reaktor logo
Reaktor

Lead Developer with DevOps and Functional Programming

Join Reaktor as a Lead Developer in Amsterdam, focusing on DevOps, Functional Programming, and JavaScript in a hybrid work environment.

Computer Futures logo
Computer Futures

Cloud Data Engineer

Seeking a Cloud Data Engineer with expertise in AWS, Python, and CI/CD for a hybrid role in Hannover. Join our dynamic team!

LEGALFLY logo
LEGALFLY

Back End Engineer with Node.js and TypeScript

Join LegalFly as a Back End Engineer to revolutionize legal AI with Node.js and TypeScript in a hybrid role in Ghent.

Holland Casino logo
Holland Casino

Data Engineer with ETL and SQL Expertise

Join Holland Casino as a Data Engineer to build and maintain data infrastructure for the Online Casino, focusing on ETL, SQL, and cloud solutions.

NN Group logo
NN Group

Senior Full-stack Engineer (Angular, Node.js, TypeScript)

Join NN Group as a Senior Full-stack Engineer, leading software architecture and development with Angular, Node.js, and TypeScript.

Optiver logo
Optiver

Production Engineer

Join Optiver as a Production Engineer in Amsterdam to manage live trading environments and enhance system reliability and performance.

Uber logo
Uber

Staff Software Engineer, Fullstack, Capacity & Efficiency Engineering

Join Uber as a Staff Software Engineer in Amsterdam, focusing on fullstack development and capacity efficiency engineering.

i4talent detachering logo
i4talent detachering

Senior Data Engineer

Join i4talent as a Senior Data Engineer to lead cloud transitions and data projects. Enjoy a fun work environment with great benefits.

Darktrace logo
Darktrace

Solutions Engineer

Join Darktrace as a Solutions Engineer in Amsterdam, providing technical pre-sales and post-sales support in a hybrid work environment.

Reddit, Inc. logo
Reddit, Inc.

Senior Solutions Engineer

Join Reddit as a Senior Solutions Engineer in Amsterdam to support our growing advertising business with technical expertise and problem-solving skills.

Aiven logo
Aiven

Staff Software Engineer

Join Aiven as a Staff Software Engineer to develop cloud operations platforms using open-source technologies. Hybrid work in Berlin.

Catalyze Group logo
Catalyze Group

Full Stack Developer with AI and API Expertise

Join Catalyze Group as a Full Stack Developer to build AI-powered grant-writing tools. Work with React, Django, and more in Amsterdam.

Tibo Energy Management Software logo
Tibo Energy Management Software

Cloud Engineer

Join Tibo Energy as a Cloud Engineer to drive energy transition with cloud architecture skills in a dynamic team.

Carbon13 logo
Carbon13

Cofounder - Full Stack Developer/Data Scientist for Climatech Startup

Join Carbon13 as a cofounder in climate tech, leveraging AI, data science, and software development to combat climate change.

Gorgias logo
Gorgias

Senior Full-Stack Engineer ReactJS/NodeJS

Join Gorgias as a Senior Full-Stack Engineer specializing in ReactJS and NodeJS, enhancing AI-powered ecommerce solutions.