NVIDIA logo

Machine Learning Engineer - LLM Fine-tuning and Performance

NVIDIA

Job Overview

As a Machine Learning Engineer (LLM Fine-tuning & Performance Specialist), you will play a crucial role in enhancing the accuracy and performance of fine-tuned Large Language Models (LLMs) for real-world applications. This groundbreaking opportunity allows you to work with cutting-edge ML technologies, collaborating closely with partners to drive innovation and ensure smooth integration and deployment of ML solutions. Your expertise will be essential in automating ML workflows and optimizing performance, making a lasting impact in the field of AI.

Key Responsibilities

  • Develop, implement, and manage processes for optimizing LLMs using domain-specific data, improving the model's capability to complete structured tasks.
  • Develop and implement techniques to measure LLM performance, defining and monitoring metrics such as recall, F1, perplexity, BLEU, ROUGE, etc.
  • Utilize tools like ONNX and TensorRT for optimizing model inference on specialized hardware.
  • Collaborate with ISVs and IHVs to understand their performance requirements and ensure successful model integration.
  • Use C++ to improve ML model performance, specifically in performance-critical systems, and provide technical mentorship to junior engineers.

Qualifications

  • 8+ years of validated experience in system software or related field.
  • M.S. or higher degree in Computer Science/Data Science/Engineering or equivalent experience.
  • Deep understanding of transformer architectures and large language models like GPT, BERT, T5, or similar.
  • Validated hands-on experience with fine-tuning LLMs for specific tasks and improving model performance using libraries like PyTorch.
  • Strong ability to assess and optimize model performance using relevant metrics and evaluation techniques.
  • Proficiency in crafting and automating ML workflows using tools such as Kubeflow, MLflow, or Airflow.
  • Excellent problem-solving skills, especially in debugging and improving LLM accuracy for real-world applications.
  • Proficiency in Python and knowledge of C++ for optimizing performance and developing system-level integrations.
  • Strong interpersonal skills for effective collaboration with internal teams and external partners.

Preferred Qualifications

  • Experience with LLM-based function and tool calling systems.
  • Understanding of distributed training for LLM fine-tuning and cloud platforms like Nvidia's NVCF.
  • Familiarity with hardware acceleration for ML workloads, including GPU and specialized hardware optimizations.

Compensation

The base salary range is $180,000 - $339,250 USD annually. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. You will also be eligible for equity and benefits.

Diversity and Inclusion

NVIDIA is committed to fostering a diverse work environment and is proud to be an equal opportunity employer. We highly value diversity in our current and future employees and do not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.

Application Process

NVIDIA accepts applications on an ongoing basis. Join us in shaping the future of AI and computing.

Job ID: JR1990034

Benefits
Extracted with AI

  • Equity
  • Diverse work environment

Similar jobs

Last update: 23 minutes ago

Snowflake logo
Snowflake

Senior Software Engineer - LLM

Join Snowflake as a Senior Software Engineer to build scalable machine learning platforms with LLMs, leveraging Python and TensorFlow.

Snowflake logo
Snowflake

Senior Software Engineer - LLM

Join Snowflake as a Senior Software Engineer to build scalable machine learning platforms with LLMs, leveraging Python and TensorFlow.

Bonfy.AI logo
Bonfy.AI

Senior Software Engineer - LLM

Join Bonfy.AI as a Senior Software Engineer to develop and optimize scalable machine learning models using Python, TensorFlow, and cloud platforms.

Stripe logo
Stripe

ML Engineering Manager, LLM Foundation

Lead ML engineering team at Stripe, focusing on LLMs and AI/ML systems. Drive innovation and manage high-impact projects.

Poggio logo
Poggio

Senior AI Engineer

Join Poggio as a Senior AI Engineer to innovate AI systems for enterprise sales, focusing on AI capabilities and system performance.

Unisys logo
Unisys

LLM Engineer

Join Unisys as an LLM Engineer to revolutionize ITSM with large language models. Work remotely in Vilnius, Lithuania.

MoonPay logo
MoonPay

Machine Learning Engineer

Join MoonPay as a Machine Learning Engineer to build and maintain ML infrastructure, collaborating with data scientists and cross-functional teams.

Pipedrive logo
Pipedrive

Machine Learning Engineer

Join Pipedrive as a Machine Learning Engineer in Tallinn to deploy and optimize ML models, ensuring performance and compliance.

Pass App logo
Pass App

Machine Learning Engineer with Web3 and NLP Experience

Join Pass App as a Machine Learning Engineer to build AI solutions for web3, focusing on NLP and data pipelines.

Arena logo
Arena

Machine Learning Scientist

Join Arena as a Machine Learning Scientist to develop AI systems using PyTorch and TensorFlow, focusing on real-world problem-solving.

micro1 logo
micro1

LLM Engineer with Python and JavaScript

Join us as an LLM Engineer to design and develop scalable software solutions using Python, JavaScript, and AWS in a remote setting.

DwellFi  logo
DwellFi

AI Solutions Software Engineer

Join DwellFi as an AI Solutions Software Engineer to develop innovative AI solutions using LangChain or Llama. Remote position in Palo Alto, CA.

CentML logo
CentML

Senior Software Engineer - LLM Inference

Join CentML as a Senior Software Engineer focusing on LLM Inference, leveraging AI, ML, and GPU technologies.

Blueprint logo
Blueprint

AI Engineer - Machine Learning and Robotics

Join Blueprint as an AI Engineer in Machine Learning and Robotics, focusing on scalable AI model training systems. Hybrid role in Redmond, WA.

Poggio logo
Poggio

Senior AI Engineer

Join Poggio as a Senior AI Engineer to revolutionize sales with AI. Work remotely, leverage LLMs, and enhance AI systems.

Unicon, Inc. logo
Unicon, Inc.

Senior Software Developer - AI/LLM

Join Unicon as a Senior Software Developer specializing in AI/LLM, working on cutting-edge AI technologies in a hybrid role in Gilbert, AZ.

Shopify logo
Shopify

Machine Learning Platform Engineer

Join Shopify as a Machine Learning Platform Engineer to build cutting-edge AI infrastructure and tools. Work remotely in a dynamic environment.

EQT Ventures logo
EQT Ventures

Fullstack LLM Engineer

Join EQT Ventures as a Fullstack LLM Engineer to drive AI innovation in venture capital. Work with cutting-edge AI tools and data-driven insights.

Perplexity logo
Perplexity

AI Research Engineer - LLM Training

Join Perplexity as an AI Research Engineer to enhance LLMs using AI, ML, and NLP in San Francisco.

NVIDIA logo
NVIDIA

Senior Full-Stack Software Engineer

Join NVIDIA as a Senior Full-Stack Software Engineer, working on cutting-edge web applications and infrastructure.

micro1 logo
micro1

Senior LLM Engineer

Join our team as a Senior LLM Engineer, leveraging AWS, Python, and JavaScript to develop scalable AI solutions.

Multiverse Computing logo
Multiverse Computing

Senior Machine Learning Engineer

Join Multiverse Computing as a Senior Machine Learning Engineer to lead LLM projects using quantum AI technologies.

Abridge logo
Abridge

Senior Full Stack Engineer, LLM APIs

Join Abridge as a Senior Full Stack Engineer to build innovative ML-powered solutions in healthcare AI, focusing on LLM APIs and cloud services.

Snap Inc. logo
Snap Inc.

Machine Learning Engineer

Join Snap Inc. as a Machine Learning Engineer in New York, NY. Develop AI models, collaborate with teams, and drive innovation.