Job Overview
As a Machine Learning Engineer (LLM Fine-tuning & Performance Specialist), you will play a crucial role in enhancing the accuracy and performance of fine-tuned Large Language Models (LLMs) for real-world applications. This groundbreaking opportunity allows you to work with cutting-edge ML technologies, collaborating closely with partners to drive innovation and ensure smooth integration and deployment of ML solutions. Your expertise will be essential in automating ML workflows and optimizing performance, making a lasting impact in the field of AI.
Key Responsibilities
- Develop, implement, and manage processes for optimizing LLMs using domain-specific data, improving the model's capability to complete structured tasks.
- Develop and implement techniques to measure LLM performance, defining and monitoring metrics such as recall, F1, perplexity, BLEU, ROUGE, etc.
- Utilize tools like ONNX and TensorRT for optimizing model inference on specialized hardware.
- Collaborate with ISVs and IHVs to understand their performance requirements and ensure successful model integration.
- Use C++ to improve ML model performance, specifically in performance-critical systems, and provide technical mentorship to junior engineers.
Qualifications
- 8+ years of validated experience in system software or related field.
- M.S. or higher degree in Computer Science/Data Science/Engineering or equivalent experience.
- Deep understanding of transformer architectures and large language models like GPT, BERT, T5, or similar.
- Validated hands-on experience with fine-tuning LLMs for specific tasks and improving model performance using libraries like PyTorch.
- Strong ability to assess and optimize model performance using relevant metrics and evaluation techniques.
- Proficiency in crafting and automating ML workflows using tools such as Kubeflow, MLflow, or Airflow.
- Excellent problem-solving skills, especially in debugging and improving LLM accuracy for real-world applications.
- Proficiency in Python and knowledge of C++ for optimizing performance and developing system-level integrations.
- Strong interpersonal skills for effective collaboration with internal teams and external partners.
Preferred Qualifications
- Experience with LLM-based function and tool calling systems.
- Understanding of distributed training for LLM fine-tuning and cloud platforms like Nvidia's NVCF.
- Familiarity with hardware acceleration for ML workloads, including GPU and specialized hardware optimizations.
Compensation
The base salary range is $180,000 - $339,250 USD annually. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. You will also be eligible for equity and benefits.
Diversity and Inclusion
NVIDIA is committed to fostering a diverse work environment and is proud to be an equal opportunity employer. We highly value diversity in our current and future employees and do not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.
Application Process
NVIDIA accepts applications on an ongoing basis. Join us in shaping the future of AI and computing.
Job ID: JR1990034
Benefits Extracted with AI
- Equity
- Diverse work environment
Similar jobs
Last update: 23 minutes ago
Senior Deep Learning Performance Software Engineer
Senior role optimizing deep learning performance at NVIDIA, involving Python, HPC, and AI technologies.
Senior Deep Learning Engineer
Join NVIDIA as a Senior Deep Learning Engineer to optimize AI performance using PyTorch, TensorFlow, and more in Berlin.
Senior Artificial Intelligence Algorithms Engineer
Senior AI Algorithms Engineer role focusing on AI/DL, data analytics, and machine learning in Santa Clara, CA.
Senior Software Engineer - LLM Inference
Join CentML as a Senior Software Engineer focusing on LLM Inference, leveraging AI, ML, and GPU technologies.
Senior Machine Learning Performance Engineer
Join Wayve as a Senior Machine Learning Performance Engineer to optimize large-scale training jobs and improve GPU efficiency.
Trustworthy AI Software Engineer
Join NVIDIA as a Trustworthy AI Software Engineer in Santa Clara, CA. Develop cutting-edge AI tools and models in a multidisciplinary team.
Senior Full Stack Engineer, Deep Learning Algorithms
Join NVIDIA as a Senior Full Stack Engineer to build software for AI, focusing on deep learning algorithms and high-performance computing.
Senior Software Engineer - LLM
Join Snowflake as a Senior Software Engineer to build scalable machine learning platforms with LLMs, leveraging Python and TensorFlow.
Senior Engineering Manager, Robotics and ML Applications
Senior Engineering Manager for Robotics & ML at NVIDIA, leading innovative projects in AI and robotics applications.
Artificial Intelligence and Deep Learning Intern
Join NVIDIA's 2025 AI and Deep Learning Internship to work on cutting-edge projects in AI, robotics, and more.
Senior Research Scientist, Multimodal Foundation Models and Robotics
Senior Research Scientist role at NVIDIA, focusing on AI and robotics, based in Santa Clara, CA.
Machine Learning Engineer
Join Refuel as a Machine Learning Engineer to develop core ML algorithms, improve datasets, and collaborate on product scalability.
Senior Software Engineer - LLM
Join Snowflake as a Senior Software Engineer to build scalable machine learning platforms with LLMs, leveraging Python and TensorFlow.
Lead Research Scientist - Large Language Models (LLMs)
Lead Research Scientist role focusing on AI and Machine Learning, driving innovation in smart app development with LLMs.
Machine Learning Engineer, Cloud AI
Join Qualcomm as a Machine Learning Engineer to develop AI solutions for mobile, edge, auto, and IoT products.
Software Engineer, Machine Learning Infrastructure
Join Tesla as a Software Engineer in ML Infrastructure to optimize and scale neural network training with Python, C++, and PyTorch.
Founding AI Engineer
Join LlamaIndex as a Founding AI Engineer to shape the future of LLM applications with cutting-edge AI projects.
Deep Learning Computer Architecture Intern
Join NVIDIA as a Deep Learning Computer Architecture Intern. Work on cutting-edge AI projects with a leading company in accelerated computing.
Software Machine Learning (ML) Architect
Join AMD as a Software ML Architect to design and implement AI solutions for next-gen GPU products.
Machine Learning Compiler Engineer
Join Qualcomm as a Machine Learning Compiler Engineer to optimize ML compilers for cutting-edge accelerators.
Founding Applied AI Engineer
Join LlamaIndex as a Founding Applied AI Engineer to build and deploy LLM applications. Competitive salary and equity offered.
Senior Machine Learning Researcher
Join Lambda as a Senior Machine Learning Researcher to develop AI models and optimize ML workloads. Work in San Jose with flexible benefits.
Senior Applied Scientist - Large Language Models
Join Amazon as a Senior Applied Scientist to develop cutting-edge AI agents using Large Language Models in Sunnyvale, CA.
Senior Machine Learning Engineer
Join Truva as a Senior Machine Learning Engineer to innovate in generative AI and machine learning with LLMs and NLP.