Senior Research Engineer - Enterprise Products
We are now looking for a Senior Research Engineer passionate about generative AI inference. Are you excited to change the way people infuse AI into products and services? NVIDIA is at the forefront of generative AI models, from language to images, and provides building blocks that democratize AI and make generative AI easy to develop, integrate, and deploy. Our team is dedicated to developing optimized inference technologies to support our growing generative AI needs. We contribute to every step of the machine learning lifecycle: conceptualization, applied research, engineering for optimized inference, and deployment. In this role, you will collaborate with research teams, engineers, and the open-source community, and implement optimized LLM algorithms.
What You Will Be Doing
Develop new models and algorithms focused on Large Language Models, Natural Language Processing, and Deep Learning.
Design and implement multi-node serving architectures for disaggregated serving and distributed LLM inference.
Optimize inference serving systems for multi-LoRA and other PEFT techniques.
Apply sophisticated quantization techniques (FP4/INT4, FP8) to reduce model footprint while preserving quality.
Implement speculative decoding (draft-target, EAGLE, Medusa, etc.) and other latency optimization strategies.
Demonstrate good engineering practices and mentor other team members to do the same.
Collaborate with engineering teams across all of NVIDIA to ensure our software integrates seamlessly up and down the NVIDIA accelerated serving stack.
What We Need To See
Understanding of modern techniques in Machine Learning, Deep Neural Networks, Natural Language Processing, or Speech Recognition.
8+ years of industry experience with Deep Learning frameworks (PyTorch or TensorFlow).
Passion for software engineering, with excellent C++ and Python development skills and meaningful contributions to major open-source projects.
Strong communication and interpersonal skills, along with the ability to work in a dynamic and distributed team. A history of mentoring junior engineers and interns is a huge plus.
Bachelor's degree or equivalent experience.
A desire to constantly grow and learn new things.
Strong computer science fundamentals - algorithms and data structures, computational complexity, parallel and distributed computing, system software.
Ways To Stand Out From a Crowd
Experience architecting or developing large-scale distributed systems for deep learning.
Knowledge of CPU and/or GPU architecture.
GPU programming (CUDA).
The base salary range is 184,000 USD - 356,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.
You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
JR1998706
Seniority level: Mid-Senior level
Employment type: Full-time
Job function: Engineering and Information Technology
Industries: Computer Hardware Manufacturing, Software Development, and Computers and Electronics Manufacturing
- Location: Washington, DC, United States
- Job Type: Full-time
- Category: Engineering