Senior Deep Learning Algorithm Engineer
Excelero Storage
NVIDIA’s GPU Workload Efficiency (GWE) team is looking for a skilled Senior Engineer to improve training and inference performance. We are developing methods to improve the efficiency of AI workloads on NVIDIA GPUs. This position involves working across GPU architecture, deep learning frameworks, and large-scale applications to optimize performance. Come aboard and be part of a team that spearheads the evolution of AI computing!
What you’ll be doing:
Evaluating, explaining, and improving deep learning workloads for both training and inference, contributing to advancements in throughput, latency, and efficiency across NVIDIA GPU platforms.
Collaborating across NVIDIA with researchers, engineers, and hardware specialists to identify bottlenecks and achieve performance improvements.
Developing production-quality software across the deep learning platform stack, from frameworks to deployment.
Building automation and diagnostics that enable reproducible, scalable, and backend-agnostic performance improvements.
What we want to see:
5+ years of relevant experience in deep learning, high-performance computing, or related fields.
Master’s or PhD in Computer Science, Electrical Engineering, Computer Engineering, or a related field (or equivalent experience).
Extensive background in improving deep learning workloads, with a deep understanding of training and inference constraints.
Proven ability in GPU performance analysis and profiling, with hands-on experience applying advanced optimization techniques.
Solid knowledge of computer architecture and familiarity with the fundamentals of GPU development.
Strong programming skills in Python and C++.
Ways to stand out from the crowd:
Proven track record of analyzing, modeling, and tuning application performance with measurable impact.
Hands-on experience optimizing PyTorch models for both training and inference.
Contributions to performance tooling, profiling infrastructure, or diagnostics that improved training and inference efficiency.
Background in GPU programming (CUDA or OpenCL) is a strong plus, though not required.
You will also be eligible for equity and benefits.
This posting is for an existing vacancy.
NVIDIA uses AI tools in its recruiting processes.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.