Find your dream job at Australia's leading startups and VCs

Our exceptional communities of founders and investors are constantly seeking passionate individuals like you to join their team. Find your fit in the postings below. Just browsing? Sign up to our newsletter here, and stay up to date on the latest jobs.
companies
Jobs

Senior Deep Learning Algorithm Engineer

Excelero Storage

Excelero Storage

United States · Santa Clara, CA, USA · Remote
USD 184k-356,500 / year + Equity
Posted on Jan 23, 2026

NVIDIA’s GPU Workload Efficiency (GWE) team is looking for a skilled Senior Engineer to enhance performance in training and inference. We are developing methods to improve the efficiency of AI workloads on NVIDIA GPUs. This position entails collaborating on GPU architecture, deep learning frameworks, and large-scale applications to optimize performance. Come aboard and be a part of a team that spearheads the evolution in AI computing!

What you’ll be doing:

  • Evaluating, explaining, and improving deep learning workloads for both training and inference, contributing to advancements in throughput, latency, and efficiency across NVIDIA GPU platforms.

  • Collaborating across NVIDIA with researchers, engineers, and hardware specialists to recognize bottlenecks and achieve performance improvements.

  • Developing production-quality software across the deep learning platform stack, from frameworks to deployment.

  • Building automation and diagnostics that enable reproducible, scalable, and backend-agnostic performance improvements.

What we want to see:

  • 5+ years of relevant experience in deep learning, high-performance computing, or related fields.

  • Master’s or PhD in Computer Science, Electrical Engineering, Computer Engineering, or a related field (or equivalent experience).

  • Extensive background in improving deep learning workloads, showcasing a deep understanding of training and inference constraints.

  • Proven ability in GPU performance analysis and profiling, with hands-on experience applying advanced optimization techniques.

  • Solid knowledge of computer architecture and familiarity with the fundamentals of GPU development.

  • Strong programming skills in Python and C++.

Ways to stand out from the crowd:

  • Proven track record of analyzing, modeling, and tuning application performance with measurable impact.

  • Concrete experience in optimizing models in PyTorch for both training and inference tasks.

  • Developments in performance tooling, profiling infrastructure, or diagnostics that elevated training and inference efficiency.

  • Background in GPU programming (CUDA or OpenCL) is a strong plus, though not required.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until January 26, 2026.

This posting is for an existing vacancy.

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.