Find your dream job at Australia's leading startups and VCs

Our exceptional communities of founders and investors are constantly seeking passionate individuals like you to join their team. Find your fit in the postings below. Just browsing? Sign up to our newsletter here, and stay up to date on the latest jobs.
companies
Jobs

Deep Learning Performance Architect

Excelero Storage

Excelero Storage

IT
Shanghai, China
Posted on Jun 3, 2025

NVIDIA is developing processor and system architectures that accelerate deep learning and high-performance computing applications. We are looking for an expert deep learning performance architect to join our AI performance modelling, analysis and optimization efforts. In this position, you will have a chance to work on DL performance modelling, analysis, and optimization on state-of-the-art hardware architectures for various LLM workloads. You will make your contributions to our dynamic technology focused company.

What you'll be doing:

  • Analyze state-of-the-art DL networks (LLM etc.), identify and prototype performance opportunities to influence SW and Architecture team for NVIDIA's current and next gen inference products.

  • Develop analytical models for the state-of-the-art deep learning networks and algorithm to innovate processor and system architectures design for performance and efficiency.

  • Specify hardware/software configurations and metrics to analyze performance, power, and accuracy in existing and future uniprocessor and multiprocessor configurations.

  • Collaborate across the company to guide the direction of next-gen deep learning HW/SW by working with architecture, software, and product teams.

What we need to see:

  • BS, MS or PhD in relevant discipline (CS, EE, Math, etc.) or equivalent experience.

  • 5+ years’ work experience.

  • Experience with popular AI models (e.g., LLM and AIGC models)

  • Be familiar with typical deep learning SW framework (e.g., Torch/JAX/TensorFlow/TensorRT)

  • Knowledge and experience on hardware architectures for deep learning applications

Ways to stand out from the crowd:

  • Background with CUDA and GPU computing systems

  • Experience on performance modelling or optimization of DL workloads