Submitting more applications increases your chances of landing a job.

Here’s how busy the average job seeker was last month:

Opportunities viewed

Applications submitted

Keep exploring and applying to maximize your chances!

Looking for employers with a proven track record of hiring women?

Click here to explore opportunities now!
We Value Your Feedback

You are invited to participate in a survey designed to help researchers understand how best to match workers to the types of jobs they are searching for

Would You Be Likely to Participate?

If selected, we will contact you via email with further instructions and details about your participation.

You will receive a $7 payout for answering the survey.


User unblocked successfully
https://bayt.page.link/4KA4ddZinkyKYQX8A
Back to the job results

HPC Infra Engineer

7 days ago 2026/10/28
Other Business Support Services
Create a job alert for similar positions
Job alert turned off. You won’t receive updates for this search anymore.

Job description

NVIDIA is looking for an exceptional engineer to grow and thrive alongside our CAD/EDA/HPC team. You will build and scale the compute infrastructure that powers NVIDIA's next-generation silicon — owning job scheduler environments, cloud


compute integration, CAD toolchains, and automation frameworks that keep our design teams moving at full speed toward tapeout.


What you'll be doing:


  • Be part of the CAD/EDA/HPC team building and scaling the compute infrastructure that powers NVIDIA's next-generation silicon design.


  • Own job scheduler environments, CAD toolchains, automation frameworks, and operational workflows that keep design teams moving efficiently toward tapeout.


  • Integrate and operate hybrid cloud environments across AWS, Azure, GCP, or OCI to elastically extend on-premises CAD capacity.


  • Troubleshoot CAD/EDA software and infrastructure performance issues, benchmark workloads, and improve tool and compute efficiency.


  • Build automation in Python, Perl, Bash, or Tcl for job scheduling, monitoring, capacity reporting, and recurring operational workflows.


  • Operate large-scale Linux compute farms using LSF and/or Slurm while partnering with design teams on throughput, utilization, and tapeout capacity planning.


What we need to see:


  • B.E./B.Tech or M.Tech/M.S. in Computer Science, Electronics Engineering, or a related field, or equivalent experience.


  • 3+ years of hands-on experience in HPC system administration, Cloud infrastructure, Systems engineering or SRE roles supporting engineering infrastructure.


  • Strong Linux/Unix administration skills, large-scale compute farm experience with LSF and/or Slurm, and proficiency in at least one scripting language; Python is preferred.


  • Hands-on experience managing GPU nodes/compute farms


  • Hands-on knowledge of cloud platforms such as GCP, OCI, AWS, or Azure, including compute, storage, networking, and cost fundamentals.


  • Preferred exposure to Linux performance engineering, Docker/Kubernetes, infrastructure as code such as Ansible or Terraform, distributed file systems, and observability stacks.


  • Experience supporting the environments where CAD/EDA flows such as synthesis, place and route, simulation, DRC/LVS, or equivalent implementation and verification flows is preferred.


This job post has been translated by AI and may contain minor differences or errors.
You’ve reached the maximum limit of 15 job alerts. To create a new alert, please delete an existing one first.
Job alert created for this search. You’ll receive updates when new jobs match.
Are you sure you want to unapply?

You'll no longer be considered for this role and your application will be removed from the employer's inbox.