Staff Engineer - AI Compute Blade and Rack Validation

DevOps / Infra · US - Austin · Today

Job Description

About us

Graphcore is one of the world’s leading innovators in Artificial Intelligence compute.

It is developing hardware, software and systems infrastructure that will unlock the next generation of AI breakthroughs and power the widespread adoption of AI solutions across every industry.

As part of the SoftBank Group, Graphcore is a member of a best-in-class family of companies responsible for some of the world’s most transformative technologies. Together, they share a bold vision: to enable Artificial Super Intelligence and ensure its benefits are accessible to everyone.

Graphcore’s teams are drawn from a diverse group of backgrounds and bring a broad range of skills and perspectives. A melting pot of AI research specialists, silicon designers, software engineers and systems architects, Graphcore enjoys a culture of continuous learning and constant innovation.

Job Summary

We are seeking a senior validation engineer to lead at-scale rack validation for next-generation AI hyperscale systems. This role focuses on post-silicon system validation across the full lifecycle, ensuring that functional, electrical, and thermal performance meets product objectives. You will own end-to-end blade and rack validation, including planning, development, execution, and debug, while collaborating across firmware, systems, and hardware teams.

The Team

The Rack Validation team is responsible for ensuring system readiness and quality at scale. The team works cross-functionally with firmware, silicon, and system engineering teams to validate complex AI compute platforms.

Responsibilities and Duties

  • Lead post-silicon validation of AI compute blades and racks, including test planning, development, and automation.
  • Drive provisioning and integration of system components (SoC FW, BMC, RMC, OS) for rack-level readiness.
  • Own execution against program milestones and report validation progress and risks.
  • Triage test failures, collect debug data, and collaborate on root cause analysis.
  • Track validation coverage and continuously improve test processes and infrastructure.
  • Collaborate with ODM/JDM partners on validation and quality.
  • Mentor engineers and drive engineering excellence.

Candidate Profile

Essential:

  • Bachelor’s or Master’s degree, or equivalent experience, in Computer Engineering, Electrical Engineering, Computer Science, or a related field.
  • Proven track record in system, rack, or embedded validation with leadership experience.
  • Strong experience in large-scale hardware validation environments.
  • Expertise in CPU/GPU, memory, IO, and firmware validation.
  • Experience with Linux/server OS and automation using Python/Bash.
  • Knowledge of IPMI, Redfish, PLDM.
  • Experience with CI/CD pipelines and hardware interfaces.

Desirable:

  • Experience in hyperscale environments.
  • Familiarity with OpenBMC and processes for verifying firmware functionality.
  • Knowledge of firmware security and HIL testing.
  • Experience with test management tools.
