Graphcore

Senior System Validation Engineer

Hybrid
Austin, TX
Senior level
The Senior System Validation Engineer will design and implement validation tests for Arm-based SoCs, automate tests using Python, and analyze results across processing units and memory systems.
About us

Graphcore is one of the world’s leading innovators in Artificial Intelligence compute.

It is developing hardware, software and systems infrastructure that will unlock the next generation of AI breakthroughs and power the widespread adoption of AI solutions across every industry.

As part of the SoftBank Group, Graphcore is a member of an elite family of companies responsible for some of the world’s most transformative technologies.

Graphcore’s teams are drawn from diverse backgrounds and bring a broad range of skills and perspectives.

Job Summary

We are seeking a Senior System Validation Engineer to design and implement validation tests for Arm-based data center SoCs using a combination of open-source benchmarking tools and custom micro-benchmarks. In this role you will define test cases, tune workload parameters, and use Python to automate and analyze validation across processing units, system memory, communication links, and PCI Express components.

The Team

The Systems Validation team validates Graphcore’s AI compute platforms across hardware, firmware, and software domains.

Responsibilities and Duties
  • Translate hardware features into structured, parameterized test cases, including:
    • Objectives, workload models, parameters, metrics, and pass/fail criteria. 
  • Use and tune open-source tools such as:
    • stress-ng, fio, iperf, sysbench, lmbench
  • Define workload parameters:
    • Concurrency (threads/cores), memory footprint, access patterns
    • Queue depth, block size, CPU/NUMA affinity. 
  • Design and execute validation scenarios:
    • Functional, performance, stress, and corner-case testing.
  • Validate across domains:
    • CPU scaling and cache behavior
    • Memory (DDR/HBM) bandwidth, latency, and NUMA effects
    • Interconnect contention under multi-core load
    • PCIe and I/O throughput, latency, and multi-device scenarios
  • Develop custom micro-benchmarks (C/C++) when existing tools are insufficient
  • Use Python to:
    • Automate test execution and orchestration
    • Implement parameter sweeps and test matrices 
    • Parse, analyze, and visualize results. 
  • Debug system behavior and performance issues in collaboration with hardware and firmware teams

Candidate Profile

Essential:

  • Strong background in Linux systems and low-level performance analysis
  • Experience in system validation, benchmarking, or performance testing
  • Solid understanding of one or more domains:
    • CPU architecture, memory systems, or PCIe
  • Experience using benchmarking tools (stress-ng, fio, iperf, etc.)
  • Ability to define test cases, parameters, and workload models
  • Proficiency in Python (essential) for automation, orchestration, and data analysis
  • Programming experience in C/C++ for micro-benchmark development

Desirable:

  • Experience with Arm-based platforms
  • Strong programming skills in C/C++ and Python for system-level development and automation
  • Familiarity with performance tools (perf, numactl, taskset)
  • Experience with large-scale or high core-count systems

USA Benefits

In addition to a competitive salary, Graphcore offers flexible working and a comprehensive benefits package designed to support your health, wellbeing and financial future. Our benefits include medical, dental and vision coverage, Flexible Spending Accounts (FSAs), Health Savings Accounts (HSAs), disability and life insurance, a 401(k) retirement plan, commuter benefits, wellness services and an Employee Assistance Programme (EAP).

We welcome people of different backgrounds and experiences; we're committed to building an inclusive work environment that makes Graphcore a great home for everyone. We offer an equal opportunity process and understand that there are visible and invisible differences in all of us. We can provide a flexible approach to interview and encourage you to chat to us if you require any reasonable adjustments.
