Graphcore Logo

Graphcore

Senior BMC Firmware Engineer

Posted 4 Hours Ago
Be an Early Applicant
Hybrid
Austin, TX
Senior level
Hybrid
Austin, TX
Senior level
Lead advanced troubleshooting for server blades and rack-scale infrastructure, support hardware bring-up, and conduct validation for AI compute platforms. Collaborate with engineering teams for system reliability.
The summary above was generated by AI
About us

Graphcore is one of the world’s leading innovators in Artificial Intelligence compute. It is developing hardware, software and systems infrastructure that will unlock the next generation of AI breakthroughs and power the widespread adoption of AI solutions across every industry.

As part of the SoftBank Group, Graphcore is a member of an elite family of companies responsible for some of the world’s most transformative technologies. Together, they share a bold vision: to enable Artificial Super Intelligence and ensure its benefits are accessible to everyone.

Graphcore’s teams are drawn from diverse backgrounds and bring a broad range of skills and perspectives. A melting pot of AI research specialists, silicon designers, software engineers and systems architects, Graphcore enjoys a culture of continuous learning and constant innovation.


Job Summary

We are seeking a Senior BMC Firmware Engineer to develop and support next-generation Baseboard Management Controller (BMC) firmware for Graphcore’s AI server platforms.
This role will focus on building secure, scalable, and cloud-ready platform management infrastructure across hyperscale AI systems. The successful candidate will contribute to OpenBMC firmware development from hardware bring-up through production deployment, enabling advanced telemetry, firmware lifecycle management, and platform orchestration capabilities across large-scale AI infrastructure.
The ideal candidate combines strong expertise in OpenBMC, Linux-based firmware development, server platform architecture, and low-level hardware integration with excellent collaboration and problem-solving skills.


The Team

The Systems Firmware and Platform Engineering teams are responsible for delivering the foundational firmware infrastructure powering Graphcore’s AI compute systems.
The team works closely with silicon engineering, server platform architects, hardware engineering, ODM/JDM partners, and data center operations teams to design, develop, validate, and deploy scalable platform management solutions for next-generation AI infrastructure.
This collaborative environment enables rapid innovation across server management, telemetry, firmware security, and fleet-scale deployment technologies while supporting the operational reliability of Graphcore’s AI platforms from early bring-up through production deployment.

Responsibilities and Duties
  • Develop and maintain OpenBMC firmware for cloud-scale ready server platforms, from hardware bring-up to production deployment. Strong hands-on experience with the OpenBMC stack is required.
  • Enable hardware integration including power, sensors, FRU inventory, and PCIe/NVLink device management.
  • Build and maintain manageability interfaces using D-Bus, Redfish, PLDM, and MCTP.
  • Support firmware security features including Root of Trust, secure/measured boot, and SPDM attestation.
  • Contribute to multi-component firmware update frameworks with A/B redundancy and rollback protection across BMC, BIOS, GPUs, and NICs.
  • Develop telemetry and observability capabilities for platform monitoring and diagnostics.
  • Collaborate with silicon vendors and ODM partners throughout platform development and manufacturing ramp.
  • Contribute to OpenBMC upstream and industry standards initiatives including DMTF, Redfish, and SPDM.
  • Support CI/CD infrastructure, Yocto builds, automated testing, and firmware validation workflows.
  • Perform system-level debug across hardware and firmware in lab and production environments.
  • Contribute to and support lab development, maintenance, hardware handling, and logistics activities.
Candidate ProfileEssential
  • Bachelor’s or Master’s degree in Electrical Engineering, Computer Engineering, Computer Science, or equivalent experience.
  • 6+ years of BMC firmware development for server or data center platforms.
  • Deep expertise in OpenBMC, including D-Bus services, Yocto/BitBake, and platform integration.
  • Strong programming skills in C++, C, Linux kernel/user-space, and scripting languages including Python and Bash.
  • Experience with Redfish, PLDM, SPDM, MCTP, and IPMI.
  • Solid understanding of server hardware architecture including power systems, thermals, sensors, and FRUs.
  • Experience with system interfaces and buses including I²C, I3C, SPI, PCIe, and related platform control interfaces.
  • Experience with BIOS–BMC interactions, telemetry, RAS, and firmware lifecycle management.
  • Experience building CI/CD pipelines for firmware development and validation.
  • Strong system debugging skills using JTAG, GDB, and logic analyzers.
  • Familiarity with firmware security concepts including Root of Trust, secure boot, and signed firmware updates.
  • Familiarity with hardware schematics and logical hardware debugging methodologies.
Desirable
  • Experience developing OpenBMC for ARM-based server platforms in hyperscale environments.
  • Hands-on experience with the OpenBMC codebase, including platform bring-up and upstream contributions.
  • Experience working with ODM/JDM partners across the platform lifecycle.
  • Experience delivering firmware for large-scale cloud deployments and fleet update infrastructure.

We welcome people of different backgrounds and experiences and are committed to building an inclusive work environment that makes Graphcore a great home for everyone. We are an equal opportunity employer and want to build a work environment where everyone is happy, productive and respectful so they can do their best work. If you have a disability or additional need that requires accommodation, just let us know.

Similar Jobs at Graphcore

2 Hours Ago
Hybrid
Senior level
Senior level
Artificial Intelligence • Semiconductor
Lead the development of OpenBMC firmware for hyperscale platforms, ensuring collaboration with partners and integration into CI/CD pipelines. Debug and design interfaces for platform management while aligning with hardware teams.
Top Skills: BashC/C++Github ActionsGitlab CiIpmiJenkinsLinuxOpenbmcPmciPythonRedfishSnmpSshVncYocto
Senior level
Artificial Intelligence • Semiconductor
The role involves leading the bring-up and characterisation of silicon devices, executing tests, and liaising with engineering teams to ensure correct operation under all conditions.
Top Skills: DdrEthernetJmpPciePython
Mid level
Artificial Intelligence • Semiconductor
The engineer will create and execute plans for silicon device characterisation, liaise with engineering teams, and automate testing processes while producing detailed reports.
Top Skills: DdrEthernetJmpPciePython

What you need to know about the Toronto Tech Scene

Although home to some of the biggest names in tech, including Google, Microsoft and Amazon, Toronto has established itself as one of the largest startup ecosystems in the world. And with over 2,000 startups — more than 30 percent of the country's total startups — Toronto continues to attract new businesses. Be it helping entrepreneurs manage their finances, simplifying business operations by automating payroll or assisting pharmaceutical companies in launching new drugs, the city's tech scene is just getting started.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account