Crusoe Energy Systems Logo

Crusoe Energy Systems

Staff Infrastructure Engineer

Job Posted 19 Days Ago Posted 19 Days Ago
Remote
Hybrid
2 Locations
Senior level
Remote
Hybrid
2 Locations
Senior level
The Staff Infrastructure Engineer will manage cloud operations, develop automation tools, troubleshoot GPU hardware, and transition infrastructure to Kubernetes.
The summary above was generated by AI

Crusoe is building the World’s Favorite AI-first Cloud infrastructure company. We’re pioneering vertically integrated,  purpose-built AI infrastructure solutions trusted by Fortune 500 companies to power their most advanced AI applications. Crusoe is redefining AI cloud infrastructure, with a mission to align the future of computing with the future of the climate. Our AI platform is recognized as the "gold standard" for reliability and performance. Our data centers are optimized for AI workloads and are powered by clean, renewable energy.

Be part of the AI revolution with sustainable technology at Crusoe. Here, you'll drive meaningful innovation, make a tangible impact, and join a team that’s setting the pace for responsible, transformative cloud infrastructure.

About the Role: 

We are seeking a Senior Software Infrastructure Engineer to play a critical role in managing Crusoe’s fleet operations, focusing on foundational tools for provisioning and reprovisioning servers with a strong emphasis on Infrastructure as code. The role includes building automation tools, troubleshooting hardware, and scaling operations to support high growth. The candidate will be integral in transitioning to Kubernetes and optimizing Crusoe's infrastructure.

This position offers the opportunity to work on cutting-edge technologies within a world-class team and contribute directly to the success of a rapidly growing company while making a significant impact on the global energy landscape.

Key Responsibilities:

  • Manage and maintain day-to-day operations of Crusoe’s cloud infrastructure.

  • Develop automation tools to streamline server provisioning and reduce SLA times.

  • Scale infrastructure to support mass deployments (80-100 servers simultaneously).

  • Troubleshoot hardware issues, especially with GPUs, and liaise with vendors.

  • Transition Crusoe’s environment to Kubernetes and containerized workflows.

You Will Thrive In This Role If You Have:

  • Solid hardware experience and GPU troubleshooting expertise.

  • Strong Linux background 

  • Knowledge of PXE booting and server provisioning (bare metal)

  • Experience with BMC/IPMI, BIOS, and enterprise-grade server management.

  • Kubernetes proficiency (admin or developer).

  • Familiarity with containerization technologies (Docker preferred).

  • Experience with version control systems ( Gitlab )

  • Problem solving skills - able to analyze complex technical issues and develop effective solutions

  • Strong communication and collaboration skills to work effectively with cross-functional teams

  • Values: Embody the Company values

  • Experience with MAAS (nice to have)

  • Proficiency in Python or Golang (preferred language) (nice to have)

  • Kubernetes administration and deployment experience (nice to have)

  • Experience with Ansible and Terraform (nice to have)

Benefits:

  • Hybrid work schedule

  • Industry competitive pay

  • Restricted Stock Units in a fast growing, well-funded technology company

  • Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents

  • Employer contributions to HSA accounts 

  • Paid Parental Leave 

  • Paid life insurance, short-term and long-term disability 

  • Teladoc 

  • 401(k) with a 100% match up to 4% of salary

  • Generous paid time off and holiday schedule

  • Cell phone reimbursement

  • Tuition reimbursement

  • Subscription to the Calm app

  • MetLife Legal

  • Company paid commuter benefit; $100 per pay period

Compensation Range

Compensation will be paid in the range of $215,000 - $250,000. Restricted Stock Units are included in all offers. Compensation to be determined by the applicants knowledge, education, and abilities, as well as internal equity and alignment with market data.

Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/ orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.


#BI-Remote

Top Skills

Ansible
Bios
Bmc
Docker
Gitlab
Go
Ipmi
Kubernetes
Linux
Pxe
Python
Terraform

Similar Jobs at Crusoe Energy Systems

10 Days Ago
Remote
Hybrid
2 Locations
Senior level
Senior level
Cloud • Greentech • Other • Energy
Lead the design and implementation of core AI services, focusing on scalability and performance for a high-throughput AI inference platform.
Top Skills: Elastic ComputeGoGrpcKubernetesManaged DatabasesObject StoragePythonRest ApisVirtual Private Networks
15 Days Ago
Remote
Hybrid
2 Locations
Senior level
Senior level
Cloud • Greentech • Other • Energy
As a Site Reliability Engineer II on the Observability team, you'll manage and improve observability stacks, support engineering teams with monitoring, develop new tools, and analyze system performance for enhanced reliability.
Top Skills: AnsibleCircleCICloud FormationDockerGithub ActionsGitlab Ci/CdGoKubernetesPythonTerraform

What you need to know about the Toronto Tech Scene

Although home to some of the biggest names in tech, including Google, Microsoft and Amazon, Toronto has established itself as one of the largest startup ecosystems in the world. And with over 2,000 startups — more than 30 percent of the country's total startups — Toronto continues to attract new businesses. Be it helping entrepreneurs manage their finances, simplifying business operations by automating payroll or assisting pharmaceutical companies in launching new drugs, the city's tech scene is just getting started.
By clicking Apply you agree to share your profile information with the hiring company.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account