MaintainX Jobs

Site Reliability Engineer

MaintainX

Site Reliability Engineer

Reposted 13 Days Ago

In-Office

Toronto, ON, CAN

Mid level

In-Office

Toronto, ON, CAN

Mid level

The Site Reliability Engineer will enhance reliability, observability, and developer autonomy while collaborating with product and platform engineering teams and mentoring developers on reliability practices.

The summary above was generated by AI

MaintainX is the world's leading Asset and Work Intelligence platform for industrial and frontline environments. We are a modern, IoT-enabled, cloud-based tool for reliability, safety, and operations of physical equipment and facilities. MaintainX powers operational excellence for 12,000 businesses, including Duracell, Univar Solutions Inc., Titan America, McDonald's, Brenntag, Cintas, Xylem, and Shell.

We recently completed a $150 million Series D funding round, bringing our total funding to $254 million and valuing the company at $2.5 billion.

We’re looking for a Site Reliability Engineer to help advance MaintainX’s reliability, observability, and developer autonomy as we scale our platform.

In this role, you’ll partner closely with product and platform development teams to improve the stability, resilience, and operational readiness of our services. You’ll work alongside teams to design for reliability from the start, establish clear ownership and standards, and build shared tooling that enables teams to operate their services with confidence.

You’ll also contribute to company-wide initiatives that define how MaintainX approaches reliability software development, including observability standards, incident response practices, and service health metrics, helping the organization adopt proven industry practices at scale.

This role is well-suited for an developer who enjoys working across teams, influencing technical direction through strong development practices, and turning reliability principles into practical, scalable systems.

What You'll Do:

Assess service maturity and provide insights to development teams
Partner with development teams to implement observability best practices
Enable development teams to become autonomous with their service deployment, support, and infrastructure
Mentor developers on reliability practices, focusing on making them self-sufficient
Act as the bridge, ear and eyes of the Platform Division teams to drive tooling and practice adoption across development teams

About You:

Deep understanding of observability practices in a distributed system environment and how it influences system design and team behaviour
Practical experience with SRE concepts (SLOs, error budgets, incident management)
3–5+ years in software development, SRE, DevOps, or production development roles with experience operating production systems
Proficient in cloud-native platforms and infrastructure-as-code concepts and tools
Working knowledge of at least one programming language (TypeScript/Node.js is a plus)
Excellent communication and collaboration abilities across technical and non-technical teams
Ability to translate complex reliability concepts into actionable guidance
You enjoy enabling teams to succeed independently and measuring success by reduced dependency on you

What’s in it For You:

Competitive salary and meaningful equity opportunities.
Healthcare, dental, and vision coverage.
401(k) / RRSP enrollment program.
Take what you need PTO.
A Work Culture where:
- You’ll work alongside folks across the globe that reflect the MaintainX values, Smart Humble Optimist.
- We believe in meritocracy, where ideas and effort are publicly celebrated.

About Us:

Our mission is to deliver one platform for maintenance, repair & operations teams to keep the physical world running. We believe the greatest asset in any organization is the people. That’s why we built an intuitive, mobile-first solution to help boost productivity and collaboration across teams and locations.

MaintainX is committed to creating a diverse environment. All qualified applicants will receive consideration for employment without regard to race, colour, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, or veteran status.

Similar Jobs

Enverus

Site Reliability Engineer

13 Days Ago

In-Office or Remote

Canada

Senior level

Big Data • Information Technology • Software • Analytics • Energy

Manage and scale Enverus' global AWS infrastructure, automate deployments and CI/CD, ensure high uptime, collaborate with developers to enable zero-downtime releases, participate in on-call rotations, and improve operational practices.

Top Skills: AWSAzureC#Ci/CdCloudFormationGoKubernetesLinuxPythonTerraformWindows

Serigor Inc

Site Reliability Engineer

Yesterday

In-Office

Toronto, ON, CAN

Mid level

Information Technology

Site Reliability Engineers apply software and systems engineering to improve reliability, performance, and operability. They deploy, configure, monitor, recover, and scale services, participate in on-call rotations, evaluate products before and after releases, and spend at least half their time engineering away problems while collaborating with teammates.

Top Skills: AnsibleAPIsAWSAzureC#ChefJavaJavaScriptLinuxPerlPHPPowershellPuppetPythonRubyShellWindows

Morningstar

Senior Site Reliability Engineer

7 Days Ago

Hybrid

Toronto, ON, CAN

Senior level

Artificial Intelligence • Big Data • Enterprise Web • Fintech • Software • Financial Services

Design and maintain CI/CD pipelines and AWS infrastructure using IaC; manage containerized deployments (Docker, ECS/EKS); provide on-call incident triage and post-incident reviews; lead reliability, disaster recovery, and security efforts; implement monitoring and automation (Splunk, CloudWatch, New Relic); write automation scripts in Python/Bash; document runbooks and collaborate with global teams.

Top Skills: Aws CloudwatchAws Ec2Aws EcsAws EksAws IamAws LambdaAws RdsAws Route 53Aws S3Aws SamAws VpcBashCdkClaude CodeCloudFormationDatadogDockerGithub ActionsGithub CopilotHarnessJenkinsLinux/UnixNew RelicPythonServerless FrameworkSplunkTerraform

What you need to know about the Toronto Tech Scene

Although home to some of the biggest names in tech, including Google, Microsoft and Amazon, Toronto has established itself as one of the largest startup ecosystems in the world. And with over 2,000 startups — more than 30 percent of the country's total startups — Toronto continues to attract new businesses. Be it helping entrepreneurs manage their finances, simplifying business operations by automating payroll or assisting pharmaceutical companies in launching new drugs, the city's tech scene is just getting started.