Sr. Site Reliability Engineer

Reposted 21 Days Ago

Toronto, ON

Mid level

As a Sr. Site Reliability Engineer, you will develop software solutions for reliability, automate processes, manage capacity, and ensure system operability at scale.

The summary above was generated by AI

About Pinterest:

Millions of people across the world come to Pinterest to find new ideas every day. It’s where they get inspiration, dream about new possibilities and plan for what matters most. Our mission is to help those people find their inspiration and create a life they love. In your role, you’ll be challenged to take on work that upholds this mission and pushes Pinterest forward. You’ll grow as a person and leader in your field, all the while helping Pinners make their lives better in the positive corner of the internet.

Creating a life you love also means finding a career that celebrates the unique perspectives and experiences that you bring. As you read through the expectations of the position, consider how your skills and experiences may complement the responsibilities of the role. We encourage you to think through your relevant and transferable skills from prior experiences.

Our new progressive work model is called PinFlex, a term that’s uniquely Pinterest to describe our flexible approach to living and working. Visit our PinFlex landing page to learn more.

The Site Reliability Engineering organization at Pinterest is accountable for ensuring overall Pinterest availability as well as enhancing Engineering teams’ capability to design, build and operate robust systems at scale.

Pinterest’s applications and infrastructure handle billions of monthly page views and petabytes of data as Pinterest continues to grow and scale. As a Pinterest SRE, you will design and build systems, platforms, tools, frameworks and methodologies to assure the reliability of our large-scale distributed systems.

What you’ll do:

Develop software solutions to enable reliability and operability of large scale distributed systems handling petabytes of data and serving
Build a deep understanding of how Pinterest’s systems behave, scale, interact and fail, and use that insight to identify risks and opportunities for remediation
Build tools and automation to eliminate toil and reduce operational overhead. Create frameworks, processes and best practices to be used across Pinterest Engineering
Build meaningful, insightful and actionable SLIs
Automate critical portions of Pinterest’s engineering processes to minimize risk and maximize the speed of innovation
Manage capacity and performance to help scale our infrastructure on both public and private clouds around the world

What we’re looking for:

Strong knowledge of Linux/Unix/BSD internals and experience working with open source software (e.g. MySQL, Hadoop, Envoy, HAProxy, Nginx)
Bachelor’s or Master’s degree in a relevant field such as Computer Science, or equivalent experience
Experience with technologies such as Elasticsearch, ZooKeeper, HBase, Hadoop, Memcached and Kafka with a focus on reliability, automation, operability and performance
2+ years of experience with programming languages (Python, Java, Ruby, etc.)
Experience with infrastructure as code is a plus (e.g. Terraform, Puppet, Chef, Ansible, Salt, Fabric, Docker)
Bonus points if experienced with deploying web apps to cloud infrastructure (AWS, etc.) and working with distributed, service-oriented architecture

Relocation Statement:

This position is not eligible for relocation assistance. Visit our PinFlex page to learn more about our working model.

In-Office Requirement Statement:

We let the type of work you do guide the collaboration style. That means we're not always working in an office, but we continue to gather for key moments of collaboration and connection.
This role will need to be in the office for in-person collaboration 1-2 times per half and therefore can be situated anywhere in Ontario.

Our Commitment to Inclusion:

Pinterest is an equal opportunity employer and makes employment decisions on the basis of merit. We want to have the best qualified people in every job. All qualified applicants will receive consideration for employment without regard to race, color, ancestry, national origin, religion or religious creed, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, age, marital status, status as a protected veteran, physical or mental disability, medical condition, genetic information or characteristics (or those of a family member) or any other consideration made unlawful by applicable federal, state or local laws. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you require a medical or religious accommodation during the job application process, please complete this form for support.

Top Skills

Ansible, AWS, BSD, Chef, Docker, Elasticsearch, Envoy, Fabric, Hadoop, HAProxy, HBase, Java, Kafka, Linux, MySQL, Nginx, Puppet, Python, Ruby, Salt, Terraform, Unix, ZooKeeper
