Prodigy Education Logo

Prodigy Education

Senior Site Reliability Engineer

Posted 7 Hours Ago
Be an Early Applicant
In-Office
Toronto, ON, CAN
Senior level
In-Office
Toronto, ON, CAN
Senior level
As a Senior Site Reliability Engineer, you'll manage cloud platforms, write Infrastructure as Code, optimize systems, and improve developer experiences through automation and tooling.
The summary above was generated by AI

Prodigy Education is a global leader in game-based learning. Our mission is to help every student in the world love learning, motivating more than 20 million students a year to practice standards-aligned math and English. More than 800,000 teachers use Prodigy as a free instructional tool. Fun, motivating, and research-based, Prodigy is the EdTech platform students actually ask to use. Visit www.prodigygame.com to learn more.

Want to learn more about what Prodigy and our People have been up to go click on Prodigy News!

Vacancy Status

This job posting is for an existing vacancy.

Overview

As a Senior Site Reliability Engineer at Prodigy, you will join a high-leverage Infrastructure team that owns our cloud platform end-to-end. You aren't just "using" tools, you are building the foundation that allows millions of students to access adaptive learning. You will manage Kubernetes clusters, GitOps pipelines, and Terraform-defined AWS environments while creating the internal tooling that empowers our entire engineering organization.

🚀 Your Impact

  • System Ownership: Own and modernize significant systems across EKS, ArgoCD, and AWS to ensure the platform scales with our growing student base.

  • Infrastructure as Code: Write and maintain high-quality Terraform and Helm code that serves as the standard for other product teams.

  • Developer Empowerment: Build and maintain Go/Python-based CLIs and automation that simplify the developer experience for every engineer at Prodigy.

  • Operational Excellence: Participate in on-call rotations and lead incident responses, turning production "fires" into permanent architectural improvements.

  • Observability & Tuning: Optimize Datadog instrumentation and profile Node.js workloads to find and fix performance bottlenecks before they impact users.

🌟 About You

  • Experience: 5+ years in SRE, Platform, or Infrastructure roles running production systems at scale.

  • Kubernetes Expert: Deep understanding of K8s internals, debugging complex failures, and managing manifests via Helm or Kustomize.

  • Cloud & IaC: Advanced proficiency in AWS (IAM, Networking, EKS) and writing reusable, modular Terraform.

  • Coder, not just a Scripter: Ability to write clean, maintainable code in Go or Python to build production-grade internal tooling.

  • Communicator: High bar for written clarity, essential for postmortems and documentation in our remote-friendly environment.

💡 Bonus Points

  • Experience with GitOps workflows using ArgoCD.

  • Hands-on experience profiling or optimizing Node.js/TypeScript services.

  • Knowledge of Service Mesh architectures or Kubernetes Gateway API.

  • Background in EdTech or high-concurrency consumer platforms.

🏆 Working on the Team

  • Small but Mighty: Join a senior team where your individual contributions have outsized impact on the entire company.

  • Modern Stack: Work with cutting-edge tools like ArgoCD, Kubernetes Gateway API, and Drata for compliance automation.

  • Culture of Learning: We prioritize "correct over quick," focusing on postmortems that lead to real improvements rather than finger-pointing.

Working at Prodigy

Be part of a mission-driven organization dedicated to helping every student in the world love learning! At Prodigy Education, your work positively impacts the lives of millions of students and teachers worldwide.

We understand that a thriving team is at the core of our success. So, on top of an inspirational mission and rewarding work, our Total Rewards Program reflects our commitment to your financial, physical, and mental well-being

🥇The world’s biggest math competition is back, bringing more champions, more classroom excitement, more prizes and even more fun for students! Check out round two of Prodigy's State Challenge! The Prodigy State Challenge Returns!

Come as you are. We believe the power of our collective potential will transform education. We are building towards a diverse, inclusive, and equitable workplace to empower and create access and opportunity for all. We welcome applications from people from all underrepresented groups, including (but not limited to) people of any gender, age, or religion, members of the LGBTQIA2+ community, BIPOC and other underrepresented races and nationalities, people with disabilities, veterans, and anyone who may contribute to the further diversification of Prodigy Education. If you feel like you don’t have all the qualifications for this position and are willing to use your initiative to learn the rest, we’d still love for you to apply!

We are an equal opportunity employer and are committed to providing employment accommodation in accordance with the Ontario Human Rights Code and the Accessibility for Ontarians with Disabilities Act, 2005 (AODA). Prodigy Education will provide accommodations to job applicants with disabilities throughout the recruitment process. If you require accommodation, please notify us at [email protected], and we will work with you to meet your needs.

AI Disclosure

Prodigy leverages AI-assisted technology to enhance efficiency in areas such as resume screening. However, all hiring decisions are made by our recruitment team in collaboration with hiring managers.

Prodigy may use a notetaker during interviews to transcribe conversations, generate summaries. This technology helps our interviewers focus on the conversation.

 
 

Prodigy Education Toronto, Ontario, CAN Office

144 Front St W, Suite 400, Toronto, Ontario, Canada

Similar Jobs

16 Days Ago
Remote or Hybrid
Canada
Senior level
Senior level
Cloud • Insurance • Payments • Software • Business Intelligence • App development • Big Data Analytics
As a Senior Site Reliability Engineer, you will ensure software reliability and scalability, manage IAC, CI/CD, monitor systems, and mentor junior engineers while collaborating across teams.
Top Skills: AnsibleArgocdBashDatadogGithub ActionsGitlabGoHashicorp ConsulHelmKubernetesPackerPostgresPowershellPythonSQL ServerTerraformTypescript
12 Days Ago
In-Office or Remote
CA
Senior level
Senior level
Blockchain • eCommerce • Fintech • Payments • Software • Financial Services • Cryptocurrency
The Senior Site Reliability Engineer will enhance reliability of Block's platform, improve incident response using AI tools, and coordinate incident management. Responsibilities include building reliable systems, standardizing tools, and leading high-severity incidents during on-call rotations.
Top Skills: Amazon Web ServicesDatadogDynamoDBGrpcHTTPIstioJavaJSONKotlinKubernetesLaunchdarklyMySQLProtocol BuffersTerraformVitess
8 Days Ago
In-Office or Remote
Canada
Senior level
Senior level
Artificial Intelligence • Computer Vision • Machine Learning
As a Senior Site Reliability Engineer, you will own AWS infrastructure, ensure reliability, and enhance deployment processes, primarily working with Kubernetes and Terraform.
Top Skills: ArgocdAWSBashGithub ActionsGrafanaHelmKubernetesKustomizePrometheusPythonTerraform

What you need to know about the Toronto Tech Scene

Although home to some of the biggest names in tech, including Google, Microsoft and Amazon, Toronto has established itself as one of the largest startup ecosystems in the world. And with over 2,000 startups — more than 30 percent of the country's total startups — Toronto continues to attract new businesses. Be it helping entrepreneurs manage their finances, simplifying business operations by automating payroll or assisting pharmaceutical companies in launching new drugs, the city's tech scene is just getting started.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account