Autodesk

Principal Site Reliability Developer

Posted 4 Days Ago
Toronto, ON
Senior level
The Principal Site Reliability Developer will manage Autodesk's cloud infrastructure, focusing on reliability, availability, and performance. Responsibilities include leading architecture design and implementation and automating infrastructure deployments using modern DevOps tools, while ensuring compliance and security. The role also involves operational support, monitoring, and participation in disaster recovery strategies.
The summary above was generated by AI

Job Requisition ID: 25WD85835

Job Title: Principal Site Reliability Developer

Location: Toronto, Canada (Hybrid)

Position Overview

We are seeking a highly motivated and experienced Principal Site Reliability Developer (SRE) to manage critical cloud infrastructure and site reliability operations for Autodesk's global Product Access journey. This pivotal role focuses on ensuring the highest reliability, availability, and performance of our AWS-hosted cloud infrastructure. Reporting to the Engineering Manager, you will lead the design and development of resilient, scalable architecture and innovative solutions for the platform. You will independently manage and deliver end-to-end solutions while engaging with key stakeholders and partners.

Responsibilities

  • Lead the architecture, solution design, development, and maintenance of cloud infrastructure for a microservices architecture.
  • Independently manage requirement analysis, solution design, implementation, and release planning.
  • Ensure strict adherence to trust and security compliance guidelines and standards.
  • Streamline CI/CD processes, improve system reliability, and ensure infrastructure scalability and security.
  • Automate infrastructure deployment, scaling, and management using modern DevOps tools and practices.
  • Implement and maintain configuration management and infrastructure as code (IaC) using Terraform.
  • Lead Disaster Recovery (DR) strategies, failover exercises, gamedays, and periodic maintenance activities.
  • Contribute to the remediation of critical vulnerabilities (CVEs).
  • Promote and document security and best practices across all pillars of DevOps/SRE throughout system design.
  • Provide real-time operational support and collaborate across functions to resolve system, infrastructure, and CI/CD issues.
  • Participate in on-call rotations, providing critical 24x7 support for production systems.
     

Minimum Qualifications

  • Bachelor’s degree or higher in Computer Science, Engineering, or a related field.
  • 8+ years of progressive experience in Site Reliability Engineering, DevOps, or a similar field.
  • Proficiency with managing AWS resources and understanding of networking and security protocols.
  • Expertise in infrastructure as code (IaC) and cloud automation tools such as Terraform, Serverless, and CloudFormation.
  • Expertise in defining and building CI/CD processes with tools like Jenkins, GitHub, and Artifactory.
  • Experience with container-based technologies like Docker and AWS ECS.
  • Experience with monitoring and logging tools such as Dynatrace, Grafana, DataDog, ELK Stack, and CloudWatch.
  • Experience in Linux Systems Administration, scripting, and troubleshooting in a production environment.
  • Proficiency in UNIX shell scripting and in programming languages and runtimes such as Python, Go, Bash, Groovy, and Node.js.
  • Technology Stack: Java/SpringBoot, AWS (ECS Fargate, ElastiCache, Lambda, Kinesis, DynamoDB, VPC, IAM policies, API Gateway, NLB/ALB, Route 53, CloudWatch, Kibana, OpenSearch), Kafka, GoLang, Node.js, Groovy, Python, Jenkins, GitHub, Jira, ServiceNow, and Splunk.

Preferred Qualifications

  • Knowledge of applying AI and ML solutions to engineering processes and/or DevOps automation.
  • Knowledge of standardized observability frameworks such as OpenTelemetry.
  • Relevant certifications (e.g., AWS Certified DevOps Engineer, AWS Site Reliability Engineer).
  • Broad knowledge of AWS, Redis, server programming, databases, and cloud architectures.
  • Broad knowledge of data streaming pipelines such as Kinesis, Firehose, and Kafka.
  • Knowledge of core Java and SpringBoot concepts, including JVM optimization.
  • Knowledge of build tools such as Gradle.
  • Strong interpersonal and communication skills to effectively collaborate in an Agile/Scrum-oriented environment.
  • Self-directed team player and independent contributor, demonstrating accountability and end-to-end ownership.

About Autodesk
Welcome to Autodesk! Amazing things are created every day with our software – from the greenest buildings and cleanest cars to the smartest factories and biggest hit movies. We help innovators turn their ideas into reality, transforming not only how things are made, but what can be made.

We take great pride in our culture here at Autodesk – our Culture Code is at the core of everything we do. Our values and ways of working help our people thrive and realize their potential, which leads to even better outcomes for our customers.

When you’re an Autodesker, you can be your whole, authentic self and do meaningful work that helps build a better future for all. Ready to shape the world and your future? Join us!

Salary transparency

Salary is one part of Autodesk’s competitive compensation package. Offers are based on the candidate’s experience and geographic location. In addition to base salaries, we also have a significant emphasis on discretionary annual cash bonuses, commissions for sales roles, stock or long-term incentive cash grants, and a comprehensive benefits package.

Diversity & Belonging
We take pride in cultivating a culture of belonging and an equitable workplace where everyone can thrive. Learn more here: https://www.autodesk.com/company/diversity-and-belonging

Are you an existing contractor or consultant with Autodesk?

Please search for open jobs and apply internally (not on this external site).

Autodesk Toronto, Ontario, CAN Office

661 University Ave, Toronto, ON, Canada, M5G 1M1
