Autodesk

Senior Site Reliability Engineer

In-Office
Toronto, ON
Senior level
The Senior Site Reliability Engineer manages AWS infrastructure, ensuring reliability and performance. Responsibilities include architecture, cloud automation, CI/CD processes, and operational support.
The summary above was generated by AI

Job Requisition ID #

25WD92369

Position Overview

We are seeking a highly motivated and experienced Senior Site Reliability Engineer (SRE) to manage critical cloud infrastructure and site reliability operations for Autodesk's global Product Access journey. This pivotal role focuses on ensuring the highest reliability, availability, and performance of our AWS-hosted cloud infrastructure. Reporting to the Engineering Manager, you will lead the design and development of resilient and scalable architecture and innovative solutions for the platform. You will independently manage and deliver end-to-end solutions while engaging with key stakeholders and partners.

Responsibilities

  • Lead architecture, solution design, development, and maintenance of cloud infrastructure for a microservices architecture

  • Independently manage requirement analysis, solution design, implementation, and release planning

  • Ensure high adherence to trust and security compliance, guidelines and standards

  • Streamline CI/CD processes, improve system reliability, and ensure infrastructure scalability and security

  • Automate infrastructure deployment, scaling, and management using modern DevOps tools and practices

  • Implement and maintain configuration management and infrastructure as code (IaC) using Terraform

  • Lead Disaster Recovery (DR) strategies, failover exercises, gamedays, and periodic maintenance activities

  • Contribute to remediation efforts for critical vulnerabilities (CVEs)

  • Promote and document security and best practices across all pillars of DevOps/SRE throughout system design

  • Provide real-time operational support and collaborate across functions to resolve system, infrastructure, and CI/CD issues

  • Participate in on-call rotations, providing critical 24x7 support for production systems

Minimum Qualifications

  • Bachelor’s degree or higher in Computer Science, Engineering, or a related field

  • 5+ years of progressive experience in Site Reliability Engineering, DevOps, or a similar field

  • Proficiency with managing AWS resources and understanding of networking and security protocols

  • Expertise in infrastructure as code (IaC) and cloud automation tools such as Terraform, Serverless, and CloudFormation

  • Expertise in defining and building CI/CD processes with tools like Jenkins, GitHub, and Artifactory

  • Experience with container-based technologies like Docker and AWS ECS

  • Experience with monitoring and logging tools such as Dynatrace, Grafana, DataDog, ELK Stack, and CloudWatch

  • Experience in Linux Systems Administration, scripting, and troubleshooting in a production environment

  • Proficiency in UNIX shell scripting and programming languages such as Python, Go, Bash, Groovy, and Node.js

  • Technology Stack: Java/Spring Boot, AWS (ECS Fargate, ElastiCache, Lambda, Kinesis, DynamoDB, VPC, IAM policies, API Gateway, NLB/ALB, Route 53, CloudWatch, Kibana, OpenSearch), Kafka, Go, Node.js, Groovy, Python, Jenkins, GitHub, Jira, ServiceNow, and Splunk

Preferred Qualifications

  • Knowledge of applying AI and ML solutions to engineering processes and/or DevOps automation

  • Knowledge of standardized observability frameworks such as OpenTelemetry

  • Relevant certifications (e.g., AWS Certified DevOps Engineer, AWS Site Reliability Engineer)

  • Broad knowledge of AWS, Redis, server programming, databases, and cloud architectures

  • Broad knowledge of data streaming pipelines such as Kinesis, Firehose, and Kafka

  • Knowledge of core Java and Spring Boot concepts, including JVM optimization

  • Knowledge of build tools such as Gradle

  • Strong interpersonal and communication skills to effectively collaborate in an Agile/Scrum-oriented environment

  • Self-directed team player and independent contributor, demonstrating accountability and end-to-end ownership


About Autodesk

Welcome to Autodesk! Amazing things are created every day with our software – from the greenest buildings and cleanest cars to the smartest factories and biggest hit movies. We help innovators turn their ideas into reality, transforming not only how things are made, but what can be made.

We take great pride in our culture here at Autodesk – it’s at the core of everything we do. Our culture guides the way we work and treat each other, informs how we connect with customers and partners, and defines how we show up in the world.

When you’re an Autodesker, you can do meaningful work that helps build a better world designed and made for all. Ready to shape the world and your future? Join us!

Salary transparency

Salary is one part of Autodesk’s competitive compensation package. Offers are based on the candidate’s experience and geographic location. In addition to base salaries, our compensation package may include annual cash bonuses, commissions for sales roles, stock grants, and a comprehensive benefits package.

Diversity & Belonging
We take pride in cultivating a culture of belonging where everyone can thrive. Learn more here: https://www.autodesk.com/company/diversity-and-belonging

Are you an existing contractor or consultant with Autodesk?

Please search for open jobs and apply internally (not on this external site).

Top Skills

AWS
Bash
CloudFormation
Datadog
Docker
ELK Stack
Git
Go
Grafana
Groovy
Java
Jenkins
Kafka
Node.js
Python
ServiceNow
Splunk
Spring Boot
Terraform
Unix

Autodesk Toronto, Ontario, CAN Office

661 University Ave, Toronto, ON, Canada, M5G 1M1
