PointClickCare

Intermediate Site Reliability Engineer

Posted 4 Days Ago
Hybrid
Mississauga, ON
Senior level
The Intermediate Site Reliability Engineer will enhance application reliability, lead incident responses, mentor junior engineers, and oversee implementation of SRE best practices while improving automation and system performance.
The summary above was generated by AI

PointClickCare is a leading North American healthcare technology platform enabling meaningful care collaboration and real-time patient insights. For over 20 years, the company has been focused on realizing its vision: to help create a world in which providers and plans can confidently deliver frictionless care. Since its inception, PointClickCare has grown exponentially, with over 2,200 employees working to impact millions across North America. Recognized by Forbes as one of the top 100 private cloud companies and acknowledged by Waterstone Human Capital as one of Canada’s Most Admired Corporate Cultures, PointClickCare leads the way in creating cloud-based healthcare software.

 

At PointClickCare, we offer a wealth of opportunities and a vibrant culture that empowers our employees. Our dynamic environment is the perfect place to advance your career while engaging in meaningful work alongside incredible colleagues. Here, you’ll discover a space where your talents can thrive, your career can grow, and your work will have a lasting impact on healthcare across North America. We believe that work becomes profoundly fulfilling when driven by a higher purpose.

 

Join us and be part of a team that is making a real impact.

 

To learn more about us, check out Life at PointClickCare and connect with us on Glassdoor and LinkedIn.


Role Summary:


We are seeking an Intermediate Site Reliability Engineer (SRE) with at least 5 years of experience who is passionate about enhancing application reliability, performance and security. The ideal candidate will possess strong development, engineering and architecture skills, enabling them to troubleshoot code effectively to identify and resolve application issues such as:

•Dependency Failures: API failures, database query bottlenecks, or external service timeouts.

•Resource Exhaustion: Thread pool exhaustion, memory leaks, CPU/memory spikes or spikes in response time.

•Code Issues: Race conditions, infinite loops or improper exception handling.

Additionally, you will act as a mentor and role model for the team, inspiring and guiding them towards achieving a high level of expertise in site reliability engineering. Your leadership will be key in fostering a culture of continuous improvement, enabling team members to develop the skills necessary to transition into SRE roles in the future.


Responsibilities:


•Lead and implement SRE best practices to foster a strong SRE culture. 

•Coach, mentor and develop junior team members to grow into SREs.

•Lead incident response calls to troubleshoot complex system and application-level issues. 

•Lead root cause analyses (RCAs) to capture lessons learnt and implement innovative solutions to prevent incidents from recurring.

•Design automated solutions to reduce manual tasks, enhance system reliability and reduce incident response time. 

•Implement and improve monitoring, alerting and logging utilizing tools such as ELK/Kibana, Prometheus, Grafana, AppDynamics and Datadog.

•Implement, monitor, and report on Service Level Objectives (SLOs) and Service Level Indicators (SLIs) for application services.

•Collaborate with business and product owners to establish key performance indicators (KPIs).

•Participate in technical training events, game day scenarios, and chaos engineering.

•Provide support for a wide range of applications with a focus on increasing automation, repeatability, and consistency as well as self-healing. 

•Proactively improve application and infrastructure resiliency under various error and performance conditions.

•Collaborate with security engineers to develop plans and automation for proactive response to new risks and vulnerabilities.

•Provide architectural guidance to software development teams to enhance resiliency, efficiency, performance, and cost-effectiveness.

•Implement and improve CI/CD pipelines to facilitate seamless and reliable software releases.

•Participate in an on-call rotation to respond to incidents and ensure 24/7 system availability.


Qualifications:


•Bachelor's Degree in Computer Science, Software Engineering, or related discipline.

•Prior experience as a Site Reliability Engineer (SRE) (minimum 5 years’ experience).

•Prior relevant software development/architecture/engineering/DevOps experience (minimum 5 years’ experience).

•Strong experience in building and supporting cloud-based solutions; knowledge of and experience with Azure cloud infrastructure and services preferred.

•Experience with virtualization and container solutions such as Docker and Kubernetes. 

•Familiarity with Databricks, Event Hub, Redis, Azure Service Bus, Azure Functions, and Tomcat.

•Experience with Windows-based systems and Linux administration.

•Experience with configuration management and deployment automation tools (e.g., Chef, Terraform, Puppet, Ansible, Jenkins, Spinnaker, ArgoCD, GitHub Actions). 

•Proficiency in programming languages such as Java, JavaScript and Python. 

•Working knowledge of database technologies (e.g., SQL Server, MySQL, PostgreSQL).

•Experience with monitoring and logging solutions (e.g., Prometheus, Grafana, ELK stack, AppDynamics, Datadog).

•Strong debugging and optimization skills, with the ability to automate routine tasks.

•Systematic problem-solving approach with strong communication skills and a proactive mindset.

•Knowledge of standard production practices, including change management and incident management (ITIL).

•Experience building CI/CD pipelines and implementing blue/green and zero-downtime deployment strategies.

•Troubleshooting experience with diverse hosting technologies, web servers, Java applications, operating systems, network components, and web browsers.


Nice to Have:


•Proficiency in Linux, including experience compiling kernels, tracing syscalls, and understanding TCP.

•Knowledge of open-source software and contributions to the open-source community.

•Familiarity with Rhapsody and various healthcare messaging standards, such as HL7 and FHIR. 



PointClickCare Benefits & Perks:

Benefits starting from Day 1!

Retirement Plan Matching

Flexible Paid Time Off

Wellness Support Programs and Resources

Parental & Caregiver Leaves

Fertility & Adoption Support

Continuous Development Support Program

Employee Assistance Program

Allyship and Inclusion Communities

Employee Recognition … and more!


It is the policy of PointClickCare to ensure equal employment opportunity without discrimination or harassment on the basis of race, religion, national origin, status, age, sex, sexual orientation, gender identity or expression, marital or domestic/civil partnership status, disability, veteran status, genetic information, or any other basis protected by law. PointClickCare welcomes and encourages applications from people with disabilities. Accommodations are available upon request for candidates taking part in all aspects of the selection process. Please contact recruitment@pointclickcare.com should you require any accommodations.


When you apply for a position, your information is processed and stored with Lever, in accordance with Lever’s Privacy Policy. We use this information to evaluate your candidacy for the posted position. We also store this information, and may use it in relation to future positions to which you apply, or which we believe may be relevant to you given your background. When we have no ongoing legitimate business need to process your information, we will either delete or anonymize it. If you have any questions about how PointClickCare uses or processes your information, or if you would like to ask to access, correct, or delete your information, please contact PointClickCare’s human resources team: recruitment@pointclickcare.com.


PointClickCare is committed to Information Security. By applying to this position, if hired, you commit to following our information security policies and procedures and making every effort to secure confidential and/or sensitive information.

Top Skills

Ansible
AppDynamics
ArgoCD
Azure
Chef
Datadog
Docker
ELK
GitHub Actions
Grafana
Java
JavaScript
Jenkins
Kubernetes
MySQL
PostgreSQL
Prometheus
Puppet
Python
Spinnaker
SQL Server
Terraform

HQ

PointClickCare Mississauga, Ontario, CAN Office

5570 Explorer Drive, Mississauga, Ontario, Canada
