Flinks

Senior Reliability Engineer

Sorry, this job was removed at 06:01 p.m. (EST) on Thursday, Apr 24, 2025
Remote
2 Locations

Description

About Flinks 

Flinks is where financial data moves—with purpose, trust, and impact.

We’re on a mission to simplify access to financial data and help businesses build better, faster, and more secure financial products and experiences. Since 2016, we’ve been bridging the gap between fintechs, financial institutions, and consumers by enabling seamless, secure data connectivity.

From instant account funding to smarter lending, our solutions help power some of the most innovative financial products in North America. We partner with lenders, banks, and fintechs to streamline onboarding, prevent fraud, and fuel real-time decision-making with enriched, reliable data.

As pioneers in Canada’s open banking movement, we're not waiting for the future—we're building it. If you're bold, curious, and ready to help shape the future of finance, we’d love to meet you.

About the Reliability Team 🚒

As a Senior Reliability Engineer, you will play a pivotal role in ensuring the stability, performance, and reliability of Flinks' fintech product platforms and their monitoring and alerting systems. You will serve as an expert in both software development and system support, working closely with engineering, operations, and product teams to troubleshoot complex issues, resolve incidents, and continuously improve the technical foundation of our products. This role demands a combination of advanced coding skills, incident management experience, and an understanding of the fintech industry.

What You’ll Do

  • Develop and maintain code to quickly resolve product issues, ensuring fast recovery and long-term system stability.
  • Provide live operational support across multiple client applications, monitoring services and alerts to detect and resolve critical failures with minimal downtime.
  • Own and troubleshoot complex incidents, conduct root cause analyses, and implement long-term solutions—adhering to SLAs and internal SLOs.
  • Build monitoring dashboards and alerting systems to proactively detect and address issues, supporting system scalability and stability.
  • Analyze operational metrics and KPIs to identify trends, surface client pain points, and drive improvements.
  • Automate tooling and processes to improve efficiency and reduce manual work across LiveOps.
  • Collaborate with cross-functional teams to deliver lasting fixes for production issues and contribute to technical analyses of product gaps.
  • Lead and mentor reliability engineers, providing guidance and ensuring consistent delivery of high-quality work.
  • Participate in post-incident reviews, documenting outcomes and driving preventative action items.
  • Support after-hours on-call coverage as part of the LiveOps rotation.

Who You Are 💪

  • 5+ years of experience with .NET Framework (C#), with a focus on production system stability
  • Strong coding, debugging, and troubleshooting skills, particularly in performance optimization of large-scale applications
  • Operationally focused with expertise in incident management and resolving live production issues
  • Proven experience in building and maintaining reliable monitoring and alerting systems in high-demand environments, with a focus on production support
  • Strong knowledge of Kubernetes, Docker, and cloud platforms (GCP preferred)
  • Proficiency with monitoring tools like Prometheus, Grafana, and Kibana
  • Experience with incident ticketing/documentation tools like FreshDesk and Confluence
  • Critical thinker who can identify system weaknesses and find innovative solutions
  • Strong project management skills with a focus on scalability and system stability

Nice to haves

  • ITIL Service Management certification (such as ITIL v3 or ITIL v4) or an equivalent certification is highly desired
  • Experience with PowerBI, web scraping, or Golang

The Interview Process 🏗

  1. Head of People Ops
  2. Case Assignment & Presentation
  3. Team Lead Interview
  4. Director Interview

Flinks Toronto Office

119 Spadina Ave, Toronto, Ontario, Canada, M5V 2K2

What you need to know about the Toronto Tech Scene

Although home to some of the biggest names in tech, including Google, Microsoft and Amazon, Toronto has established itself as one of the largest startup ecosystems in the world. And with over 2,000 startups — more than 30 percent of the country's total startups — Toronto continues to attract new businesses. Be it helping entrepreneurs manage their finances, simplifying business operations by automating payroll or assisting pharmaceutical companies in launching new drugs, the city's tech scene is just getting started.
