Faire Logo

Faire

Staff Machine Learning Platform Engineer

Posted 8 Hours Ago
Be an Early Applicant
In-Office
2 Locations
Senior level
In-Office
2 Locations
Senior level
Design and operate a scalable ML platform for model training, deployment, and governance while optimizing performance and supporting data scientists in productionizing workflows.
The summary above was generated by AI

About Faire

Faire is a technology wholesale platform built on the belief that the future is local. Independent retailers around the globe collectively represent a multi-hundred-billion-dollar wholesale market that has historically been fragmented and offline. At Faire, we're using the power of tech, data, and machine learning to connect this thriving community of entrepreneurs across the globe. Picture your favorite boutique in town — we help them discover the best products from around the world to sell in their stores. With the right tools and insights, we believe that we can level the playing field so businesses can grow and local communities can thrive.

We’re looking for smart, resourceful and passionate people to join us as we power the shop local movement. If you believe in community, come join ours.

About this role

As a Staff Machine Learning Platform Engineer, you will help design, improve, and operate a scalable ML platform to accelerate model training, deployment, and governance. You are the technical bridge between data science and production engineering.  You’ll be joining a small but deeply critical team that scales Faire’s ability to support tens of thousands of local businesses in a constantly narrowing retail landscape.

What You Will Do

  • Design and operate ML infrastructure, including workspaces, clusters, jobs, and workflows
  • Productionize ML workloads using Spark, Delta Lake, MLflow, and Databricks Workflows
  • Teach data scientists how to utilize our ML platform to advance development from notebook to production for our most critical models
  • Implement Unity Catalog for data governance, lineage, access control, and secure multi-tenant usage
  • Build CI/CD pipelines for ML using Terraform and Git-based workflows (e.g., GitHub Actions)
  • Optimize performance, reliability, and cost across training and inference workloads
  • Configure Identity and Access Management (IAM) and Role Based Authentication Controls (RBAC) for sensitive data sets
  • Establish observability for data quality, model performance, and platform health
  • Build and maintain ML Platform technical documentation

What it takes

  • 8+ years of experience building production ML or data platforms
  • A degree (preferably graduate level) in Computer Science, Engineering, Statistics, or a related technical field
  • Strong hands-on expertise with Databricks, Spark, Delta Lake, and MLflow.
  • Proficiency in Python, SQL, and distributed systems concepts
  • Experience with cloud platforms and infrastructure-as-code
  • Solid understanding of MLOps best practices: CI/CD, monitoring, reproducibility, and security
  • Experience supporting multiple ML teams in a shared platform environment
  • Are an active owner of orphaned problems and are willing to assimilate whatever knowledge you’re missing to get the job done

Tech Stack

Faire uses a modern cloud based tech stack.  For this role, you’ll want to be proficient with the following:

Category

Technologies

Languages

Python, SQL, Kotlin

ML Frameworks

PyTorch, MLFlow 

Big Data & Processing

Spark, Kafka, Databricks, Snowflake, Fivetran, Iceberg, Unity Catalog, Datadog, Airflow, Cockroach DB, MySQL

Cloud & Infrastructure

AWS, S3, SageMaker, Kubernetes, Docker, GitHub Actions, Terraform

Generative AI

Claude Sonnet 4.5, ChatGPT 5.2

Salary Range

Canada: the pay range for this role is $216,000 to $297,000 per year. 

This role will also be eligible for equity and benefits. Actual base pay will be determined based on permissible factors such as transferable skills, work experience, market demands, and primary work location. The base pay range provided is subject to change and may be modified in the future.

Faire uses Artificial Intelligence (AI) to screen and select applicants for this position.

This job posting is for an existing vacancy.

Hybrid Faire employees currently go into the office 3 days per week on Tuesdays, Thursdays, and a third flex day of their choosing (Monday, Wednesday, or Friday). Additionally, hybrid in-office roles will have the flexibility to work remotely up to 4 weeks per year. Specific Workplace and Information Technology positions may require onsite attendance 5 days per week as will be indicated in the job posting. 

Why you’ll love working at Faire

  • Move fast: You'll own meaningful problems that serve customers around the globe with the agency to move fast and see your results clearly.
  • Equipped to scale: We invest in what matters, including the latest enterprise AI tools, to help you work smarter and get more out of every day.
  • Best in class: Our team is full of sharp, kind, and generous colleagues who care about their craft and about helping you grow in yours.
  • Real rewards. Competitive pay, equity, and comprehensive benefits designed to support your life inside and outside of work.
  • Belonging: We're intentional about building an environment where every Faire employee has equal access to opportunities, growth, and success.

Faire was founded in 2017 by a team of early product and engineering leads from Square. We’re backed by some of the top investors in retail and tech including: Y Combinator, Lightspeed Venture Partners, Forerunner Ventures, Khosla Ventures, Sequoia Capital, Founders Fund, and DST Global. We have headquarters in San Francisco and Kitchener-Waterloo, and a global employee presence across offices in Toronto, London, and New York. To learn more about Faire and our customers, you can read more on our blog.

Faire provides equal employment opportunities (EEO) to all employees and applicants for employment without regard to race, color, religion, sex, national origin, age, disability, genetics, sexual orientation, gender identity or gender expression.

Faire is committed to providing access, equal opportunity and reasonable accommodation for individuals with disabilities in employment, its services, programs, and activities. Accommodations are available throughout the recruitment process and applicants with a disability may request to be accommodated throughout the recruitment process. We will work with all applicants to accommodate their individual accessibility needs.  To request reasonable accommodation, please fill out our Accommodation Request Form (https://bit.ly/faire-form)

Privacy

For information about the type of personal data Faire collects from applicants, as well as your choices regarding the data collected about you, please visit Faire’s Privacy Notice (https://www.faire.com/privacy)

Faire Kitchener, Ontario, CAN Office

260 King Street West, 205, Kitchener, Ontario, Canada, N2G 1B6

Faire Toronto, Ontario, CAN Office

Toronto, Ontario, Canada

Faire Waterloo, Ontario, CAN Office

85 Willis Way, Waterloo, ON, Canada, N2J 0B9

Similar Jobs

5 Days Ago
Hybrid
Toronto, ON, CAN
Expert/Leader
Expert/Leader
Gaming • Esports
The Staff Platform Engineer will design and scale the unified data and ML platform, ensuring best practices for data ingestion, storage, and real-time serving. Responsibilities include optimizing workflows, overseeing platform components, and providing technical leadership across teams to enhance decision latency and data reliability.
Top Skills: DbtGoKafkaMlflowPythonScalaSpark
16 Days Ago
Easy Apply
Remote or Hybrid
Ontario, ON, CAN
Easy Apply
Senior level
Senior level
Artificial Intelligence • Machine Learning • Retail • Social Impact • Software
The Staff Software Engineer will enhance the ML platform's performance and scalability, collaborating with teams to support predictive modeling and real-time inference capabilities.
Top Skills: AirflowDatabricksNumpyPandasPysparkPythonTorch
3 Days Ago
Hybrid
Toronto, ON, CAN
Senior level
Senior level
Gaming • Esports
The Senior ML Platform Engineer will develop scalable machine learning solutions, manage ML infrastructure, and ensure observability in ML workflows while collaborating with cross-functional teams.
Top Skills: AirflowAws SagemakerGoJavaPythonPyTorchRedisScikit-LearnTensorFlowTerraformXgboost

What you need to know about the Toronto Tech Scene

Although home to some of the biggest names in tech, including Google, Microsoft and Amazon, Toronto has established itself as one of the largest startup ecosystems in the world. And with over 2,000 startups — more than 30 percent of the country's total startups — Toronto continues to attract new businesses. Be it helping entrepreneurs manage their finances, simplifying business operations by automating payroll or assisting pharmaceutical companies in launching new drugs, the city's tech scene is just getting started.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account