Stripe

Software Engineer, Machine Learning Infrastructure

Reposted 24 Days Ago
In-Office
Toronto, ON, CAN
Junior
You will design and build scalable ML infrastructure services, improve productivity for ML engineers, and enhance MLOps across the company.
The summary above was generated by AI
Who we are

About Stripe

Stripe is a financial infrastructure platform for businesses. Millions of companies—from the world’s largest enterprises to the most ambitious startups—use Stripe to accept payments, grow their revenue, and accelerate new business opportunities. Our mission is to increase the GDP of the internet, and we have a staggering amount of work ahead. That means you have an unprecedented opportunity to put the global economy within everyone’s reach while doing the most important work of your career.

About the team

Stripe processes over $1T in payments volume per year, which is roughly 1% of the world’s GDP. The tremendous amount of data makes Stripe one of the best places to do machine learning. The ML Infra team builds services and tools that power every step in the ML lifecycle, including data exploration, feature generation, experimentation, training, deploying, serving ML models, and building LLM applications. With the phenomenal developments happening in the field of AI, we are positioned to accelerate the adoption of AI/ML across all parts of the company by building highly scalable and reliable foundational infrastructure.

What you’ll do

You will work closely with machine learning engineers, data scientists, and product engineering teams to enable a seamless end-to-end experience in building solutions across data, analytics, and AI/ML platforms. You will build the next generation of ML Infra services and major new capabilities that substantially improve ML development velocity and MLOps maturity across the company.

Responsibilities 
  • Designing and building scalable, reliable, and secure services for notebooks, ML model training, experimentation, serving, and LLM applications across multiple regions. 
  • Creating services and libraries that enable ML engineers at Stripe to seamlessly transition from experimentation to production across Stripe’s systems. 
  • Working directly with product teams and ML engineers to improve their day-to-day productivity. 
  • Taking ownership of and finding solutions for technical and product challenges by working with a diverse set of systems, processes, and technologies.

Who you are

We’re looking for people with a strong background or interest in building successful products or systems. You’re passionate about solving business problems and making an impact, you’re comfortable dealing with lots of moving pieces, and you’re comfortable learning new technologies and systems. You are also comfortable working with other Stripe teams across the US and Canada.

Minimum requirements
  • 2+ years of professional software development experience with a solid background in service-oriented architecture and large-scale distributed systems
  • Experience working through the full life cycle of software development, from talking to users, to design and implementation, to testing and deployment, to operations
  • Experience working on production ML platforms, MLOps solutions, or building LLM applications
  • Experience running operations for high-availability, low-latency systems
  • Experience partnering with other teams to drive business outcomes
  • A sense of pragmatism: you know when to aim for the ideal solution and when to adjust course
Preferred qualifications 
  • Experience building and shipping production AI agents
  • Familiarity with LLMs and LLM frameworks
  • Experience training and shipping machine learning models to production to solve critical business problems

Top Skills

AI Agents
Distributed Systems
LLM Applications
Machine Learning Models
MLOps
Service Oriented Architecture


