1851 Labs Jobs

Machine Learning Engineer

1851 Labs

Machine Learning Engineer

Posted 5 Days Ago

Be an Early Applicant

In-Office

Toronto, ON, CAN

Senior level

In-Office

Toronto, ON, CAN

Senior level

Build and productionize real-time ML systems for consumer AI: serve and optimize large-scale diffusion and LLM inference, improve model performance (quantization, distillation, pruning), implement personalization, ranking, and moderation, scale GPU infrastructure, run experiments and A/B tests, and own reliability, observability, and cost-performance tradeoffs.

The summary above was generated by AI

About GenTube

GenTube is a consumer AI creation platform built on a simple belief: creation should be entertainment.

Last year, people created 70M+ images on GenTube. What matters more is what’s emerging now: a small but growing group opens the app with no prompt, no goal, and stays for hours. No nudges. No incentives. That behavior is the signal we’re building around.

We’re an early, opinionated team based in Toronto, backed by top consumer AI investors and operators who’ve built at global scale.

Our ambition is straightforward and hard: build the next great consumer AI creation company for a billion people.

The Role

We’re hiring a Product ML Engineer to build the intelligence layer of GenTube.

This is not a research-only role.

And not an infra-only role.

You’ll work at the intersection of models, systems, and product — shipping ML that real users feel every day. You’ll make explicit tradeoffs between speed, quality, cost, and delight — and measure them.

If you want ownership, rigor, and real-world scale, keep reading.

What You’ll DoCore ML Infrastructure

Build inference pipelines serving millions of generations per week.Core ML Infrastructure
Design real-time and streaming inference for diffusion models, LLMs, and multimodal systems.
Optimize latency across serving, batching, caching, routing, and model selection.

Model Performance

Adapt and productionize foundation models (SD, Flux, LLMs).
Implement quantization, distillation, pruning, and compilation.
Experiment with LoRAs, ControlNets, adapters for style, control, and personalization.

Intelligence Layers

Build ranking, recommendation, and personalization systems.
Implement content understanding with embeddings, similarity search, clustering, classification.
Build moderation and safety systems that scale without killing creativity.

Production Systems

Scale GPU infrastructure from thousands to millions of daily generations.
Profile bottlenecks and optimize utilization and cost.
Run A/B tests on model variants; monitor quality, drift, and p99 latency.
Own reliability, observability, and graceful degradation.

Relentless Experimentation

Ship new model variants frequently.
Test speed vs. quality tradeoffs using real user behavior.
Close the loop: user behavior → signal → model improvement.

What We’re Looking ForCore ML Infrastructure

Build inference pipelines serving millions of generations per week.Core ML Infrastructure
Design real-time and streaming inference for diffusion models, LLMs, and multimodal systems.
Optimize latency across serving, batching, caching, routing, and model selection.

Model Performance

Adapt and productionize foundation models (SD, Flux, LLMs).
Implement quantization, distillation, pruning, and compilation.
Experiment with LoRAs, ControlNets, adapters for style, control, and personalization.

Intelligence Layers

Build ranking, recommendation, and personalization systems.
Implement content understanding with embeddings, similarity search, clustering, classification.
Build moderation and safety systems that scale without killing creativity.

Production Systems

Scale GPU infrastructure from thousands to millions of daily generations.
Profile bottlenecks and optimize utilization and cost.
Run A/B tests on model variants; monitor quality, drift, and p99 latency.
Own reliability, observability, and graceful degradation.

Relentless Experimentation

Ship new model variants frequently.
Test speed vs. quality tradeoffs using real user behavior.
Close the loop: user behavior → signal → model improvement.

Why Join

Founders have scaled consumer products to 100M+ users and led a $150M+ AI exit.
Backed by top consumer AI investors and operators.
We’re building the kind of company Canada rarely builds — consumer-first, global, culturally relevant.
Small team. High bar. No bureaucracy.
A rag-tag group of pirates in the desert.

Location: Toronto (downtown). On-site.

Comp: Competitive salary + meaningful equity.

Benefits: Health, dental, vision, unlimited PTO, creative tools & education stipend.

Taste, curiosity, and ownership matter more than pedigree.

If you want to ship ML that millions of people feel, measure what works, and push the edge of consumer AI — we want to hear from you.

Apply by sending your application to [email protected]

Similar Jobs

Block

Machine Learning Engineer

Yesterday

In-Office or Remote

Expert/Leader

Blockchain • eCommerce • Fintech • Payments • Software • Financial Services • Cryptocurrency

Lead development and production of underwriting and credit decisioning models across Cash App Borrow and Afterpay. Own full modeling lifecycle: problem formulation, feature engineering, training, calibration, experimentation, deployment, monitoring, and iteration. Build decision frameworks, agentic engineering workflows, and collaborate with cross-functional partners to align model behavior with business and regulatory goals.

Top Skills: AirflowAWSClaude CodeCopilotCursorFeature StoreGCPGitLightgbmMlflowModel Hosting PlatformNumpyPandasPrefectPythonPyTorchScikit-LearnSnowflakeSQLXgboost

Cash App

Machine Learning Engineer

2 Days Ago

Remote or Hybrid

Expert/Leader

Blockchain • Fintech • Mobile • Payments • Software • Financial Services

Senior individual contributor building and maintaining underwriting and credit decisioning ML systems for Cash App Borrow and Afterpay. Responsibilities include feature engineering, model training, calibration, experimentation, deployment, monitoring, and portfolio-level analysis. Collaborate with cross-functional teams to align models with business and regulatory goals and develop AI-native engineering workflows and governance for reliable, auditable model development.

Top Skills: AirflowAWSClaude CodeCopilotCursorGCPGitInternal Feature StoreLightgbmMlflowModel Hosting PlatformNumpyPandasPrefectPythonPyTorchScikit-LearnSnowflakeSQLXgboost

Block

Machine Learning Engineer

12 Days Ago

In-Office or Remote

Expert/Leader

Blockchain • eCommerce • Fintech • Payments • Software • Financial Services • Cryptocurrency

Design, build, and operate production ML decision systems to detect and prevent payment fraud, account takeover, scams, and other abuse. Integrate diverse signals into low-latency serving and batch scoring, own feature pipelines and model lifecycle, develop AI-assisted triage and feedback loops, and partner cross-functionally to balance fraud reduction with legitimate customer access.

Top Skills: Cloud InfrastructureData LakehouseData WarehouseEmbeddingsFeature StoreJavaKafkaKotlinKubernetesLightgbmModel ServingMonitoringObservabilityPythonPyTorchSQLTensorFlowWorkflow OrchestrationXgboost

What you need to know about the Toronto Tech Scene

Although home to some of the biggest names in tech, including Google, Microsoft and Amazon, Toronto has established itself as one of the largest startup ecosystems in the world. And with over 2,000 startups — more than 30 percent of the country's total startups — Toronto continues to attract new businesses. Be it helping entrepreneurs manage their finances, simplifying business operations by automating payroll or assisting pharmaceutical companies in launching new drugs, the city's tech scene is just getting started.