Build and operate MLOps and agentic infrastructure: manage model registries, continuous training loops, and A/B testing; deploy agents as scalable Kubernetes microservices; and create observability dashboards tracking token usage, latency, and agent reasoning.
We are seeking a skilled MLOps & Agentic Platform Engineer. This role involves managing model registries, developing continuous training loops, and implementing A/B testing infrastructure. The ideal candidate will have a strong DevOps/MLOps background and be adept at deploying scalable microservices and building observability dashboards.
Responsibilities:
- Manage model registries, continuous training loops, and A/B testing infrastructure.
- Deploy agents as scalable microservices on Kubernetes.
- Build observability dashboards to track token usage, latency, and agent reasoning paths.
Qualifications:
- Strong DevOps/MLOps background (Kubernetes, Docker, Terraform).
- Experience with MLflow, Weights & Biases, or LangSmith.
- Knowledge of building scalable microservice architectures.


