Agentic RL Researcher – Distributed Computing

Huawei Canada

Reposted 12 Days Ago

In-Office

Markham, ON, CAN

Mid level

Design and develop advanced RL algorithms, build simulation platforms, optimize multi-agent learning on distributed clusters, and translate research into production systems.

The summary above was generated by AI

Huawei Canada has an immediate permanent opening for a Researcher.

About the team:

The Distributed Data Storage and Management Lab leads research in distributed data systems, aiming to develop next-generation cloud serverless products that encompass core infrastructure and databases. The lab addresses data challenges including cloud-native disaggregated databases, pay-by-query user models, and the optimization of low-level data transfers via RDMA. Teams within the lab create advanced cloud serverless data infrastructure and implement cutting-edge networking technologies for Huawei's global AI infrastructure.

About the job:

Design and develop advanced Agentic Reinforcement Learning (RL) and Multi-Agent Reinforcement Learning (MARL) algorithms for cooperative, competitive, and mixed-agent environments, including centralized training with decentralized execution (CTDE), decentralized learning, and hierarchical agent systems.
Build scalable simulation and training platforms for large-scale agent systems, supporting self-play, population-based training, curriculum learning, and emergent behavior analysis.
Optimize multi-agent learning performance on distributed compute clusters, improving sample efficiency, credit assignment, agent coordination, communication learning, and training stability.
Research and prototype new approaches for multi-agent intelligence, including communication protocols, credit assignment, game-theoretic learning dynamics, meta-learning, and adaptive agent populations.
Translate cutting-edge research in agentic AI and MARL into production-ready systems for real-world or high-fidelity simulated environments.
Develop benchmarking frameworks and evaluation metrics for agent coordination, robustness, scalability, and safety.
Collaborate with research, infrastructure, and product teams to deploy scalable agentic learning systems in real-world applications.
Contribute to technical leadership and innovation through publications, patents, open-source contributions, and conference presentations.

The total target annual compensation for this position ranges from $106,000 to $156,000 depending on education, experience, and demonstrated expertise.

About the ideal candidate:

MS or PhD in Computer Science, Electrical Engineering, or a related field, with a focus on Reinforcement Learning, Multi-Agent Systems, Agentic AI, or Distributed AI.
Strong expertise in reinforcement learning algorithms, particularly in multi-agent settings (e.g., policy gradients, value-based methods, CTDE, credit assignment, and coordination in non-stationary environments).
Solid foundations in optimization, probability, and game theory, with the ability to design and analyze complex learning systems.
Experience building scalable RL training infrastructure, including distributed rollouts, large-scale simulation, and experiment pipelines.
Strong programming skills in Python and/or C++, with experience developing high-performance or distributed ML systems.
Demonstrated impact through research publications, open-source contributions, patents, or production ML systems in reinforcement learning, multi-agent learning, or large-scale AI systems.

Additional Information:

Huawei Canada is committed to a fair, inclusive, and accessible recruitment process. If you require accommodation during any stage of the hiring process, please let us know and we will work with you to meet your needs.

All applications for this position are reviewed directly by our hiring team; we do not use artificial intelligence tools to screen or select candidates.

19 Allstate Pky, Markham, Ontario, Canada, L3R 5A4
