Huawei Canada Logo

Huawei Canada

AI Systems Engineer – Serverless Distributed Computing

Posted 3 Days Ago
Be an Early Applicant
In-Office
Markham, ON, CAN
Mid level
In-Office
Markham, ON, CAN
Mid level
Develop frameworks for serverless computing optimized for AI workloads, analyze AI system performance, and evaluate emerging technologies.
The summary above was generated by AI

Huawei Canada has an immediate permanent opening for a Software Engineer.
About the team: 

The Distributed Data Storage and Management Lab leads research in distributed data systems, aiming to develop next-generation cloud serverless products that encompass core infrastructure and databases. This lab addresses various data challenges, including cloud-native disaggregated databases, pay-by-query user models, and optimizing low-level data transfers via RDMA. Teams within this lab create advanced cloud serverless data infrastructure and implement cutting-edge networking technologies for Huawei's global AI infrastructure.


About the job:

  • Architect and develop frameworks and engines for next-generation serverless computing tailored to AI workloads (LLM training/inference, agent execution, RL training, etc.).

  • Analyze and optimize end-to-end AI system performance, including distributed scheduling, data flow, and memory utilization across large clusters.

  • Research and evaluate cutting-edge technologies in distributed computing, serverless infrastructure, reinforcement learning, and LLM-based AI agents.

  • Collaborate cross-functionally with research, product, and platform teams to transform conceptual AI agent or RL research into scalable production systems.

  • Contribute thought leadership through innovation, technical presentations, and patent generation.

  • Stay ahead of industry trends, assessing emerging tools and frameworks (e.g., Ray, SkyPilot, vLLM, DeepSpeed, Mojo, etc.) to inform team.

The total target annual compensation for this position ranges from $127,000 to $225,000 depending on education, experience, and demonstrated expertise.

About the ideal candidate:

  • PhD with research background in LLM systems, RL, AI agents, or distributed computing, or MS in Computer Science, Electrical Engineering, or related field with 3–4 years of AI industry experience.

  • Strong system design and software engineering skills, including experience with C++ or Python, concurrency, performance tuning, and large-scale distributed systems.

  • Proven expertise in one or more of the following areas:

    o   AI system architecture — LLM training/inference pipeline optimization, multi-agent orchestration, or reinforcement learning frameworks.

    o   Serverless / distributed infrastructure — autoscaling, resource scheduling, fault recovery, or cloud-native microservices.

  • Ability to lead complex technical projects, mentor peers, and deliver solutions with measurable impact.

  • Publications, open-source contributions, or patents in AI systems, RL, or distributed computing is an asset.

  • Familiarity with GPU cluster management, model parallelism, or memory-optimized inference (e.g., KVCache, offloading strategies) is an asset.

  • Demonstrated ability to bridge research and engineering, bringing experimental AI methods into production-grade systems is an asset.

Additional Information:

Huawei Canada is committed to a fair, inclusive, and accessible recruitment process. If you require accommodation during any stage of the hiring process, please let us know and we will work with you to meet your needs.

All applications for this position are reviewed directly by our hiring team, we do not use artificial intelligence tools to screen or select candidates.

Top Skills

C++
Deepspeed
Mojo
Python
Ray
Rdma
Skypilot
Vllm
HQ

Huawei Canada Markham, Ontario, CAN Office

19 Allstate Pky, Markham, Ontario, Canada, L3R 5A4

Similar Jobs

35 Minutes Ago
Hybrid
Toronto, ON, CAN
Mid level
Mid level
Big Data • Food • Hardware • Machine Learning • Retail • Automation • Manufacturing
As the Manager of Talent, you will lead talent strategies, including succession planning, talent assessments, and development initiatives for the Canada business unit, while acting as a strategic advisor and driving engagement and inclusion.
Top Skills: Employee Experience PlatformsGlintHr Information SystemsWorkday
35 Minutes Ago
Hybrid
Toronto, ON, CAN
Junior
Junior
Big Data • Food • Hardware • Machine Learning • Retail • Automation • Manufacturing
As an Associate Engineer II, you'll support packaging design, conduct trials, and manage project-related activities, ensuring quality and consumer satisfaction.
Top Skills: Chemical EngineeringLean Six SigmaMechanical EngineeringMinitabPackaging Science
35 Minutes Ago
Remote or Hybrid
East York, ON, CAN
Mid level
Mid level
Big Data • Food • Hardware • Machine Learning • Retail • Automation • Manufacturing
Analyze financial data, perform budgeting and forecasting, and provide insights to support strategic initiatives. Lead monthly close activities and collaborate with cross-functional teams to improve financial performance and ensure compliance.
Top Skills: Advanced ExcelPower BISAP

What you need to know about the Toronto Tech Scene

Although home to some of the biggest names in tech, including Google, Microsoft and Amazon, Toronto has established itself as one of the largest startup ecosystems in the world. And with over 2,000 startups — more than 30 percent of the country's total startups — Toronto continues to attract new businesses. Be it helping entrepreneurs manage their finances, simplifying business operations by automating payroll or assisting pharmaceutical companies in launching new drugs, the city's tech scene is just getting started.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account