Senior/Staff/Principal Engineer – Edge AI LLM 9291

Posted 6 Days Ago
Toronto, ON
Hybrid
5-7 Years Experience
Software
The Role
Seeking a talented Edge AI Principal Engineer with expertise in GPU/TPU acceleration and local Large Language Model (LLM) inference. Responsibilities include high-level design, AI model optimization, team leadership, and collaboration on edge computing platforms.
Summary Generated by Built In

We are seeking a talented Senior/Staff/Principal Engineer with specialized expertise in GPU/TPU acceleration to join our team. The ideal candidate will have extensive hands-on experience in local Large Language Model (LLM) inference on embedded GPU/TPU architectures. As a Principal Engineer specializing in Edge AI, you will play a crucial role in shaping our future Edge AI solution, leveraging the power of GPU/TPU acceleration and enterprise-grade, large-scale edge compute.

The successful candidate will combine technical excellence with effective leadership, creating a positive impact on both projects and team dynamics.

Key Responsibilities:

  • High-Level Design and Architecture
  • Influence the Edge AI strategy by providing expert advice on design and architecture.
  • Make critical decisions regarding technical directions, scalability, and system performance.
  • Develop and optimize AI inference models for deployment on edge devices with embedded GPU/TPU accelerators, focusing on local Large Language Model (LLM) inference.
  • Implement and fine-tune low-latency model inference pipelines to meet real-time performance requirements.
  • Collaborate with cross-functional teams to integrate AI inference solutions into edge computing platforms and applications.
  • Collaborate with the GPU Hardware Design Team to design and optimize GPUs that power next-generation devices.
  • Conduct performance profiling and optimization to maximize the efficiency of GPU/TPU acceleration for local LLM inference.
  • Work on micro-architecture development, ensuring efficient execution of graphics, compute, and AI workloads within energy and area constraints.
  • Stay current with advancements in GPU/TPU technologies and edge AI frameworks, incorporating them into solution designs as appropriate.
  • Provide technical expertise and support to project teams, ensuring successful implementation and deployment of edge AI solutions.

Team Leadership:

  • Lead and inspire a team of engineers, providing guidance, setting goals, and ensuring collaboration.
  • Oversee project planning, execution, and delivery, ensuring alignment with business objectives.
  • Manage all phases of technical projects, from conception to completion.
  • Develop project specifications, track progress, and control costs.
  • Foster a positive work environment, encouraging professional growth and knowledge sharing.

Qualifications:

  • Bachelor’s degree in Computer Science, Engineering, or a related field; Master’s degree preferred.
  • 5+ years of hands-on experience in AI model development and deployment, with a focus on edge computing and local LLM inference.
  • Strong programming skills in languages such as Python and C++.
  • Proficiency in LLM frameworks (e.g., vLLM, Text Generation Inference, OpenLLM, Ray Serve, and Hugging Face Transformers) and deep learning libraries.
  • Extensive experience with GPU/TPU acceleration for AI inference, including optimization techniques (tensor, pipeline, data, and sharded data parallelism) and performance tuning.
  • Hands-on experience with one or more GPU frameworks: CUDA, Vulkan, OpenCL.
  • Deep knowledge of GPU memory layout and familiarity with NVIDIA Jetson, ARM Mali, or relevant SoC configurations.
  • Knowledge of parallel computation, memory scheduling, and structural optimization.
  • Excellent problem-solving and analytical skills, with a passion for innovation and continuous learning.

Additional Skills (Preferred):

  • Experience with edge device hardware and software integration.
  • Familiarity with edge computing architectures and IoT platforms.
  • Experience with edge AI applications in domains such as robotics, autonomous vehicles, or industrial automation.

If you are a skilled Edge AI Engineer with a passion for pushing the boundaries of edge computing and GPU/TPU acceleration, particularly in local LLM inference, we want to hear from you! Join us in shaping the future of AI at the edge and revolutionizing industries with innovative edge AI solutions. Apply now to be part of our dynamic and collaborative team!

Top Skills

C++
Python
The Company
Markham, Ontario
3,661 Employees
On-site Workplace
Year Founded: 1996

What We Do

Extreme Networks, Inc. (EXTR) is a leader in cloud networking focused on delivering services that connect devices, applications, and people in new ways. We push the boundaries of technology leveraging the powers of machine learning, artificial intelligence, analytics, and automation. Over 50,000 customers globally trust our end-to-end, cloud-driven networking solutions and rely on our top-rated services and support to accelerate their digital transformation efforts and deliver progress like never before. For more information, visit Extreme's website or follow us on Twitter, LinkedIn, and Facebook.
