EXL Logo

EXL

Assistant Vice President-Senior Gen AI Engineer

Posted Yesterday
Be an Early Applicant
Remote or Hybrid
Hiring Remotely in Canada
Senior level
Remote or Hybrid
Hiring Remotely in Canada
Senior level
Design, build, and operate production-grade Generative and Agentic AI applications and services. Develop Python backend services, integrate cloud-hosted LLM APIs, implement RAG pipelines and vector search, build multi-agent systems, deploy on major cloud providers, ensure observability, security, and performance, and mentor junior engineers.
The summary above was generated by AI

Background 

We are looking for a Staff Generative AI Engineer to design, build, and ship production-grade Generative AI and Agentic AI applications that delivery business value across the organization. This role is focused on building AI applications and services at scale. You will be responsible for building robust, secure, and highly scalable systems that integrate with leading cloud-based AI services.  

As a senior individual contributor, you will bring deep technical expertise, drive high-quality engineering practices, and serve as a mentor for junior to mid-level engineers. You will work closely with software engineers, data scientists, and product teams to translate business problems into production-grade AI applications and services. 

Responsibilities

Key Responsibilities 

Application and Solution Engineering 

  • Design, build, and ship production-grade Generative and Agentic AI applications and services for internal and external users 
  • Develop high-quality backend services in Python, with strong software engineering rigor around testing, performance, and maintainability 
  • Champion reusability and abstraction in everything you build by designing and building modular, well-abstracted components and libraries  
  • Build multi-agent systems using frameworks such as LangChain, LangGraph, Claude Agent SDK and Google ADK 
  • Integrate with leading LLM and foundation model APIs, including Azure OpenAI, Google Vertex AI, and AWS Bedrock 
  • Design and implement Retrieval-Augmented Generation (RAG) pipelines, including document ingestion, chunking strategies, embeddings, vector search, and re-ranking 
  • Build clean, well-tested RESTful and/or gRPC APIs with a strong focus on reliability, security, and performance 
  • Implement observability, tracing, evaluation, guardrails for Generative and Agentic AI applications 
  • Deploy and operate services on major cloud providers (e.g., GCP, AWS, and Azure) leveraging managed services 
  • Contribute to platform architecture decisions and engineering best practices 
  • Take applications from prototype through production deployment, hardening, and ongoing operation 

Mentorship and Team Development 

  • Mentor and coach junior and mid-level engineers through code reviews, architecture discussions, and pair programming 
  • Foster a culture of engineering excellence, knowledge sharing, and continuous improvement 
  • Participate in technical design reviews and contribute to the professional growth of team members 
Qualifications

Required Qualifications  

  • 10-15 years of professional software engineering experience with at least 3-5 years of experience building AI/ML software products 
  • Bachelor’s degree in Computer Science or a related field (Master’s degree preferred) 
  • Strong proficiency in Python, with deep software engineering fundamentals (abstraction, modularity, system design, testing, performance) 
  • Hands-on experience building and shipping Generative and Agentic AI applications, including LLM integration, prompt engineering, and/or agentic workflows 
  • Practical experience integrating cloud-hosted LLM APIs such as Azure OpenAI, Vertex AI, and/or AWS Bedrock 
  • Experience with agent frameworks (e.g., LangChain, LangGraph, Google ADK, Claude Agent SDK) and vector databases (e.g., Pinecone, Weaviate, pgvector, Open Search, AlloyDB) 
  • Hands-on experience with Google Cloud Platform (GCP), Amazon Web Services (AWS), or Azure 
  • Strong understanding of API design, distributed systems, and cloud-native architecture 
  • Proven track record of taking systems from design through production deployment and operation  

 

Preferred Qualifications  

  • Experience with containerization and orchestration (Docker, Kubernetes) 
  • Knowledge of Generative AI Risk Management frameworks (NIST RFM) 
  • Experience supporting developer platforms or internal tooling 
  • Experience writing design documents or helping define engineering standards 

What you need to know about the Toronto Tech Scene

Although home to some of the biggest names in tech, including Google, Microsoft and Amazon, Toronto has established itself as one of the largest startup ecosystems in the world. And with over 2,000 startups — more than 30 percent of the country's total startups — Toronto continues to attract new businesses. Be it helping entrepreneurs manage their finances, simplifying business operations by automating payroll or assisting pharmaceutical companies in launching new drugs, the city's tech scene is just getting started.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account