Level AI Jobs

Research Intern – Reinforcement Learning (RL) - Onsite

Level AI

Research Intern – Reinforcement Learning (RL) - Onsite

Reposted 8 Days Ago

In-Office or Remote

Hiring Remotely in CA

Internship

In-Office or Remote

Hiring Remotely in CA

Internship

As a Research Intern, you'll build reinforcement learning environments and agents, define reward models using real-world data, and collaborate on deploying learning systems.

The summary above was generated by AI

🚀 Build the next generation of Agentic AI with us

Our platform combines conversation intelligence, multimodal understanding, and agentic AI systems to power both human agents and autonomous AI agents across the entire customer experience lifecycle.

A core part of this vision is our investment in custom Small Language Models (SLMs)—purpose-built for CX workflows—paired with reinforcement learning systems that continuously improve decision-making in real-world environments.

We’re looking for a Research Intern (Reinforcement Learning) to join us in shaping this future.

What you’ll do

Design and build reinforcement learning environments that model real-world customer interaction workflows.
Design RL agents that learn from these environments using real-world interaction data, rewards, and feedback loops
Define reward models and feedback loops using real-world signals (outcomes and human feedback)
Enable learning from production data by structuring interaction traces into training-ready datasets for offline and online learning
Experiment with multi-agent systems and simulation frameworks for complex coordination and decision-making
Collaborate with engineering and product teams to deploy, evaluate, and iterate on learning systems in production at scale.

What we’re looking for

Currently pursuing (or recently completed) a degree in Computer Science, AI, Machine Learning, or related field
Strong understanding of reinforcement learning fundamentals
Familiarity with RL environments and training libraries such as Verl and Tinker
Strong foundation in probability, math, and optimization
Passion for building real-world AI systems

Nice to have

Experience with RLHF, LLM/SLM fine-tuning, or model alignment
Exposure to agent-based systems or multi-agent RL
Prior research, projects, or publications in RL or applied ML
Experience working with large-scale or production datasets

Why Level AI

Work on production-grade Agentic AI systems used by leading enterprises
Build alongside a team with deep expertise from Amazon, Google, and Meta
Be part of a fast-growing Series C AI company.
Direct exposure to 0→1 AI innovation in CX and decisioning systems

Similar Jobs

Zapier

Automation Strategist (Customer Success)

3 Hours Ago

In-Office or Remote

Canada

Senior level

Artificial Intelligence • Productivity • Software • Automation

The Automation Strategist will guide customers in automating processes, help identify use cases, and promote AI-enabled transformation, focusing on value delivery and relationship building.

Top Skills: AIAutomation

Optum

Technical Engineer - Remote

3 Hours Ago

In-Office or Remote

Junior

Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics

The Technical Engineer is responsible for remote support and maintenance of PACS and Cloud-based products, ensuring customer satisfaction through timely issue resolution and communication. Duties include diagnosing technical problems, conducting inspections, generating reports, and performing installations and upgrades.

Top Skills: Cloud StorageDicomDlt TapeGitlabGoogle Transfer AppliancesJIRALinuxMs SqlPacsSan StorageSql PlusUnixVMware

Optum

Director, Program Management - Technology & Software Engineering - Remote

3 Hours Ago

In-Office or Remote

Senior level

Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics

The Director leads program management for Enterprise Imaging Engineering, overseeing technology initiatives, team development, and strategic execution across software delivery in a matrixed environment.

Top Skills: Al SolutionsAutomationCloud-Native PlatformsSoftware Engineering

What you need to know about the Toronto Tech Scene

Although home to some of the biggest names in tech, including Google, Microsoft and Amazon, Toronto has established itself as one of the largest startup ecosystems in the world. And with over 2,000 startups — more than 30 percent of the country's total startups — Toronto continues to attract new businesses. Be it helping entrepreneurs manage their finances, simplifying business operations by automating payroll or assisting pharmaceutical companies in launching new drugs, the city's tech scene is just getting started.