Bot Auto Logo

Bot Auto

ML/RL Engineer, Behavior Planning

Posted 3 Days Ago
In-Office or Remote
Hiring Remotely in CA
Senior level
In-Office or Remote
Hiring Remotely in CA
Senior level
Develop and train conditioned policies and MARL systems to simulate realistic driving behaviors, implement safety-constrained RL algorithms, design rewards and evaluation metrics, optimize large-scale training pipelines, advance neural architectures for long-horizon planning and spatial reasoning, and integrate research models with production simulation and planning teams.
The summary above was generated by AI
Company Introduction

At Bot Auto, we are revolutionizing the transportation of goods with our cutting-edge autonomous trucks, enhancing the quality of life for communities around the globe. With the agility of a startup and the wisdom of seasoned experts, our team has achieved numerous world-firsts and unparalleled innovations. United by a shared vision, we create groundbreaking solutions that propel the future of transportation. Join us and transform your ideas into reality.

Role Overview

We are seeking a ML/RL Engineer to join our Algo team and drive the development of our unified behavioral architecture. In this role, you will help bridge the gap between simulation and the real world by developing a scalable policy framework that represents both our L4 ego-policy and a diverse population of simulated agents. You will work at the intersection of Multi-Agent Reinforcement Learning (MARL) and safety-critical system design to ensure our autonomous semi-trucks navigate highways with superhuman safety and precision.

Key Responsibilities
  • Behavioral Modeling: Develop and train diverse, conditioned policies that simulate realistic driving behaviors to stress-test and validate our autonomous driving stack.
  • Safety-Constrained Learning: Lead the research and implementation of advanced RL algorithms to ensure safety metrics are treated as primary constraints in the learning process.
  • Reward & Objective Design: Collaborate with cross-functional teams to design robust reward functions and evaluation metrics that balance safety, progress, and comfort.
  • Scalable Training Pipelines: Contribute to the optimization of our large-scale, high-throughput training environments to enable rapid iteration on complex multi-agent scenarios.
  • Model Architecture: Advance our state-of-the-art neural architectures to improve spatial reasoning, long-horizon planning, and interaction modeling.
  • Cross-Team Collaboration: Work closely with Simulation and Planning teams to integrate research-grade models into production-quality, safety-critical software.
Required Qualifications
  • Professional RL Experience: Proven track record of training and deploying deep RL algorithms (e.g., PPO, SAC) for complex, real-world robotic or autonomous systems.
  • Technical Mastery: Expertise in Python and PyTorch; strong understanding of modern deep learning architectures and optimization techniques.
  • Academic Background: MS or PhD in Computer Science, Robotics, or a related quantitative field.
  • Scientific Intuition: Ability to diagnose and solve fundamental challenges in RL training, such as variance management and distribution shift.
Preferred Qualifications
  • Safe RL Specialization: Experience with constrained optimization or safety-critical learning frameworks.
  • Multi-Agent Systems: Background in MARL training stability, including self-play and decentralized execution strategies.
  • Autonomous Driving Domain: Familiarity with vehicle dynamics and behavior planning, particularly for long-haul highway environments.
Additional Information
  • Compensation: Competitive salary based on experience, with opportunities for performance bonuses and equity.
  • Benefits: Comprehensive health insurance, paid time off, and the opportunity to work at the forefront of the autonomous trucking industry.

Similar Jobs

3 Days Ago
In-Office or Remote
CA
Senior level
Senior level
Logistics • Transportation
The Senior ML/RL Engineer will develop behavior models, implement RL algorithms focusing on safety, design reward functions and optimize training environments while collaborating across teams.
Top Skills: PythonPyTorch
13 Minutes Ago
Remote
Ontario, ON, CAN
Senior level
Senior level
Cloud • Fintech • Food • Information Technology • Software • Hospitality
Serve as customers' primary advisor for a book of mid-market and complex accounts. Drive retention, ARR growth, product activation and adoption, perform demos, manage escalations, provide VoC feedback, and partner with sales, product, and onboarding to support international growth.
Top Skills: Google SuiteMS OfficeSalesforce CRMSlackToast
An Hour Ago
Remote
2 Locations
Senior level
Senior level
Fintech • Financial Services
Lead BI analyst leveraging a modern data stack to deliver dashboards, automated reports, and AI-driven insights. Partner with stakeholders to define data needs, build Tableau visualizations, write complex SQL in Snowflake, and integrate Snowflake Intelligence/Claude to automate workflows and enable self-service analytics across the company.
Top Skills: CensusClaudeDbtExcelFivetranGitPythonSalesforceSnowflakeSnowflake IntelligenceSQLTableau

What you need to know about the Toronto Tech Scene

Although home to some of the biggest names in tech, including Google, Microsoft and Amazon, Toronto has established itself as one of the largest startup ecosystems in the world. And with over 2,000 startups — more than 30 percent of the country's total startups — Toronto continues to attract new businesses. Be it helping entrepreneurs manage their finances, simplifying business operations by automating payroll or assisting pharmaceutical companies in launching new drugs, the city's tech scene is just getting started.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account