Offline agent using reinforcement learning to speedup trajectory planning for autonomous vehicles

Invention Grant

US11493926B2 Offline agent using reinforcement learning to speedup trajectory planning for autonomous vehicles 有权

Please log in to see more content

Patent Title: Offline agent using reinforcement learning to speedup trajectory planning for autonomous vehicles
Application No.: US16413339

Application Date: 2019-05-15
Publication No.: US11493926B2

Publication Date: 2022-11-08
Inventor: Runxin He , Jinyun Zhou , Qi Luo , Shiyu Song , Jinghao Miao , Jiangtao Hu , Yu Wang , Jiaxuan Xu , Shu Jiang
Applicant: Baidu USA LLC
Applicant Address: US CA Sunnyvale
Assignee: Baidu USA LLC
Current Assignee: Baidu USA LLC
Current Assignee Address: US CA Sunnyvale
Agency: Womble Bond Dickinson (US) LLP
Main IPC: G05D1/02
IPC: G05D1/02 ; G06N3/08 ; G06N3/04

Offline agent using reinforcement learning to speedup trajectory planning for autonomous vehicles

Abstract:

In one embodiment, a system generates a plurality of driving scenarios to train a reinforcement learning (RL) agent and replays each of the driving scenarios to train the RL agent by: applying a RL algorithm to an initial state of a driving scenario to determine a number of control actions from a number of discretized control/action options for the ADV to advance to a number of trajectory states which are based on a number of discretized trajectory state options, determining a reward prediction by the RL algorithm for each of the controls/actions, determining a judgment score for the trajectory states, and updating the RL agent based on the judgment score.

Information query

Espacenet

IPC分类:

G	物理
G05	控制；调节
G05D	非电变量的控制或调节系统（金属的连续铸造入B22D11/16；阀门本身入F16K；非电变量的检测见G01各有关小类；电或磁变量的调节入G05F）
G05D1/00	陆地、水上、空中或太空中的运载工具的位置、航道、高度或姿态的控制，例如自动驾驶仪（无线电导航系统或使用其他波的类似系统入G01S）
G05D1/02	.二维的位置或航道控制