System and method for deep reinforcement learning

Invention Grant

US11574148B2 System and method for deep reinforcement learning (Active)

Patent Title: System and method for deep reinforcement learning
Application No.: US16674932

Application Date: 2019-11-05
Publication No.: US11574148B2

Publication Date: 2023-02-07
Inventor: Bilal Kartal, Pablo Francisco Hernandez Leal, Matthew Edmund Taylor
Applicant: ROYAL BANK OF CANADA
Applicant Address: CA Toronto
Assignee: ROYAL BANK OF CANADA
Current Assignee: ROYAL BANK OF CANADA
Current Assignee Address: CA Toronto
Agency: Norton Rose Fulbright Canada LLP
Main IPC: G06K9/62
IPC: G06K9/62; G06N3/04; G06N3/08

Abstract:

Various embodiments describe a computer system and method for extending parallelized asynchronous reinforcement learning to train a neural network. A plurality of hardware processors or threads operate in coordination, each functioning as a worker agent configured to interact simultaneously with a target computing environment, to compute local gradients based on a loss determination, and to update global network parameters based at least on the local gradient computation. The neural network is trained by modifying the weighted interconnections between interconnected computing units as gradient computation is conducted across a plurality of iterations of the target computing environment. The loss determination includes at least a policy loss term (actor), a value loss term (critic), and an auxiliary control loss. Further variations are described in which the neural network is adapted to include terminal state prediction and action guidance.
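The three-term loss named in the abstract can be illustrated with a minimal sketch. This is not the patented implementation: the function name, the single-step advantage estimate, the squared-error form of the auxiliary term, and the weights `value_coef` and `aux_coef` are all assumptions chosen for illustration. In an asynchronous setup, each worker agent would evaluate a loss of this kind locally before pushing gradients to the global parameters.

```python
import math


def softmax(logits):
    """Convert raw action scores into a probability distribution."""
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def combined_loss(logits, action, value, target_return,
                  aux_pred, aux_target,
                  value_coef=0.5, aux_coef=0.5):
    """Hypothetical actor-critic loss with one auxiliary term.

    value_coef and aux_coef are illustrative weights, not values
    taken from the patent.
    """
    probs = softmax(logits)
    # One-step advantage estimate: how much better the return was
    # than the critic predicted.
    advantage = target_return - value
    # Actor term: raise the log-probability of actions with positive advantage.
    policy_loss = -math.log(probs[action]) * advantage
    # Critic term: squared error between predicted value and observed return.
    value_loss = advantage ** 2
    # Auxiliary term (assumed squared error), e.g. for a side prediction
    # such as closeness to a terminal state.
    aux_loss = (aux_pred - aux_target) ** 2
    total = policy_loss + value_coef * value_loss + aux_coef * aux_loss
    return {
        "policy": policy_loss,
        "value": value_loss,
        "auxiliary": aux_loss,
        "total": total,
    }


if __name__ == "__main__":
    losses = combined_loss(
        logits=[0.0, 0.0], action=0,
        value=0.5, target_return=1.0,
        aux_pred=0.2, aux_target=0.0,
    )
    print(losses)
```

Keeping the three terms separate in the returned dictionary makes it easy to monitor each one during training and to tune the weights independently.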

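IPC Classification:

G	Physics
G06	Computing; calculating or counting
G06K	Graphical data reading (image or video recognition or understanding G06V); presentation of data; record carriers; handling record carriers
G06K9/00	Methods or arrangements for recognising patterns (methods or arrangements for graph-reading or for converting the pattern of mechanical parameters, e.g. force or presence, into electrical signals G06K11/00) (image or video recognition or understanding G06V) (speech recognition G10L15/00)
G06K9/62	. Methods or arrangements for recognition using electronic means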