Distributional reinforcement learning for continuous control tasks

Invention Grant

US11481629B2 Distributional reinforcement learning for continuous control tasks 有权

Please log in to see more content

Patent Title: Distributional reinforcement learning for continuous control tasks
Application No.: US16759519

Application Date: 2018-10-29
Publication No.: US11481629B2

Publication Date: 2022-10-25
Inventor: David Budden , Matthew William Hoffman , Gabriel Barth-Maron
Applicant: DeepMind Technologies Limited
Applicant Address: GB London
Assignee: DeepMind Technologies Limited
Current Assignee: DeepMind Technologies Limited
Current Assignee Address: GB London
Agency: Fish & Richardson P.C.
International Application: PCT/EP2018/079526 WO 20181029
International Announcement: WO2019/081778 WO 20190502
Main IPC: G06N3/08
IPC: G06N3/08 ; G06N3/04

Distributional reinforcement learning for continuous control tasks

Abstract:

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training an action selection neural network that is used to select actions to be performed by a reinforcement learning agent interacting with an environment. In particular, the actions are selected from a continuous action space and the system trains the action selection neural network jointly with a distribution Q network that is used to update the parameters of the action selection neural network.

Public/Granted literature

US20200293883A1 DISTRIBUTIONAL REINFORCEMENT LEARNING FOR CONTINUOUS CONTROL TASKS Public/Granted day:2020-09-17

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06N	基于特定计算模型的计算机系统
G06N3/00	基于生物学模型的计算机系统
G06N3/02	.采用神经网络模型
G06N3/08	..学习方法