Constraining actions for reinforcement learning under safety requirements

Invention Grant

US11468310B2 Constraining actions for reinforcement learning under safety requirements 有权

Please log in to see more content

Patent Title: Constraining actions for reinforcement learning under safety requirements
Application No.: US15914240

Application Date: 2018-03-07
Publication No.: US11468310B2

Publication Date: 2022-10-11
Inventor: Tu-Hoa Pham , Giovanni De Magistris , Ryuki Tachibana
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Applicant Address: US NY Armonk
Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
Current Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
Current Assignee Address: US NY Armonk
Agency: Tutunjian & Bitetto, P.C.
Agent Randall Bluestone
Main IPC: G06N3/08
IPC: G06N3/08 ; G06N5/04

Constraining actions for reinforcement learning under safety requirements

Abstract:

A computer-implemented method, computer program product, and system are provided for deep reinforcement learning to control a subject device. The method includes training, by a processor, a neural network to receive state information of a target of the subject device as an input and provide action information for the target as an output. The method further includes inputting, by the processor, current state information of the target into the neural network to obtain current action information for the target. The method also includes correcting, by the processor, the current action information minimally to obtain corrected action information that meets a set of constraints. The method additionally includes performing an action by the subject device based on the corrected action information for the target to obtain a reward from the target.

Public/Granted literature

US20190279081A1 CONSTRAINING ACTIONS FOR REINFORCEMENT LEARNING UNDER SAFETY REQUIREMENTS Public/Granted day:2019-09-12

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06N	基于特定计算模型的计算机系统
G06N3/00	基于生物学模型的计算机系统
G06N3/02	.采用神经网络模型
G06N3/08	..学习方法