Invention Grant
- Patent Title: Constraining actions for reinforcement learning under safety requirements
-
Application No.: US15914240Application Date: 2018-03-07
-
Publication No.: US11468310B2Publication Date: 2022-10-11
- Inventor: Tu-Hoa Pham , Giovanni De Magistris , Ryuki Tachibana
- Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Applicant Address: US NY Armonk
- Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee Address: US NY Armonk
- Agency: Tutunjian & Bitetto, P.C.
- Agent Randall Bluestone
- Main IPC: G06N3/08
- IPC: G06N3/08 ; G06N5/04

Abstract:
A computer-implemented method, computer program product, and system are provided for deep reinforcement learning to control a subject device. The method includes training, by a processor, a neural network to receive state information of a target of the subject device as an input and provide action information for the target as an output. The method further includes inputting, by the processor, current state information of the target into the neural network to obtain current action information for the target. The method also includes correcting, by the processor, the current action information minimally to obtain corrected action information that meets a set of constraints. The method additionally includes performing an action by the subject device based on the corrected action information for the target to obtain a reward from the target.
Public/Granted literature
- US20190279081A1 CONSTRAINING ACTIONS FOR REINFORCEMENT LEARNING UNDER SAFETY REQUIREMENTS Public/Granted day:2019-09-12
Information query