Reinforcement learning method and reinforcement learning system

Invention Grant

US11619915B2 Reinforcement learning method and reinforcement learning system (Active)

Patent Title: Reinforcement learning method and reinforcement learning system
Application No.: US16797573

Application Date: 2020-02-21
Publication No.: US11619915B2

Publication Date: 2023-04-04
Inventor: Hidenao Iwane, Junichi Shigezumi, Yoshihiro Okawa, Tomotake Sasaki, Hitoshi Yanami
Applicant: FUJITSU LIMITED
Applicant Address: JP Kawasaki
Assignee: FUJITSU LIMITED
Current Assignee: FUJITSU LIMITED
Current Assignee Address: JP Kawasaki
Agency: Xsensus LLP
Priority: JP2019-039031 2019-03-04
Main IPC: G05B13/02
IPC: G05B13/02; G06N20/00; B25J9/16; H02J3/38

Abstract:

A computer-implemented reinforcement learning method includes determining, based on a target probability of satisfaction of a constraint condition related to a state of a control object and a specific time within which a controller causes a state of the control object that does not satisfy the constraint condition to become a state that satisfies the constraint condition, a parameter of a reinforcement learner that causes, with a specific probability, the state of the control object to satisfy the constraint condition at a first timing following a second timing at which the state of the control object satisfies the constraint condition; and determining a control input to the control object by either the reinforcement learner or the controller, based on whether the state of the control object satisfies the constraint condition at a specific timing.
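
The second step of the abstract, choosing between the reinforcement learner and the controller according to whether the constraint condition currently holds, can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and is not taken from the patent: the scalar state, the constraint |x| <= 1, the plant x + u, the placeholder learner that always returns 0.1, and the fallback controller u = -x. The sketch also assumes that the learner acts when the constraint is satisfied and the controller acts otherwise. It does not show how the learner's parameter is derived from the target probability and the recovery time.

```python
from typing import Callable, List, Tuple


def select_control_input(
    state: float,
    satisfies_constraint: Callable[[float], bool],
    learner_policy: Callable[[float], float],
    fallback_controller: Callable[[float], float],
) -> Tuple[float, str]:
    """Return (control input, source) for the current state.

    Assumed reading of the abstract: the learner chooses the input while
    the constraint holds, the controller chooses it otherwise.
    """
    if satisfies_constraint(state):
        return learner_policy(state), "learner"
    return fallback_controller(state), "controller"


# --- Hypothetical toy setup (not from the patent) ---

def within_limit(x: float) -> bool:
    """Illustrative constraint condition: |x| <= 1."""
    return abs(x) <= 1.0


def learner(x: float) -> float:
    """Placeholder for a reinforcement learner's (exploratory) action."""
    return 0.1


def controller(x: float) -> float:
    """Illustrative fallback controller that drives the state back to 0."""
    return -x


def run_episode(x0: float, steps: int) -> List[Tuple[float, float, str]]:
    """Simulate the assumed plant x_next = x + u and log each decision."""
    log = []
    x = x0
    for _ in range(steps):
        u, source = select_control_input(x, within_limit, learner, controller)
        log.append((x, u, source))
        x = x + u  # assumed plant dynamics
    return log
```

Calling `run_episode(2.0, 3)` starts outside the assumed constraint, so the first input comes from the controller; once the state is back inside the limit, later inputs come from the learner.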

Public/Granted literature

US20200285204A1 Reinforcement learning method and reinforcement learning system (Public/Granted day: 2020-09-10)

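IPC Classification:

G	Physics
G05	Controlling; regulating
G05B	Control or regulating systems in general; functional elements of such systems; monitoring or testing arrangements for such systems or elements (fluid-pressure actuators or systems acting by means of fluids in general, see F15B; valves per se, see F16K; characterised by mechanical features only, see G05G; sensitive elements, see the appropriate subclasses, e.g. G12B, subclasses of G01, H01; correcting units, see the appropriate subclasses, e.g. H02K)
G05B13/00	Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion (G05B19/00 takes precedence; machine learning, see G06N 20/00)
G05B13/02	. electric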