Updating policy parameters under Markov decision process system environment

Invention Grant

US08909571B2 Updating policy parameters under Markov decision process system environment 有权

Please log in to see more content

Patent Title: Updating policy parameters under Markov decision process system environment
Application No.: US13898740

Application Date: 2013-05-21
Publication No.: US08909571B2

Publication Date: 2014-12-09
Inventor: Tetsuro Morimura , Takayuki Osogami , Tomoyuki Shirai
Applicant: International Business Machines Corporation
Applicant Address: US NY Armonk
Assignee: International Business Machines Corporation
Current Assignee: International Business Machines Corporation
Current Assignee Address: US NY Armonk
Agency: Cantor Colburn LLP
Priority: JP2012-116440 20120522
Main IPC: G06F15/18
IPC: G06F15/18 ; G06N99/00 ; G06N5/02

Updating policy parameters under Markov decision process system environment

Abstract:

Embodiments relate to updating a parameter defining a policy under a Markov decision process system environment. An aspect includes updating the policy parameter stored in a storage section of a controller according to an update equation. The update equation includes a term for decreasing a weighted sum of expected hitting times over a first state (s) and a second state (s′) of a statistic on the number of steps required to make a first state transition from the first state (s) to the second state (s′).

Public/Granted literature

US20130325764A1 UPDATING POLICY PARAMETERS UNDER MARKOV DECISION PROCESS SYSTEM ENVIRONMENT Public/Granted day:2013-12-05

Information query

Espacenet