Invention Grant
- Patent Title: Updating policy parameters under Markov decision process system environment
-
Application No.: US13898740Application Date: 2013-05-21
-
Publication No.: US08909571B2Publication Date: 2014-12-09
- Inventor: Tetsuro Morimura , Takayuki Osogami , Tomoyuki Shirai
- Applicant: International Business Machines Corporation
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agency: Cantor Colburn LLP
- Priority: JP2012-116440 20120522
- Main IPC: G06F15/18
- IPC: G06F15/18 ; G06N99/00 ; G06N5/02

Abstract:
Embodiments relate to updating a parameter defining a policy under a Markov decision process system environment. An aspect includes updating the policy parameter stored in a storage section of a controller according to an update equation. The update equation includes a term for decreasing a weighted sum of expected hitting times over a first state (s) and a second state (s′) of a statistic on the number of steps required to make a first state transition from the first state (s) to the second state (s′).
Public/Granted literature
- US20130325764A1 UPDATING POLICY PARAMETERS UNDER MARKOV DECISION PROCESS SYSTEM ENVIRONMENT Public/Granted day:2013-12-05
Information query