Invention Grant
- Patent Title: Monte-Carlo planning using contextual information
- Patent Title (Chinese): 蒙特卡洛计划使用上下文信息
- Application No.: US13348993
- Application Date: 2012-01-12
- Publication No.: US09047423B2
- Publication Date: 2015-06-02
- Inventor: Gerald J. Tesauro, Alina Beygelzimer, Richard B. Segal, Mark N. Wegman
- Applicant: Gerald J. Tesauro, Alina Beygelzimer, Richard B. Segal, Mark N. Wegman
- Applicant Address: US NY ARMONK
- Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee Address: US NY ARMONK
- Agency: Scully, Scott, Murphy & Presser, P.C.
- Agent: Daniel P. Morris, Esq.
- Main IPC: G06F17/50
- IPC: G06F17/50

Abstract:
A method, system and computer program product for choosing actions in a state of a planning problem. The system simulates one or more sequences of actions, state transitions and rewards starting from the current state of the planning problem. During the simulation of performing a given action in a given state, a data record is maintained of the observed contextual state information and the observed cumulative reward resulting from the action. The system performs a regression fit on the data records, enabling estimation of expected reward as a function of contextual state. The estimates of expected reward are used to guide the choice of actions during the simulations. Upon completion of all simulations, the top-level action that obtained the highest mean reward during the simulations is recommended for execution in the current state of the planning problem.
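
The loop described in the abstract can be illustrated with a minimal sketch. This is not the patented implementation: the toy dynamics in `step`, the single scalar context, the per-depth record keys, the one-variable least-squares fit, the epsilon-greedy exploration rule and all function names are assumptions made only for illustration.

```python
import random
from collections import defaultdict

ACTIONS = (0, 1)
HORIZON = 3  # assumed number of simulated steps per trajectory

def step(context, action, rng):
    """Hypothetical toy dynamics: reward depends on context, next context is random."""
    mean = context if action == 0 else 1.0 - context
    return rng.random(), mean + rng.gauss(0.0, 0.1)

def fit_line(records):
    """Least-squares fit of reward = a + b * context; returns a predictor."""
    n = len(records)
    mx = sum(x for x, _ in records) / n
    my = sum(y for _, y in records) / n
    sxx = sum((x - mx) ** 2 for x, _ in records)
    sxy = sum((x - mx) * (y - my) for x, y in records)
    b = sxy / sxx if sxx > 1e-12 else 0.0  # constant context: fall back to the mean
    return lambda x: my + b * (x - mx)

def choose(records, depth, context, rng, epsilon=0.2):
    """Pick an action guided by the regression estimates (assumed epsilon-greedy)."""
    for a in ACTIONS:
        if len(records[(depth, a)]) < 2:  # too little data to fit: try it
            return a
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    return max(ACTIONS, key=lambda a: fit_line(records[(depth, a)])(context))

def simulate(records, context, rng):
    """Run one trajectory, then log (context, cumulative reward) per action taken."""
    trace = []
    for depth in range(HORIZON):
        action = choose(records, depth, context, rng)
        next_context, reward = step(context, action, rng)
        trace.append((depth, action, context, reward))
        context = next_context
    ret = 0.0
    for depth, action, ctx, reward in reversed(trace):
        ret += reward  # cumulative reward from this step to the end
        records[(depth, action)].append((ctx, ret))
    return trace[0][1], ret  # top-level action and its total reward

def recommend(root_context, n_sims=500, seed=0):
    """Return the top-level action with the highest mean simulated reward."""
    rng = random.Random(seed)
    records, totals = defaultdict(list), defaultdict(list)
    for _ in range(n_sims):
        action, ret = simulate(records, root_context, rng)
        totals[action].append(ret)
    return max(totals, key=lambda a: sum(totals[a]) / len(totals[a]))
```

Calling `recommend(0.8)` runs the simulations from a root state whose context is 0.8 and returns the index of the recommended top-level action. In this toy setup, action 0 pays roughly the context value and action 1 roughly one minus it, so the regression at deeper steps learns which action suits which context.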
Public/Granted literature
- US20130185039A1 MONTE-CARLO PLANNING USING CONTEXTUAL INFORMATION Public/Granted day: 2013-07-18