Invention Grant
- Patent Title: Learning method for learning action of agent using model-based reinforcement learning
-
Application No.: US16918390Application Date: 2020-07-01
-
Publication No.: US11651282B2Publication Date: 2023-05-16
- Inventor: Masashi Okada
- Applicant: Panasonic Intellectual Property Corporation of America
- Applicant Address: US CA Torrance
- Assignee: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA
- Current Assignee: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA
- Current Assignee Address: US CA Torrance
- Agency: Wenderoth, Lind & Ponack, L.L.P.
- Priority: JP 2020053613 2020.03.25
- Main IPC: G06N20/00
- IPC: G06N20/00 ; G06N5/043

Abstract:
A learning method for learning an action of an agent using model-based reinforcement learning is provided. The learning method includes: obtaining time series data indicating states and actions of the agent when the agent performs a series of actions; establishing a dynamics model by performing supervised learning using the time series data obtained; deriving a plurality of candidates for an action sequence of the agent from variational inference using a mixture model as a variational distribution, based on the dynamics model; and outputting, as the action sequence of the agent, one candidate selected from among the plurality of candidates derived.
Public/Granted literature
- US20210004717A1 LEARNING METHOD AND RECORDING MEDIUM Public/Granted day:2021-01-07
Information query