Learning method for learning action of agent using model-based reinforcement learning

Invention Grant

US11651282B2 Learning method for learning action of agent using model-based reinforcement learning 有权

Please log in to see more content

Patent Title: Learning method for learning action of agent using model-based reinforcement learning
Application No.: US16918390

Application Date: 2020-07-01
Publication No.: US11651282B2

Publication Date: 2023-05-16
Inventor: Masashi Okada
Applicant: Panasonic Intellectual Property Corporation of America
Applicant Address: US CA Torrance
Assignee: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA
Current Assignee: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA
Current Assignee Address: US CA Torrance
Agency: Wenderoth, Lind & Ponack, L.L.P.
Priority: JP 2020053613 2020.03.25
Main IPC: G06N20/00
IPC: G06N20/00 ; G06N5/043

Learning method for learning action of agent using model-based reinforcement learning

Abstract:

A learning method for learning an action of an agent using model-based reinforcement learning is provided. The learning method includes: obtaining time series data indicating states and actions of the agent when the agent performs a series of actions; establishing a dynamics model by performing supervised learning using the time series data obtained; deriving a plurality of candidates for an action sequence of the agent from variational inference using a mixture model as a variational distribution, based on the dynamics model; and outputting, as the action sequence of the agent, one candidate selected from among the plurality of candidates derived.

Public/Granted literature

US20210004717A1 LEARNING METHOD AND RECORDING MEDIUM Public/Granted day:2021-01-07

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06N	基于特定计算模型的计算机系统
G06N20/00	机器学习