Reinforcement learning with a stochastic action set

Invention Grant

US11615293B2 Reinforcement learning with a stochastic action set 有权

Please log in to see more content

Patent Title: Reinforcement learning with a stochastic action set
Application No.: US16578863

Application Date: 2019-09-23
Publication No.: US11615293B2

Publication Date: 2023-03-28
Inventor: Georgios Theocharous , Yash Chandak
Applicant: ADOBE INC.
Applicant Address: US CA San Jose
Assignee: ADOBE INC.
Current Assignee: ADOBE INC.
Current Assignee Address: US CA San Jose
Agency: F. Chau & Associates, LLC
Main IPC: G06N3/04
IPC: G06N3/04 ; G06N3/08

Reinforcement learning with a stochastic action set

Abstract:

Systems and methods are described for a decision-making process including actions characterized by stochastic availability, provide an Markov decision process (MDP) model that includes a stochastic action set based on the decision-making process, compute a policy function for the MDP model using a policy gradient based at least in part on a function representing the stochasticity of the stochastic action set, identify a probability distribution for one or more actions available at a time period using the policy function, and select an action for the time period based on the probability distribution.

Public/Granted literature

US20210089868A1 REINFORCEMENT LEARNING WITH A STOCHASTIC ACTION SET Public/Granted day:2021-03-25

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06N	基于特定计算模型的计算机系统
G06N3/00	基于生物学模型的计算机系统
G06N3/02	.采用神经网络模型
G06N3/04	..体系结构，例如，互连拓扑