Generating and providing proposed digital actions in high-dimensional action spaces using reinforcement learning models

Invention Grant

US12288074B2 Generating and providing proposed digital actions in high-dimensional action spaces using reinforcement learning models 有权

Please log in to see more content

Patent Title: Generating and providing proposed digital actions in high-dimensional action spaces using reinforcement learning models
Application No.: US16261092

Application Date: 2019-01-29
Publication No.: US12288074B2

Publication Date: 2025-04-29
Inventor: Yash Chandak , Georgios Theocharous
Applicant: Adobe Inc.
Applicant Address: US CA San Jose
Assignee: Adobe Inc.
Current Assignee: Adobe Inc.
Current Assignee Address: US CA San Jose
Agency: Keller Preece PLLC
Main IPC: G06F9/38
IPC: G06F9/38 ; G06F9/48 ; G06N3/08 ; G06N20/00

Generating and providing proposed digital actions in high-dimensional action spaces using reinforcement learning models

Abstract:

The present disclosure relates to generating proposed digital actions in high-dimensional action spaces for client devices utilizing reinforcement learning models. For example, the disclosed systems can utilize a supervised machine learning model to train a latent representation decoder to determine proposed digital actions based on latent representations. Additionally, the disclosed systems can utilize a latent representation policy gradient model to train a state-based latent representation generation policy to generate latent representations based on the current state of client devices. Subsequently, the disclosed systems can identify the current state of a client device and a plurality of available actions, utilize the state-based latent representation generation policy to generate a latent representation based on the current state, and utilize the latent representation decoder to determine a proposed digital action from the plurality of available actions by analyzing the latent representation.

Public/Granted literature

US20200241878A1 GENERATING AND PROVIDING PROPOSED DIGITAL ACTIONS IN HIGH-DIMENSIONAL ACTION SPACES USING REINFORCEMENT LEARNING MODELS Public/Granted day:2020-07-30

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06F	电数字数据处理（基于特定计算模型的计算机系统入G06N）
G06F9/00	程序控制装置，例如，控制单元（用于外部设备的程序控制入G06F13/10）
G06F9/06	.应用存入的程序的，即应用处理设备的内部存储来接收程序并保持程序的
G06F9/30	..与执行机器指令相关的设计，例如指令译码（用于执行微指令的入G06F9/22；）
G06F9/38	...并行执行指令的，例如，流水线、超前锁定