Deep reinforcement learning for long term rewards in an online connection network

Invention Grant

US11620595B2 Deep reinforcement learning for long term rewards in an online connection network 有权

Please log in to see more content

Patent Title: Deep reinforcement learning for long term rewards in an online connection network
Application No.: US16743486

Application Date: 2020-01-15
Publication No.: US11620595B2

Publication Date: 2023-04-04
Inventor: Siyuan Gao , Yiou Xiao , Parag Agrawal , Aastha Jain
Applicant: Microsoft Technology Licensing, LLC
Applicant Address: US WA Redmond
Assignee: Microsoft Technology Licensing, LLC
Current Assignee: Microsoft Technology Licensing, LLC
Current Assignee Address: US WA Redmond
Agency: Schwegman, Lundberg & Woessner, P.A.
Main IPC: G06Q10/06
IPC: G06Q10/06 ; G06N20/00 ; G06Q50/00 ; H04L67/50 ; G06Q10/0631

Deep reinforcement learning for long term rewards in an online connection network

Abstract:

An online connection server is configured to more accurately predict connections for a viewing member of an online connection network. The online connection server may implement a machine-learning model that uses prior interactions by the viewing member to determine those connections that are likely to lead to more substantial interactions with the viewing member. The machine-learning model may be implemented using a reinforcement learning technique, such as a Deep Q network. The online connection server may further implement a state representation module that generates a state from a graph-based embedding of the viewing member profile, where the state is used to train the machine-learning model and determine an optimal candidate to recommend as a connection for the viewing member.

Public/Granted literature

US20210216944A1 DEEP REINFORCEMENT LEARNING FOR LONG TERM REWARDS IN AN ONLINE CONNECTION NETWORK Public/Granted day:2021-07-15

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06Q	专门适用于行政、商业、金融、管理、监督或预测目的的数据处理系统或方法；其他类目不包含的专门适用于行政、商业、金融、管理、监督或预测目的的处理系统或方法
G06Q10/00	行政；管理
G06Q10/06	.资源、工作流、人员或项目管理，例如组织、规划、调度或分配时间、人员或机器资源；企业规划；组织模型