Neural episodic control

Invention Grant

US10664753B2 Neural episodic control 有权

Please log in to see more content

Patent Title: Neural episodic control
Application No.: US16445523

Application Date: 2019-06-19
Publication No.: US10664753B2

Publication Date: 2020-05-26
Inventor: Benigno Uria-Martínez , Alexander Pritzel , Charles Blundell , Adria Puigdomenech Badia
Applicant: DeepMind Technologies Limited
Applicant Address: GB London
Assignee: DeepMind Technologies Limited
Current Assignee: DeepMind Technologies Limited
Current Assignee Address: GB London
Agency: Fish & Richardson P.C.
Main IPC: G06N3/08
IPC: G06N3/08 ; G06N3/00 ; G06N3/04

Abstract:

A method includes maintaining respective episodic memory data for each of multiple actions; receiving a current observation characterizing a current state of an environment being interacted with by an agent; processing the current observation using an embedding neural network in accordance with current values of parameters of the embedding neural network to generate a current key embedding for the current observation; for each action of the plurality of actions: determining the p nearest key embeddings in the episodic memory data for the action to the current key embedding according to a distance measure, and determining a Q value for the action from the return estimates mapped to by the p nearest key embeddings in the episodic memory data for the action; and selecting, using the Q values for the actions, an action from the multiple actions as the action to be performed by the agent.

Public/Granted literature

US20190303764A1 NEURAL EPISODIC CONTROL Public/Granted day:2019-10-03

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06N	基于特定计算模型的计算机系统
G06N3/00	基于生物学模型的计算机系统
G06N3/02	.采用神经网络模型
G06N3/08	..学习方法