Unsupervised learning techniques for temporal difference models

Invention Grant

US10311339B2 Unsupervised learning techniques for temporal difference models 有权

Please log in to see more content

Patent Title: Unsupervised learning techniques for temporal difference models
Application No.: US15432045

Application Date: 2017-02-14
Publication No.: US10311339B2

Publication Date: 2019-06-04
Inventor: Bryan Andrew Seybold
Applicant: Google Inc.
Applicant Address: US CA Mountain View
Assignee: Google LLC
Current Assignee: Google LLC
Current Assignee Address: US CA Mountain View
Agency: Dority & Manning, P.A.
Main IPC: G06K9/00
IPC: G06K9/00 ; G06K9/62 ; G06N3/04 ; G06N3/08

Unsupervised learning techniques for temporal difference models

Abstract:

A temporal difference model can be trained to receive at least a first state representation and a second state representation that respectively describe a state of an object at two different times and, in response, output a temporal difference representation that encodes changes in the object between the two different times. To train the model, the temporal difference model can be combined with a prediction model that, given the temporal difference representation and the first state representation, seeks to predict or otherwise reconstruct the second state representation. The temporal difference model can be trained on a loss value that represents a difference between the second state representation and the prediction of the second state representation. In such fashion, unlabeled data can be used to train the temporal difference model to provide a temporal difference representation. The present disclosure further provides example uses for such temporal difference models once trained.

Public/Granted literature

US20180232604A1 Unsupervised Learning Techniques for Temporal Difference Models Public/Granted day:2018-08-16

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06K	图形数据读取（图像或视频识别或理解G06V）；数据的呈现；记录载体；处理记录载体
G06K9/00	识别模式的方法或装置（图形读取或将机械参数模式（例如力或存在）转换为电信号的方法或装置 G06K11/00）（图像或视频识别或理解 G06V）（语音识别 G10L15/00 )