Invention Grant
- Patent Title: Unsupervised learning techniques for temporal difference models
-
Application No.: US15432045Application Date: 2017-02-14
-
Publication No.: US10311339B2Publication Date: 2019-06-04
- Inventor: Bryan Andrew Seybold
- Applicant: Google Inc.
- Applicant Address: US CA Mountain View
- Assignee: Google LLC
- Current Assignee: Google LLC
- Current Assignee Address: US CA Mountain View
- Agency: Dority & Manning, P.A.
- Main IPC: G06K9/00
- IPC: G06K9/00 ; G06K9/62 ; G06N3/04 ; G06N3/08

Abstract:
A temporal difference model can be trained to receive at least a first state representation and a second state representation that respectively describe a state of an object at two different times and, in response, output a temporal difference representation that encodes changes in the object between the two different times. To train the model, the temporal difference model can be combined with a prediction model that, given the temporal difference representation and the first state representation, seeks to predict or otherwise reconstruct the second state representation. The temporal difference model can be trained on a loss value that represents a difference between the second state representation and the prediction of the second state representation. In such fashion, unlabeled data can be used to train the temporal difference model to provide a temporal difference representation. The present disclosure further provides example uses for such temporal difference models once trained.
Public/Granted literature
- US20180232604A1 Unsupervised Learning Techniques for Temporal Difference Models Public/Granted day:2018-08-16
Information query