Invention Grant
- Patent Title: Training an unsupervised memory-based prediction system to learn compressed representations of an environment
-
Application No.: US16766945Application Date: 2019-03-11
-
Publication No.: US12159221B2Publication Date: 2024-12-03
- Inventor: Gregory Duncan Wayne , Chia-Chun Hung , David Antony Amos , Mehdi Mirza Mohammadi , Arun Ahuja , Timothy Paul Lillicrap
- Applicant: DeepMind Technologies Limited
- Applicant Address: GB London
- Assignee: DeepMind Technologies Limited
- Current Assignee: DeepMind Technologies Limited
- Current Assignee Address: GB London
- Agency: Fish & Richardson P.C.
- International Application: PCT/EP2019/055950 WO 20190311
- International Announcement: WO2019/170905 WO 20190912
- Main IPC: G06N3/08
- IPC: G06N3/08 ; G06N3/045

Abstract:
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a memory-based prediction system configured to receive an input observation characterizing a state of an environment interacted with by an agent and to process the input observation and data read from a memory to update data stored in the memory and to generate a latent representation of the state of the environment. The method comprises: for each of a plurality of time steps: processing an observation for the time step and data read from the memory to: (i) update the data stored in the memory, and (ii) generate a latent representation of the current state of the environment as of the time step; and generating a predicted return that will be received by the agent as a result of interactions with the environment after the observation for the time step is received.
Public/Granted literature
Information query