Invention Grant
- Patent Title: Training neural networks using synthetic gradients
-
Application No.: US16303595Application Date: 2017-05-19
-
Publication No.: US11715009B2Publication Date: 2023-08-01
- Inventor: Oriol Vinyals , Alexander Benjamin Graves , Wojciech Czarnecki , Koray Kavukcuoglu , Simon Osindero , Maxwell Elliot Jaderberg
- Applicant: DEEPMIND TECHNOLOGIES LIMITED
- Applicant Address: GB London
- Assignee: DeepMind Technologies Limited
- Current Assignee: DeepMind Technologies Limited
- Current Assignee Address: GB London
- Agency: Fish & Richardson P.C.
- International Application: PCT/US2017/033697 2017.05.19
- International Announcement: WO2017/201506A 2017.11.23
- Date entered country: 2018-11-20
- Main IPC: G06N3/084
- IPC: G06N3/084 ; G06N3/044 ; G06N3/045

Abstract:
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a neural network including a first subnetwork followed by a second subnetwork on training inputs by optimizing an objective function. In one aspect, a method includes processing a training input using the neural network to generate a training model output, including processing a subnetwork input for the training input using the first subnetwork to generate a subnetwork activation for the training input in accordance with current values of parameters of the first subnetwork, and providing the subnetwork activation as input to the second subnetwork; determining a synthetic gradient of the objective function for the first subnetwork by processing the subnetwork activation using a synthetic gradient model in accordance with current values of parameters of the synthetic gradient model; and updating the current values of the parameters of the first subnetwork using the synthetic gradient.
Public/Granted literature
- US20200320396A1 TRAINING NEURAL NETWORKS USING SYNTHETIC GRADIENTS Public/Granted day:2020-10-08
Information query