Invention Grant
- Patent Title: Probabilistic neural network architecture generation
-
Application No.: US18107612Application Date: 2023-02-09
-
Publication No.: US12079726B2Publication Date: 2024-09-03
- Inventor: Nicolo Fusi , Francesco Paolo Casale , Jonathan Gordon
- Applicant: Microsoft Technology Licensing, LLC
- Applicant Address: US WA Redmond
- Assignee: Microsoft Technology Licensing, LLC
- Current Assignee: Microsoft Technology Licensing, LLC
- Current Assignee Address: US WA Redmond
- Main IPC: G06N3/082
- IPC: G06N3/082 ; G06F18/21 ; G06F18/214 ; G06N3/047 ; G06N3/08

Abstract:
Examples of the present disclosure describe systems and methods for probabilistic neural network architecture generation. In an example, an underlying distribution over neural network architectures based on various parameters is sampled using probabilistic modeling. Training data is evaluated in order to iteratively update the underlying distribution, thereby generating a probability distribution over the neural network architectures. The distribution is iteratively trained until the parameters associated with the neural network architecture converge. Once it is determined that the parameters have converged, the resulting probability distribution may be used to generate a resulting neural network architecture. As a result, intermediate architectures need not be fully trained, which dramatically reduces memory usage and/or processing time. Further, in some instances, it is possible to evaluate bigger architectures and/or larger batch sizes while also reducing neural network architecture generation time and maintaining or improving neural network accuracy.
Public/Granted literature
- US20230186094A1 PROBABILISTIC NEURAL NETWORK ARCHITECTURE GENERATION Public/Granted day:2023-06-15
Information query