Invention Grant
- Patent Title: Method and apparatus for reinforcement learning training sessions with consideration of resource costing and resource utilization
-
Application No.: US16278699Application Date: 2019-02-18
-
Publication No.: US11748611B2Publication Date: 2023-09-05
- Inventor: Sumit Sanyal , Anil Hebbar , Abdul Puliyadan Kunnil Muneer , Abhinav Kaushik , Bharat Kumar Padi , Jeroen Bédorf , Tijmen Tieleman
- Applicant: Sumit Sanyal , Anil Hebbar , Abdul Puliyadan Kunnil Muneer , Abhinav Kaushik , Bharat Kumar Padi , Jeroen Bédorf , Tijmen Tieleman
- Applicant Address: US CA Santa Cruz
- Assignee: Sumit Sanyal,Anil Hebbar,Abdul Puliyadan Kunnil Muneer,Abhinav Kaushik,Bharat Kumar Padi,Jeroen Bédorf,Tijmen Tieleman
- Current Assignee: Sumit Sanyal,Anil Hebbar,Abdul Puliyadan Kunnil Muneer,Abhinav Kaushik,Bharat Kumar Padi,Jeroen Bédorf,Tijmen Tieleman
- Current Assignee Address: US CA Santa Cruz
- Agent Patrick Reilly
- Main IPC: G06N3/08
- IPC: G06N3/08 ; G06N5/022 ; G06Q10/0631 ; G06N3/045

Abstract:
Reinforcement learning enables a framework of information technology assets that include software elements, computational hardware assets, and/or, bundled software and computational hardware systems and products. The performance of successive sessions of an inner loop reinforcement learning is directed and monitored by an outer loop reinforcement learning wherein the outer loop reinforcement learning is designed to reduce financial costs and computational asset requirements and/or optimize learning time in successive instantiations of inner loop reinforcement learning training sessions. The framework enables consideration of the license costs of domain specific simulators, the usage cost of hardware platforms, and the progress of a particular reinforcement learning training. The framework further enables reductions of these costs to orchestrate and train a neural network under budget constraints with respect to the available hardware and software licenses available at runtime. These improvements and optimizations may be performed by using heuristics and neural network algorithms.
Public/Granted literature
Information query