Method and apparatus for reinforcement learning training sessions with consideration of resource costing and resource utilization

Invention Grant

US11748611B2 Method and apparatus for reinforcement learning training sessions with consideration of resource costing and resource utilization (Active)

Patent Title: Method and apparatus for reinforcement learning training sessions with consideration of resource costing and resource utilization
Application No.: US16278699

Application Date: 2019-02-18
Publication No.: US11748611B2

Publication Date: 2023-09-05
Inventor: Sumit Sanyal, Anil Hebbar, Abdul Puliyadan Kunnil Muneer, Abhinav Kaushik, Bharat Kumar Padi, Jeroen Bédorf, Tijmen Tieleman
Applicant: Sumit Sanyal, Anil Hebbar, Abdul Puliyadan Kunnil Muneer, Abhinav Kaushik, Bharat Kumar Padi, Jeroen Bédorf, Tijmen Tieleman
Applicant Address: US CA Santa Cruz
Assignee: Sumit Sanyal, Anil Hebbar, Abdul Puliyadan Kunnil Muneer, Abhinav Kaushik, Bharat Kumar Padi, Jeroen Bédorf, Tijmen Tieleman
Current Assignee: Sumit Sanyal, Anil Hebbar, Abdul Puliyadan Kunnil Muneer, Abhinav Kaushik, Bharat Kumar Padi, Jeroen Bédorf, Tijmen Tieleman
Current Assignee Address: US CA Santa Cruz
Agent: Patrick Reilly
Main IPC: G06N3/08
IPC: G06N3/08; G06N5/022; G06Q10/0631; G06N3/045

Abstract:

Reinforcement learning enables a framework of information technology assets that include software elements, computational hardware assets, and/or bundled software and computational hardware systems and products. The performance of successive sessions of an inner loop reinforcement learning is directed and monitored by an outer loop reinforcement learning, wherein the outer loop reinforcement learning is designed to reduce financial costs and computational asset requirements and/or optimize learning time in successive instantiations of inner loop reinforcement learning training sessions. The framework enables consideration of the license costs of domain-specific simulators, the usage cost of hardware platforms, and the progress of a particular reinforcement learning training session. The framework further enables reductions of these costs to orchestrate and train a neural network under budget constraints with respect to the hardware and software licenses available at runtime. These improvements and optimizations may be performed by using heuristics and neural network algorithms.
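
The outer-loop/inner-loop arrangement described in the abstract can be illustrated with a minimal sketch. It is not the patented method: it substitutes a simple epsilon-greedy bandit for the outer loop and a stub for the inner training session. Every name, cost figure, and parameter here (ResourceConfig, run_inner_session, outer_loop, cost_weight, the example configurations) is a hypothetical assumption made for illustration only.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class ResourceConfig:
    """Hypothetical bundle of hardware and simulator licenses for one session."""
    name: str
    hardware_cost: float  # assumed usage cost of the hardware platform per session
    license_cost: float   # assumed simulator license cost per session
    speed: float          # stand-in for how much progress one session yields

    @property
    def session_cost(self) -> float:
        return self.hardware_cost + self.license_cost


# Illustrative values only; none of these figures come from the patent.
EXAMPLE_CONFIGS = [
    ResourceConfig("cpu_small", 1.0, 2.0, 0.05),
    ResourceConfig("gpu_single", 4.0, 2.0, 0.15),
    ResourceConfig("gpu_cluster", 12.0, 6.0, 0.30),
]


def run_inner_session(config, progress, rng):
    """Stub for one inner-loop training session.

    Returns the progress gained, which shrinks as progress approaches 1.0.
    """
    noise = rng.uniform(0.8, 1.2)
    return (1.0 - progress) * config.speed * noise


def outer_loop(configs, budget, cost_weight=0.01, epsilon=0.2, seed=0):
    """Epsilon-greedy outer loop: pick a config for each inner session.

    The reward trades progress gained against the session's cost, and the
    loop stops when no config fits in the remaining budget.
    """
    rng = random.Random(seed)
    value = {c.name: 0.0 for c in configs}  # running mean reward per config
    count = {c.name: 0 for c in configs}
    progress, spent, history = 0.0, 0.0, []

    while progress < 0.99:
        affordable = [c for c in configs if spent + c.session_cost <= budget]
        if not affordable:
            break
        untried = [c for c in affordable if count[c.name] == 0]
        if untried:
            choice = untried[0]                # try each config once first
        elif rng.random() < epsilon:
            choice = rng.choice(affordable)    # explore
        else:
            choice = max(affordable, key=lambda c: value[c.name])  # exploit

        gain = run_inner_session(choice, progress, rng)
        progress += gain
        spent += choice.session_cost
        reward = gain - cost_weight * choice.session_cost
        count[choice.name] += 1
        value[choice.name] += (reward - value[choice.name]) / count[choice.name]
        history.append((choice.name, progress, spent))

    return progress, spent, history
```

Calling outer_loop(EXAMPLE_CONFIGS, budget=100.0) returns the final progress, the total spent, and the per-session history. The total spent never exceeds the budget, because a configuration is only eligible when its cost fits in what remains.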

Public/Granted literature

US20200265302A1 METHOD AND APPARATUS FOR REINFORCEMENT LEARNING TRAINING SESSIONS WITH CONSIDERATION OF RESOURCE COSTING AND RESOURCE UTILIZATION
Public/Granted day: 2020-08-20

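IPC Classification:

G	Physics
G06	Computing; calculating or counting
G06N	Computer systems based on specific computational models
G06N3/00	Computer systems based on biological models
G06N3/02	.Using neural network models
G06N3/08	..Learning methods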