Method for adaptive exploration to accelerate deep reinforcement learning

Invention Grant

US11886988B2 Method for adaptive exploration to accelerate deep reinforcement learning 有权

Please log in to see more content

Patent Title: Method for adaptive exploration to accelerate deep reinforcement learning
Application No.: US15820768

Application Date: 2017-11-22
Publication No.: US11886988B2

Publication Date: 2024-01-30
Inventor: Sakyasingha Dasgupta
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Applicant Address: US NY Armonk
Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
Current Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
Current Assignee Address: US NY Armonk
Agency: Tutunjian & Bitetto, P.C.
Agent Robert Richard Aragona
Main IPC: G06N3/08
IPC: G06N3/08 ; G06N7/01

Method for adaptive exploration to accelerate deep reinforcement learning

Abstract:

Adaptive exploration in deep reinforcement learning may be performed by inputting a current time frame of an action and observation sequence sequentially into a function approximator, such as a deep neural network, including a plurality of parameters, the action and observation sequence including a plurality of time frames, each time frame including action values and observation values, approximating a value function using the function approximator based on the current time frame to acquire a current value, updating an action selection policy through exploration based on an ε-greedy strategy using the current value, and updating the plurality of parameters.

Public/Granted literature

US20190156197A1 METHOD FOR ADAPTIVE EXPLORATION TO ACCELERATE DEEP REINFORCEMENT LEARNING Public/Granted day:2019-05-23

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06N	基于特定计算模型的计算机系统
G06N3/00	基于生物学模型的计算机系统
G06N3/02	.采用神经网络模型
G06N3/08	..学习方法