Invention Grant
- Patent Title: Method for adaptive exploration to accelerate deep reinforcement learning
-
Application No.: US15820768Application Date: 2017-11-22
-
Publication No.: US11886988B2Publication Date: 2024-01-30
- Inventor: Sakyasingha Dasgupta
- Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Applicant Address: US NY Armonk
- Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee Address: US NY Armonk
- Agency: Tutunjian & Bitetto, P.C.
- Agent Robert Richard Aragona
- Main IPC: G06N3/08
- IPC: G06N3/08 ; G06N7/01

Abstract:
Adaptive exploration in deep reinforcement learning may be performed by inputting a current time frame of an action and observation sequence sequentially into a function approximator, such as a deep neural network, including a plurality of parameters, the action and observation sequence including a plurality of time frames, each time frame including action values and observation values, approximating a value function using the function approximator based on the current time frame to acquire a current value, updating an action selection policy through exploration based on an ε-greedy strategy using the current value, and updating the plurality of parameters.
Public/Granted literature
- US20190156197A1 METHOD FOR ADAPTIVE EXPLORATION TO ACCELERATE DEEP REINFORCEMENT LEARNING Public/Granted day:2019-05-23
Information query