Abstract:
A training apparatus 1000 using a method of decoding nerve activity includes: a brain activity detecting device 108 for detecting brain activity at a prescribed area within a brain of a subject; and an output device 130 for presenting neurofeedback information (presentation information) to the subject. A processing device 102 decodes a pattern of cranial nerve activity, generates a reward value based on a degree of similarity of the decoded pattern with respect to a target activation pattern obtained in advance for the event as the object of training, and generates presentation information corresponding to the reward value.
Abstract:
A novel highly-adaptable agent learning machine comprises a plurality of learning modules (3) each including a set of an intensive learning system (1) which works on an environment (4) and determines a behavior output for maximizing the reward given as a result of this and an environment predicting system (2) for predicting change of the environment. The smaller the prediction error of the environment predicting system (2) of each learning module (3) is, the larger the responsibility signal is required to have. In proportion to the responsibility signal, the behavior output from the intensive learning system (1) is weighted, and a behavior affecting the environment is given. In an environment having a nonlinearity/unsteadiness, such as a control object or a system, no specific teacher signal is given. The states of various environments and behaviors optimal to the operating modes are switched and combined. Without using foresight knowledge, behavior can be learned flexibly.