Triage of training data for acceleration of large-scale machine learning
Abstract:
Triage of training data for acceleration of large-scale machine learning is provided. In various embodiments, training input from a set of training data is provided to an artificial neural network. The artificial neural network comprises a plurality of output neurons. Each output neuron corresponds to a class. From the artificial neural network, output values are determined at each of the plurality of output neurons. From the output values, a classification of the training input by the artificial neural network is determined. A confidence value of the classification is determined. Based on the confidence value, a probability of inclusion of the training input in subsequent training is determined. A subset of the set of training data is determined based on the probability. The artificial neural network is trained based on the subset.
Information query
Patent Agency Ranking
0/0