Machine learning for ranking candidate subjects based on a training set
Abstract:
According to an embodiment of the present invention, a system designates each document in a collection of documents as a member of a first group containing known subjects for a concept of interest or as a member of a second group containing candidate subjects for the concept of interest and determines a subset of documents for at least one subject. The system generates a classifier based on the documents in the first and second groups and applies the classifier to a set of documents for the at least one subject to determine whether each document belongs to the first and/or second group. The system generates a score for the at least one subject based on a quantity of documents for that subject assigned to the first group of documents relative to a total quantity of documents for that subject and ranks that subject based on the determined score for each subject.
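The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not name a classifier type, so a toy naive Bayes word model is swapped in here purely as an assumption, and the names `train_classifier`, `rank_subjects`, and `docs_by_subject` are hypothetical. The score for each subject is the number of its documents assigned to the first group divided by its total number of documents, as the abstract states.

```python
import math
from collections import Counter


def train_classifier(known_docs, candidate_docs):
    """Train a toy naive Bayes model (an assumed classifier, not from the source).

    known_docs: documents in the first group (known subjects).
    candidate_docs: documents in the second group (candidate subjects).
    Returns a function mapping a document to True when it is assigned
    to the first group.
    """
    known_counts = Counter(w for d in known_docs for w in d.lower().split())
    cand_counts = Counter(w for d in candidate_docs for w in d.lower().split())
    vocab = set(known_counts) | set(cand_counts)
    n_known = sum(known_counts.values())
    n_cand = sum(cand_counts.values())
    # Log prior odds from the relative sizes of the two groups.
    prior = math.log(len(known_docs) / len(candidate_docs))

    def in_first_group(doc):
        log_odds = prior
        for word in doc.lower().split():
            if word in vocab:
                # Laplace-smoothed word likelihoods for each group.
                p_known = (known_counts[word] + 1) / (n_known + len(vocab))
                p_cand = (cand_counts[word] + 1) / (n_cand + len(vocab))
                log_odds += math.log(p_known / p_cand)
        return log_odds > 0

    return in_first_group


def rank_subjects(docs_by_subject, in_first_group):
    """Score each subject and return (subject, score) pairs, best first.

    score = documents assigned to the first group / total documents.
    Subjects with no documents are skipped to avoid dividing by zero.
    """
    scores = {}
    for subject, docs in docs_by_subject.items():
        if not docs:
            continue
        hits = sum(1 for d in docs if in_first_group(d))
        scores[subject] = hits / len(docs)
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)
```

In this sketch a candidate subject rises in the ranking when a larger share of its documents resembles the documents of known subjects; any classifier exposing the same document-to-group decision could be substituted for the toy model.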