Fusion of cluster labeling algorithms by analyzing sub-clusters
Abstract:
According to some embodiments of the present invention, there is provided a computerized method for labeling a cluster of text documents. The method comprises receiving a document cluster and automatically producing multiple document sub-clusters, each determined by randomly changing some of the documents in the cluster. The method applies multiple cluster labeling algorithms to the cluster and to each sub-cluster to generate ordered lists of cluster labels. For each cluster labeling algorithm, the method generates a ranked label list by automatically computing label values, one for each cluster label in the lists produced by that algorithm, and re-ranking the ordered list according to those values. The method then combines the re-ranked label lists using a label fusing algorithm to produce a fused label list.
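The flow described in the abstract can be illustrated with a minimal Python sketch. Everything specific in it is an assumption made for illustration and is not taken from the patent: the two toy labeling algorithms (`tf_labeler`, `df_labeler`), the choice to form sub-clusters by randomly dropping a fraction of the documents, the label value (mean reciprocal rank of a label across the sub-cluster lists, used here as a stability measure), and the use of reciprocal rank fusion as the label fusing algorithm. All function and parameter names are hypothetical.

```python
import random
from collections import Counter, defaultdict


def tf_labeler(docs, k=5):
    """Toy labeler (assumed): rank terms by total frequency in the cluster."""
    counts = Counter(t for d in docs for t in d.lower().split())
    ranked = sorted(counts.items(), key=lambda x: (-x[1], x[0]))
    return [term for term, _ in ranked[:k]]


def df_labeler(docs, k=5):
    """Toy labeler (assumed): rank terms by number of documents containing them."""
    counts = Counter(t for d in docs for t in set(d.lower().split()))
    ranked = sorted(counts.items(), key=lambda x: (-x[1], x[0]))
    return [term for term, _ in ranked[:k]]


def make_subclusters(docs, n_sub=10, drop_frac=0.2, seed=0):
    """Produce sub-clusters by randomly dropping a fraction of the documents.

    Dropping is only one possible reading of "randomly changing some documents".
    """
    rng = random.Random(seed)
    keep = max(1, round(len(docs) * (1 - drop_frac)))
    return [rng.sample(docs, keep) for _ in range(n_sub)]


def rerank(labeler, docs, subclusters, k=5):
    """Re-rank one algorithm's label list by a per-label value.

    Assumed label value: mean reciprocal rank of the label over the lists
    that the same algorithm produces on the sub-clusters.
    """
    base = labeler(docs, k)
    value = defaultdict(float)
    for sub in subclusters:
        for rank, label in enumerate(labeler(sub, k), start=1):
            value[label] += 1.0 / rank
    n = len(subclusters)
    # Higher value first; ties keep the algorithm's original order.
    return sorted(base, key=lambda l: (-value.get(l, 0.0) / n, base.index(l)))


def fuse(ranked_lists, c=60):
    """Combine re-ranked lists with reciprocal rank fusion (assumed fuser)."""
    score = defaultdict(float)
    for labels in ranked_lists:
        for rank, label in enumerate(labels, start=1):
            score[label] += 1.0 / (c + rank)
    return sorted(score, key=lambda l: (-score[l], l))


def label_cluster(docs, labelers=(tf_labeler, df_labeler), k=5, seed=0):
    """Run the whole pipeline: sub-clusters, labeling, re-ranking, fusion."""
    subclusters = make_subclusters(docs, seed=seed)
    return fuse([rerank(f, docs, subclusters, k) for f in labelers])


if __name__ == "__main__":
    cluster = [
        "solar panel energy",
        "solar energy storage",
        "wind energy turbine",
        "solar panel cost",
        "energy grid solar",
    ]
    print(label_cluster(cluster))
```

In this sketch, a label that keeps a high rank even when documents are removed receives a high value and moves up in its algorithm's list before fusion. Other label values or fusion rules could be substituted without changing the overall structure.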