Expanding indexed terms for searching files
Abstract:
A device implementing a system for expanded search includes a processor configured to identify plural words, and generate, for each word of the plural words, a word vector based on a proximity of the word relative to other words of the plural words, the word vector comprising plural dimensions. The processor is further configured to create a compressed word vector structure comprising clusters of subsets of the plural dimensions across the word vectors, each cluster including similar values of the respective dimensions, convert the word vectors to points on at least one plane, and partition the at least one plane into nested groupings of the points based on a threshold number of points per nested grouping. The processor is further configured to create a tree look-up structure of the nested groupings, and provide the compressed word vector structure and the tree look-up structure to a client device.
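The partitioning and look-up steps described in the abstract can be illustrated with a minimal Python sketch. It is not the claimed implementation. It assumes co-occurrence counts within a fixed window as the proximity measure, a crude two-dimensional projection (sums of even- and odd-indexed dimensions) as the conversion to points on a plane, and a quadtree as the tree look-up structure. The compression step that clusters dimensions is omitted. All function names (cooccurrence_vectors, to_point, build_tree, lookup) and the parameters window, threshold, and max_depth are hypothetical.

```python
from collections import defaultdict


def cooccurrence_vectors(tokens, window=2):
    """Build one vector per distinct word from counts of words near it."""
    vocab = sorted(set(tokens))
    index = {w: i for i, w in enumerate(vocab)}
    vectors = {w: [0] * len(vocab) for w in vocab}
    for i, word in enumerate(tokens):
        lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
        for j in range(lo, hi):
            if j != i:
                vectors[word][index[tokens[j]]] += 1
    return vocab, vectors


def to_point(vector):
    """Stand-in projection (an assumption): sum even and odd dimensions."""
    return (float(sum(vector[0::2])), float(sum(vector[1::2])))


def build_tree(items, bounds, threshold, depth=0, max_depth=16):
    """Split a region into quadrants until each holds <= threshold points.

    max_depth stops the recursion when many points coincide exactly.
    """
    x0, y0, x1, y1 = bounds
    if len(items) <= threshold or depth >= max_depth:
        return {"words": sorted(word for word, _ in items)}
    mx, my = (x0 + x1) / 2, (y0 + y1) / 2
    quads = defaultdict(list)
    for word, (x, y) in items:
        quads[(x >= mx, y >= my)].append((word, (x, y)))
    children = {}
    for (right, top), sub in quads.items():
        child_bounds = (mx if right else x0, my if top else y0,
                        x1 if right else mx, y1 if top else my)
        children[(right, top)] = build_tree(
            sub, child_bounds, threshold, depth + 1, max_depth)
    return {"mid": (mx, my), "children": children}


def lookup(tree, point):
    """Descend to the leaf grouping that contains the point."""
    node = tree
    while "children" in node:
        mx, my = node["mid"]
        node = node["children"].get((point[0] >= mx, point[1] >= my))
        if node is None:
            return []  # no indexed word falls in this quadrant
    return node["words"]


tokens = "the cat sat on the mat the dog sat on the rug".split()
vocab, vectors = cooccurrence_vectors(tokens)
points = {w: to_point(vectors[w]) for w in vocab}
xs = [p[0] for p in points.values()]
ys = [p[1] for p in points.values()]
tree = build_tree(list(points.items()),
                  (min(xs), min(ys), max(xs), max(ys)), threshold=2)
expanded = lookup(tree, points["cat"])  # words grouped with "cat"
```

In this sketch a client would expand a search term by looking up its point and treating the other words in the same leaf grouping as candidate expansion terms. That usage is an assumption about how the delivered structures are consumed, not something the abstract states.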