Dimensional reduction of complex vectors in artificially intelligent solutions to compare similarity of natural language text
Abstract:
A web-based tool performs records matching in response to a freeform text input, to find highly contextually-related sentences in a corpus of records. Each sentence in the corpus is converted into a full-size vector representation, and each vector's angle within space is measured. Each full-size vector is compressed to a smaller vector and a loss function is used to preserve for each vector the angle within the lower-dimensional space that existed for the higher-dimensional vector. Full-size and reduced vector representations are generated from the freeform text input. The reduced-size vector of the input is compared to those of the corpus of text to identify, in real-time, a set of vector nearest neighbors that includes, with high accuracy, representations of all records in the corpus similar to the input. Full-size vectors for the nearest neighbors are in turn retrieved and compared to the input, and ranked results are generated.
Information query
Patent Agency Ranking
0/0