Interaction network inference from vector representation of words
Abstract:
The present disclosure relates to a system and method for extracting information from text data. The method comprises obtaining a plurality of text elements. A word embedding algorithm may be applied to the obtained text elements by mapping each text element of at least part of the text elements into a vector of a predefined dimension. The mapped text elements may be clustered into groups using the distances between the respective vectors. For each text element of a set of text elements of the mapped text elements, a respective distribution of neighbors across the groups may be built. Similarity scores may be computed using the distributions, thereby determining relations between the text elements of the set.
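The steps named in the abstract can be illustrated with a minimal sketch. It is not the disclosed implementation and rests on several assumptions. The vectors are hypothetical toy values standing in for the output of a word embedding algorithm. The clustering is a simple distance-threshold linkage chosen for brevity. Neighbors are taken as the k nearest vectors. The similarity score is the cosine between neighbor distributions. All names (`VECTORS`, `cluster_by_distance`, `neighbor_distribution`, `cosine`) and the parameters `threshold` and `k` are illustrative only.

```python
import math
from collections import Counter

# Hypothetical 2-D vectors standing in for word embeddings (assumption:
# a real system would obtain these from a word embedding algorithm).
VECTORS = {
    "term_a1": (0.0, 0.1),
    "term_a2": (0.1, 0.0),
    "term_a3": (0.2, 0.1),
    "term_b1": (5.0, 5.1),
    "term_b2": (5.1, 5.0),
    "term_b3": (5.2, 5.2),
}


def cluster_by_distance(vectors, threshold):
    """Group elements linked by chains of pairwise distances <= threshold."""
    groups = {}
    next_id = 0
    for start in sorted(vectors):
        if start in groups:
            continue
        groups[start] = next_id
        stack = [start]
        while stack:
            current = stack.pop()
            for other in vectors:
                close = math.dist(vectors[current], vectors[other]) <= threshold
                if other not in groups and close:
                    groups[other] = next_id
                    stack.append(other)
        next_id += 1
    return groups


def neighbor_distribution(name, vectors, groups, k):
    """Fraction of the k nearest neighbors of `name` that fall in each group."""
    others = sorted(
        (n for n in vectors if n != name),
        key=lambda n: (math.dist(vectors[name], vectors[n]), n),
    )
    counts = Counter(groups[n] for n in others[:k])
    n_groups = max(groups.values()) + 1
    return [counts[g] / k for g in range(n_groups)]


def cosine(p, q):
    """Cosine similarity between two distributions (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(p, q))
    norm = math.sqrt(sum(a * a for a in p)) * math.sqrt(sum(b * b for b in q))
    return dot / norm if norm else 0.0


groups = cluster_by_distance(VECTORS, threshold=1.0)
distributions = {
    name: neighbor_distribution(name, VECTORS, groups, k=2) for name in VECTORS
}
score_same = cosine(distributions["term_a1"], distributions["term_a2"])
score_diff = cosine(distributions["term_a1"], distributions["term_b1"])
print(groups)
print(score_same, score_diff)
```

In this toy setting, elements whose neighbors fall into the same groups receive a high similarity score, which is the signal the abstract describes for determining relations between text elements. Any clustering method or similarity measure could be substituted for the ones assumed here.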