Analysis of theme coverage of documents
Abstract:
According to an aspect of an embodiment, a method may include obtaining multiple electronic documents and obtaining a theme text. The method may also include selecting a seed text based on a semantic similarity between the seed text and the theme text. The method may also include changing a seed weight included in a weight vector that is used in identification of topics of the multiple electronic documents. The changed seed weight may bias the identification of topics of the multiple electronic documents in favor of the seed text as compared to one or more other text strings of the weight vector. The method may also include generating a representation of a topic model for display to a user, where the topic model may be based on the multiple electronic documents and the weight vector.
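The seed selection and weight change described above can be illustrated with a minimal sketch. The abstract does not state how semantic similarity is measured or how the weight vector enters topic identification, so everything here is an assumption made for illustration: cosine similarity over toy embedding vectors stands in for the similarity measure, the weight vector is a plain mapping from text string to weight, and the hypothetical helpers `select_seed`, `bias_weights`, and `weighted_counts` (with a made-up `boost` factor) show only how a raised seed weight could tilt the term counts that a topic model would consume.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def select_seed(theme_vec, candidates):
    """Pick the candidate text whose embedding is most similar to the theme."""
    return max(candidates, key=lambda text: cosine(theme_vec, candidates[text]))

def bias_weights(weights, seed, boost=5.0):
    """Return a copy of the weight vector with the seed weight multiplied by boost."""
    biased = dict(weights)
    biased[seed] = biased.get(seed, 1.0) * boost
    return biased

def weighted_counts(documents, weights):
    """Per-document term counts, each occurrence scaled by its weight."""
    rows = []
    for doc in documents:
        row = {}
        for token in doc.lower().split():
            row[token] = row.get(token, 0.0) + weights.get(token, 1.0)
        rows.append(row)
    return rows

# Toy embeddings (assumed values, not taken from the source).
embeddings = {
    "battery": [0.9, 0.1, 0.0],
    "court": [0.0, 0.2, 0.9],
    "profit": [0.1, 0.9, 0.1],
}
theme_vec = [0.8, 0.2, 0.1]  # stands in for an embedded theme text

seed = select_seed(theme_vec, embeddings)
weights = bias_weights({text: 1.0 for text in embeddings}, seed)
docs = ["battery profit court", "court court profit"]
rows = weighted_counts(docs, weights)
```

In this sketch the weighted counts in `rows` would be handed to whatever topic-identification step is in use; a seeded topic model might instead apply the weight vector as a prior over topic-word distributions, which is a different design that the abstract neither confirms nor rules out.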