Invention Grant
US08909643B2 Inferring emerging and evolving topics in streaming text 有权
推动流媒体文本中新兴和不断发展的话题

Inferring emerging and evolving topics in streaming text
Abstract:
A method, system and computer program product for inferring topic evolution and emergence in a set of documents. In one embodiment, the method comprises forming a group of matrices using text in the documents, and analyzing these matrices to identify a first group of topics as evolving topics and a second group of topics as emerging topics. The matrices includes a first matrix X identifying a multitude of words in each of the documents, a second matrix W identifying a multitude of topics in each of the documents, and a third matrix H identifying a multitude of words for each of the multitude of topics. These matrices are analyzed to identify the evolving and emerging topics. In an embodiment, the documents form a streaming dataset, and two forms of temporal regularizers are used to help identify the evolving topics and the emerging topics in the streaming dataset.
Public/Granted literature
Information query
Patent Agency Ranking
0/0