Invention Grant
- Patent Title: Inferring emerging and evolving topics in streaming text
- Patent Title (中): 推动流媒体文本中新兴和不断发展的话题
-
Application No.: US13315798Application Date: 2011-12-09
-
Publication No.: US08909643B2Publication Date: 2014-12-09
- Inventor: Saha Ankan , Arindam Banerjee , Shiva P. Kasiviswanathan , Richard D. Lawrence , Prem Melville , Vikas Sindhwani , Edison L. Ting
- Applicant: Saha Ankan , Arindam Banerjee , Shiva P. Kasiviswanathan , Richard D. Lawrence , Prem Melville , Vikas Sindhwani , Edison L. Ting
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agency: Scully, Scott, Murphy & Presser, P.C.
- Agent Daniel P. Morris, Esq.
- Main IPC: G06F17/30
- IPC: G06F17/30

Abstract:
A method, system and computer program product for inferring topic evolution and emergence in a set of documents. In one embodiment, the method comprises forming a group of matrices using text in the documents, and analyzing these matrices to identify a first group of topics as evolving topics and a second group of topics as emerging topics. The matrices includes a first matrix X identifying a multitude of words in each of the documents, a second matrix W identifying a multitude of topics in each of the documents, and a third matrix H identifying a multitude of words for each of the multitude of topics. These matrices are analyzed to identify the evolving and emerging topics. In an embodiment, the documents form a streaming dataset, and two forms of temporal regularizers are used to help identify the evolving topics and the emerging topics in the streaming dataset.
Public/Granted literature
- US20130151520A1 INFERRING EMERGING AND EVOLVING TOPICS IN STREAMING TEXT Public/Granted day:2013-06-13
Information query