Invention Grant
- Patent Title: Integrating and extracting topics from content of heterogeneous sources
- Patent Title (中): 从异构源的内容集成和提取主题
-
Application No.: US14014122Application Date: 2013-08-29
-
Publication No.: US09176969B2Publication Date: 2015-11-03
- Inventor: Sitaram Asur , Rumi Ghosh
- Applicant: Hewlett-Packard Development Company, L.P.
- Applicant Address: US TX Houston
- Assignee: Hewlett-Packard Development Company, L.P.
- Current Assignee: Hewlett-Packard Development Company, L.P.
- Current Assignee Address: US TX Houston
- Agency: Hewlett-Packard Patent Department
- Main IPC: G06F17/30
- IPC: G06F17/30

Abstract:
Examples relate to integrating and extracting topics from content of heterogeneous sources. Observed words are identified in documents, which are received from the heterogeneous sources. Next, document metadata and source metadata are obtained from the heterogeneous sources. The document metadata is used to calculate word topic probabilities for the observed words, and the source metadata is used to calculate source topic probabilities for the observed words. A latent topic is then determined for one of the documents based on the observed words, the word topic probabilities, and the source topic probabilities.
Public/Granted literature
- US20150066904A1 INTEGRATING AND EXTRACTING TOPICS FROM CONTENT OF HETEROGENEOUS SOURCES Public/Granted day:2015-03-05
Information query