Invention Grant
US08595245B2 Reference resolution for text enrichment and normalization in mining mixed data
有权
文本丰富和采矿混合数据正常化的参考决议
- Patent Title: Reference resolution for text enrichment and normalization in mining mixed data
- Patent Title (中): 文本丰富和采矿混合数据正常化的参考决议
-
Application No.: US11493085Application Date: 2006-07-26
-
Publication No.: US08595245B2Publication Date: 2013-11-26
- Inventor: Bruno Cavestro , Jean-Michel Renders
- Applicant: Bruno Cavestro , Jean-Michel Renders
- Applicant Address: US CT Norwalk
- Assignee: Xerox Corporation
- Current Assignee: Xerox Corporation
- Current Assignee Address: US CT Norwalk
- Agency: Fay Sharpe LLP
- Main IPC: G06F7/00
- IPC: G06F7/00 ; G06F17/30 ; G06F17/00

Abstract:
A method for enrichment of text which enables mixed data mining includes generating a model for structured data found in tables of a database. In the model, semantically-linked terms are associated with referents, such as field names or cell content of the fields, of the structured data. The referents may be a business object or refer to a business object. A plurality of candidate referring entities in textual data in the database, such as chunks of free text, is identified. For each candidate referring entity, a similarity measure between the candidate referring entity in the textual data and the model is computed to identify referring entities of the candidate referring entities and corresponding business objects/referents to which the referring entities refer. The textual data is enriched with information derived from the business objects.
Public/Granted literature
- US20080027893A1 Reference resolution for text enrichment and normalization in mining mixed data Public/Granted day:2008-01-31
Information query