Invention Grant
- Patent Title: Graph based re-composition of document fragments for name entity recognition under exploitation of enterprise databases
- Application No.: US12413611
- Application Date: 2009-03-30
- Publication No.: US08229883B2
- Publication Date: 2012-07-24
- Inventor: Falk Brauer, Wojciech Barczynski, Hong-Hai Do, Alexander Löser, Marcus Schramm
- Applicant: Falk Brauer, Wojciech Barczynski, Hong-Hai Do, Alexander Löser, Marcus Schramm
- Applicant Address: DE Walldorf
- Assignee: SAP AG
- Current Assignee: SAP AG
- Current Assignee Address: DE Walldorf
- Main IPC: G06F17/20
- IPC: G06F17/20; G06F17/30

Abstract:
Methods and systems are described that involve recognizing complex entities from text documents with the help of structured data and Natural Language Processing (NLP) techniques. In one embodiment, the method includes receiving a document as input from a set of documents, wherein the document contains text or unstructured data. The method also includes identifying a plurality of text segments from the document via a set of tagging techniques. Further, the method includes matching the identified plurality of text segments against attributes of a set of predefined entities. Lastly, a best matching predefined entity is selected for each text segment from the plurality of text segments.

In one embodiment, the system includes a set of documents, each document containing text or unstructured data. The system also includes a database storage unit that stores a set of predefined entities, wherein each entity contains a set of attributes. Further, the system includes a processor to identify a plurality of text segments from a document via a set of tagging techniques and to match the identified plurality of text segments against the set of attributes.
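
The three steps named in the abstract (tag text segments, match them against entity attributes, select the best matching entity per segment) can be illustrated with a minimal Python sketch. This is not the patented method: the abstract does not specify the tagging techniques or the matching score, so the regex tagger, the Jaccard token overlap, and all names (`Entity`, `tag_segments`, `token_overlap`, `best_entity`, `recognize`) are assumptions made purely for illustration.

```python
import re
from dataclasses import dataclass


@dataclass
class Entity:
    """A predefined entity with named attributes (hypothetical schema)."""
    entity_id: str
    attributes: dict


def tag_segments(text):
    """Stand-in tagger (assumption): runs of capitalized words, or digit strings."""
    pattern = r"[A-Z][\w-]*(?:\s+[A-Z][\w-]*)*|\d+"
    return [m.group(0) for m in re.finditer(pattern, text)]


def token_overlap(segment, value):
    """Jaccard similarity over lowercase tokens (assumed scoring function)."""
    a = set(segment.lower().split())
    b = set(str(value).lower().split())
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)


def best_entity(segment, entities):
    """Return (entity, score) for the best attribute match, or (None, 0.0)."""
    best, best_score = None, 0.0
    for entity in entities:
        # Score an entity by its single most similar attribute value.
        score = max(
            (token_overlap(segment, v) for v in entity.attributes.values()),
            default=0.0,
        )
        if score > best_score:
            best, best_score = entity, score
    return best, best_score


def recognize(text, entities):
    """Map each tagged segment to its best matching predefined entity."""
    return {seg: best_entity(seg, entities) for seg in tag_segments(text)}
```

Calling `recognize(text, entities)` with a document string and a list of `Entity` records returns a dictionary keyed by segment. A segment that matches no attribute maps to `(None, 0.0)`, which a caller could treat as "no entity recognized".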