Invention Grant
US08370328B2 System and method for creating and maintaining a database of disambiguated entity mentions and relations from a corpus of electronic documents
Status: In force
- Patent Title: System and method for creating and maintaining a database of disambiguated entity mentions and relations from a corpus of electronic documents
- Application No.: US13206492
- Application Date: 2011-08-09
- Publication No.: US08370328B2
- Publication Date: 2013-02-05
- Inventor: Michael A. Woytowitz, Marshall Wells Hawks
- Applicant: Michael A. Woytowitz, Marshall Wells Hawks
- Applicant Address: Hunt Valley, MD, US
- Assignee: Comsort, Inc.
- Current Assignee: Comsort, Inc.
- Current Assignee Address: Hunt Valley, MD, US
- Agency: Law Offices of Grady L. White, LLC
- Main IPC: G06F7/00
- IPC: G06F7/00

Abstract:
Method and apparatus for creating an electronic database of disambiguated entity mentions and relations from a corpus of electronic documents. The invention automatically extracts from the corpus of electronic documents mentions about entities (e.g., references to people, organizations or places), parses the entity mentions into “mention objects,” and executes a series of grouping, comparison and hierarchical fuzzy object clustering algorithms to cluster together in an electronic database all of the mention objects referring to the same entity and all of the mention objects (e.g., “people”) associated with each other by a relationship (e.g., “co-authors” or “family members”). The resulting electronic database of disambiguated entity mentions and relations, which may comprise, for example, an XML document, a relational database or a hierarchical database, is structured to permit useful recordation, access, review and display of all of the mentions and relations associated with a particular entity or collection of entities.
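
To make the idea of clustering "mention objects" concrete, the following is a minimal illustrative sketch in Python. It is not the patented method: the `Mention` fields, the 0.7/0.3 weighting, the 0.8 threshold, and the use of `difflib.SequenceMatcher` with simple single-link grouping are all assumptions chosen for brevity, standing in for the grouping, comparison and hierarchical fuzzy clustering steps the abstract names without specifying.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher


@dataclass(frozen=True)
class Mention:
    """Hypothetical 'mention object': one reference to an entity in one document."""
    doc_id: str
    name: str
    affiliation: str = ""


def similarity(a: Mention, b: Mention) -> float:
    """Fuzzy score in [0, 1]: weighted name and affiliation string similarity.

    The 0.7 / 0.3 weights are illustrative assumptions, not values from the patent.
    """
    name_score = SequenceMatcher(None, a.name.lower(), b.name.lower()).ratio()
    aff_score = SequenceMatcher(None, a.affiliation.lower(), b.affiliation.lower()).ratio()
    return 0.7 * name_score + 0.3 * aff_score


def cluster_mentions(mentions, threshold=0.8):
    """Single-link grouping: merge any two mentions whose score meets the threshold."""
    parent = list(range(len(mentions)))

    def find(i):
        # Follow parent links to the cluster root, compressing the path as we go.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i in range(len(mentions)):
        for j in range(i + 1, len(mentions)):
            if similarity(mentions[i], mentions[j]) >= threshold:
                parent[find(j)] = find(i)

    clusters = {}
    for i, mention in enumerate(mentions):
        clusters.setdefault(find(i), []).append(mention)
    return list(clusters.values())


if __name__ == "__main__":
    sample = [
        Mention("doc-1", "Michael A. Woytowitz", "Comsort, Inc."),
        Mention("doc-2", "Michael Woytowitz", "Comsort Inc"),
        Mention("doc-3", "Marshall Wells Hawks", "Comsort, Inc."),
    ]
    for group in cluster_mentions(sample):
        print([m.name for m in group])
```

In this sketch the two spellings of the first inventor's name fall into one cluster, while the second inventor stays separate even though the affiliation matches, because the name similarity dominates the score. A real system would likely need richer features and a pairwise comparison strategy that scales beyond the quadratic loop used here.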