Invention Grant
- Patent Title: Identifying entities in a digital work
-
Application No.: US13431838Application Date: 2012-03-27
-
Publication No.: US09639518B1Publication Date: 2017-05-02
- Inventor: Joshua M. Goodspeed , Janna S. Hamaker , Adam J. Iser , Tom Killalea , Abhishek Patnia , Alla Taborisskaya
- Applicant: Joshua M. Goodspeed , Janna S. Hamaker , Adam J. Iser , Tom Killalea , Abhishek Patnia , Alla Taborisskaya
- Applicant Address: US WA Seattle
- Assignee: Amazon Technologies, Inc.
- Current Assignee: Amazon Technologies, Inc.
- Current Assignee Address: US WA Seattle
- Agency: Lee & Hayes, PLLC
- Main IPC: G06F17/27
- IPC: G06F17/27 ; G06F3/00 ; G06F3/048 ; G06F17/00

Abstract:
In some implementations, text is extracted from a digital work and proper nouns are identified in the text to generate a list of names. The list of names may be sorted so that names containing more information are positioned toward the beginning of the list. The list may be traversed to cluster names and alternate names into name sets that correspond to particular entities in the digital work. Non-unique names that appear in more than one name set may be disambiguated based on proximity to unique names in the same name sets to determine which occurrences of the non-unique names belong with which name sets. Furthermore, a representative name may be selected from among multiple names in a name set for use in representing an entity or object corresponding to the name set. In some examples, the representative name may be selected based on a fullness of the name.
Information query