Invention Grant
- Patent Title: Identifying topics in a digital work
- Application No.: US13433028
- Application Date: 2012-03-28
- Publication No.: US09613003B1
- Publication Date: 2017-04-04
- Inventor: Joshua M. Goodspeed, Janna S. Hamaker, Adam J. Iser, Tom Killalea, Abhishek Patnia, Alla Taborisskaya
- Applicant: Joshua M. Goodspeed, Janna S. Hamaker, Adam J. Iser, Tom Killalea, Abhishek Patnia, Alla Taborisskaya
- Applicant Address: US NV Reno
- Assignee: Amazon Technologies, Inc.
- Current Assignee: Amazon Technologies, Inc.
- Current Assignee Address: US NV Reno
- Agency: Lee & Hayes, PLLC
- Main IPC: G06F17/27
- IPC: G06F17/27; G06F3/00; G06F3/048; G06F17/00; G06F17/21

Abstract:
In some implementations, text is extracted from a digital work and a plurality of noun phrases are identified. The noun phrases are checked against a network accessible resource, such as an online encyclopedia, that includes a plurality of interlinked article entries. The noun phrases that have corresponding entries in the network accessible resource are included in a set of candidate topics. The candidate topics are ranked based, at least in part, on the links to and from each of the entries corresponding to the candidate topics. Candidate topics below a ranking threshold are removed from the set of candidate topics. Further, term frequency information for each candidate topic in relation to the digital work is compared against term frequency information for the candidate topic in a large corpus of textual works to remove candidate topics within a frequency difference threshold.
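
The filtering steps described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The function name `identify_topics`, its parameters, the link-counting score, and both threshold values are assumptions made for illustration; the abstract only states that ranking is based "at least in part" on links to and from the entries, and it does not specify how frequencies are compared. Noun-phrase extraction and the encyclopedia lookup are replaced here by plain in-memory data.

```python
from collections import Counter


def identify_topics(noun_phrases, encyclopedia_links, work_tf, corpus_tf,
                    min_link_score=2, min_tf_difference=0.001):
    """Hypothetical sketch of the abstract's candidate-topic filtering.

    encyclopedia_links maps an entry title to the titles it links to.
    work_tf and corpus_tf map a phrase to its relative term frequency.
    """
    # Step 1: keep only noun phrases that have an encyclopedia entry.
    candidates = {p for p in noun_phrases if p in encyclopedia_links}

    # Step 2: score each candidate by links to and from other candidates
    # (assumed scoring: one point per outbound link, one per inbound link).
    scores = Counter()
    for source in candidates:
        for target in encyclopedia_links[source]:
            if target in candidates and target != source:
                scores[source] += 1  # link from this entry
                scores[target] += 1  # link to that entry

    # Step 3: rank, then drop candidates below the ranking threshold.
    ranked = sorted(candidates, key=lambda p: (-scores[p], p))
    ranked = [p for p in ranked if scores[p] >= min_link_score]

    # Step 4: drop candidates whose frequency in the work is within the
    # difference threshold of their frequency in the large corpus.
    return [
        p for p in ranked
        if abs(work_tf.get(p, 0.0) - corpus_tf.get(p, 0.0)) > min_tf_difference
    ]


# Toy data, invented for this example.
noun_phrases = ["white whale", "captain ahab", "ship", "long day"]
encyclopedia_links = {
    "white whale": ["captain ahab", "ship"],
    "captain ahab": ["white whale", "ship"],
    "ship": ["white whale"],
}
work_tf = {"white whale": 0.004, "captain ahab": 0.006, "ship": 0.0102}
corpus_tf = {"white whale": 0.00001, "captain ahab": 0.0, "ship": 0.01}

topics = identify_topics(noun_phrases, encyclopedia_links, work_tf, corpus_tf)
print(topics)  # ['white whale', 'captain ahab']
```

In this toy run, "long day" is dropped because it has no entry, and "ship" is dropped because it is about as common in the work as in the general corpus, so it is unlikely to be distinctive of the work.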