Invention Grant
- Patent Title: Document analysis and multi-word term detector
- Application No.: US13310821
- Application Date: 2011-12-05
- Publication No.: US08458198B1
- Publication Date: 2013-06-04
- Inventor: Michael J. Welch, Walter W. Chang
- Applicant: Michael J. Welch, Walter W. Chang
- Applicant Address: US CA San Jose
- Assignee: Adobe Systems Incorporated
- Current Assignee: Adobe Systems Incorporated
- Current Assignee Address: US CA San Jose
- Agency: Kilpatrick Townsend & Stockton LLP
- Main IPC: G06F17/30
- IPC: G06F17/30

Abstract:
A term analyzer receives an ordered collection of text-based terms. The term analyzer analyzes groupings of consecutive text-based terms in the ordered collection to identify occurrences of different combinations of text-based terms. In addition, the term analyzer maintains frequency information representing the occurrences of the different combinations of text-based terms in the collection. The frequency information can then be used to determine relatively significant keywords and/or keyword phrases in the document. In an example configuration, the term analyzer creates a tree in which a first term in a given grouping of the groupings is defined as a parent node in the tree and a second term in the given grouping is defined as a child node of the parent node in the tree. The method of the analyzer generalizes to create a tree of multi-word terms in which the terms can be efficiently ranked by occurrence.
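
The tree described in the abstract can be illustrated with a minimal sketch. This is not the patented implementation; it only assumes that each grouping of consecutive terms is stored as a root-to-node path, with a counter on every node recording how often that combination occurred. The names `TermNode`, `build_term_tree`, `ranked_phrases`, and the `max_len` and `min_len` parameters are hypothetical and chosen for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple


@dataclass
class TermNode:
    """One term in the tree; `count` is how often the path ending here occurred."""
    count: int = 0
    children: Dict[str, "TermNode"] = field(default_factory=dict)


def build_term_tree(terms: List[str], max_len: int = 3) -> TermNode:
    """Insert every grouping of up to `max_len` consecutive terms into a tree.

    The first term of a grouping becomes a child of the root (the parent node),
    the second term becomes a child of the first, and so on.
    """
    root = TermNode()
    for start in range(len(terms)):
        node = root
        for term in terms[start:start + max_len]:
            node = node.children.setdefault(term, TermNode())
            node.count += 1
    return root


def ranked_phrases(root: TermNode, min_len: int = 2) -> List[Tuple[str, int]]:
    """Return multi-word terms sorted by descending count, then alphabetically."""
    results = []
    stack = [((), root)]
    while stack:
        path, node = stack.pop()
        if len(path) >= min_len:
            results.append((" ".join(path), node.count))
        for term, child in node.children.items():
            stack.append((path + (term,), child))
    return sorted(results, key=lambda item: (-item[1], item[0]))


terms = "new york city is near new york state".split()
tree = build_term_tree(terms, max_len=2)
top_phrase = ranked_phrases(tree)[0]
```

With the sample input, the node for "york" under "new" carries a count of 2, so "new york" ranks ahead of the groupings that occur only once. A real analyzer would likely also filter stop words and normalize case, which this sketch omits.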