Invention Grant
US08086548B2 Measuring document similarity by inferring evolution of documents through reuse of passage sequences
Active
- Patent Title: Measuring document similarity by inferring evolution of documents through reuse of passage sequences
- Application No.: US12774426
- Application Date: 2010-05-05
- Publication No.: US08086548B2
- Publication Date: 2011-12-27
- Inventor: Oliver Brdiczka, Maurice K. Chu
- Applicant: Oliver Brdiczka, Maurice K. Chu
- Applicant Address: US CA Palo Alto
- Assignee: Palo Alto Research Center Incorporated
- Current Assignee: Palo Alto Research Center Incorporated
- Current Assignee Address: US CA Palo Alto
- Agency: Park, Vaughan, Fleming & Dowler LLP
- Agent: Shun Yao
- Main IPC: G06F15/18
- IPC: G06F15/18

Abstract:
One embodiment of the present invention provides a system for estimating document similarity. During operation, the system selects a collection of documents which includes a first set of passages, constructs a passage-sequence model based on the first set of passages, receives a new document which includes a second set of passages, and determines a sequence of operations associated with the new document in relation to the collection of documents based on the constructed passage-sequence model.
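The abstract does not say how the passage-sequence model is built or how the operations are inferred, so the following is only a minimal sketch under stated assumptions, not the patented method. It assumes a passage is one non-empty line of text, that the "model" is simply each document's ordered list of passages, and that the sequence of operations can be approximated by a standard sequence alignment from Python's `difflib`. The names `split_passages`, `build_model`, and `infer_operations` are hypothetical.

```python
from difflib import SequenceMatcher

# Assumption: difflib's opcode tags are renamed so that unchanged runs
# read as "reuse" of an existing passage sequence.
OP_NAMES = {"equal": "reuse", "insert": "insert",
            "delete": "delete", "replace": "replace"}


def split_passages(text):
    """Assumption: a passage is one non-empty line, stripped of whitespace."""
    return [line.strip() for line in text.splitlines() if line.strip()]


def build_model(collection):
    """Stand-in 'passage-sequence model': doc id -> ordered list of passages."""
    return {doc_id: split_passages(text) for doc_id, text in collection.items()}


def infer_operations(model, new_text):
    """Find the closest known document and the operations leading to new_text.

    Returns (doc_id, similarity, operations), where each operation is
    (name, old_start, old_end, new_start, new_end) over passage indices.
    """
    if not model:
        raise ValueError("model must contain at least one document")
    new_passages = split_passages(new_text)
    best = None
    for doc_id, old_passages in model.items():
        matcher = SequenceMatcher(None, old_passages, new_passages, autojunk=False)
        score = matcher.ratio()
        if best is None or score > best[1]:
            ops = [(OP_NAMES[tag], i1, i2, j1, j2)
                   for tag, i1, i2, j1, j2 in matcher.get_opcodes()]
            best = (doc_id, score, ops)
    return best


collection = {
    "v1": "intro\nmethods\nresults",
    "other": "alpha\nbeta",
}
model = build_model(collection)
doc_id, similarity, operations = infer_operations(
    model, "intro\nmethods\ndiscussion\nresults"
)
```

In this toy run the new document is matched to `v1`, and the inferred operations are a reused run of two passages, one inserted passage, and one more reused passage. A real system would likely need fuzzy passage matching and a model that spans several source documents at once, which this sketch does not attempt.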
Public/Granted literature
- US20110276523A1 MEASURING DOCUMENT SIMILARITY BY INFERRING EVOLUTION OF DOCUMENTS THROUGH REUSE OF PASSAGE SEQUENCES (Public/Granted day: 2011-11-10)