Invention Grant
US08868559B2 Representative document selection for a set of duplicate documents 有权
代表文件选择一套重复的文件

Representative document selection for a set of duplicate documents
Abstract:
Systems and methods for indexing a representative document from a set of duplicate documents are disclosed. Disclosed systems and methods comprise selecting a first document in a plurality of documents on the basis that the first document is associated with a query independent score. Each respective document in the plurality of documents has a fingerprint that indicates that the respective document has substantially identical content to every other document in the plurality of documents. Disclosed systems and methods further comprise indexing, in accordance with the query independent score, the first document thereby producing an indexed first document. With respect to the plurality of documents, only the indexed first document is included in a document index.
Public/Granted literature
Information query
Patent Agency Ranking
0/0