Heuristic identification of shared substrings between text documents
Abstract:
Technologies for document evaluation and identification of shared textual substrings between documents are described herein. Documents are evaluated and organized according to textual elements within the documents. A suffix index is generated from a reference document. The suffix index is used to identify common substrings of text within query documents using variable evaluation windows within the query documents. Indications of overlapping textual information between the reference document and query documents are generated as an output.
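The flow summarized in the abstract can be illustrated with a minimal sketch. The abstract does not specify how the suffix index is built or how the evaluation window varies, so everything here is an assumption: the index is a naive sorted suffix array, the window starts at a hypothetical `min_len` and grows one character at a time while a match still exists, and the function names `build_suffix_index`, `contains`, and `shared_substrings` are illustrative only.

```python
def build_suffix_index(reference: str) -> list[int]:
    # Assumed index form: suffix start offsets, sorted by the suffix text.
    # Naive construction, adequate only for short illustrative inputs.
    return sorted(range(len(reference)), key=lambda i: reference[i:])


def contains(reference: str, index: list[int], pattern: str) -> bool:
    # Binary search for the first suffix whose leading characters are
    # not smaller than the pattern, then check for an exact prefix match.
    lo, hi = 0, len(index)
    while lo < hi:
        mid = (lo + hi) // 2
        start = index[mid]
        if reference[start:start + len(pattern)] < pattern:
            lo = mid + 1
        else:
            hi = mid
    return lo < len(index) and reference.startswith(pattern, index[lo])


def shared_substrings(reference: str, index: list[int], query: str,
                      min_len: int = 4) -> list[tuple[int, str]]:
    # Slide over the query; where a min_len window matches the reference,
    # widen the window until the match breaks (assumed growth rule).
    hits = []
    pos = 0
    while pos + min_len <= len(query):
        width = min_len
        if not contains(reference, index, query[pos:pos + width]):
            pos += 1
            continue
        while (pos + width < len(query)
               and contains(reference, index, query[pos:pos + width + 1])):
            width += 1
        hits.append((pos, query[pos:pos + width]))
        pos += width  # resume after the reported overlap
    return hits


reference = "the quick brown fox"
index = build_suffix_index(reference)
overlaps = shared_substrings(reference, index, "a quick brown dog")
```

Each entry in `overlaps` pairs an offset in the query document with the text it shares with the reference document, which corresponds to the "indications of overlapping textual information" named in the abstract. A production system would likely replace the naive index construction with a linear-time suffix array or suffix tree.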