Invention Grant
- Patent Title: Method and system for assessing similarity of documents
-
Application No.: US16692005Application Date: 2019-11-22
-
Publication No.: US10970536B2Publication Date: 2021-04-06
- Inventor: Jeroen Mattijs van Rotterdam , Michael T Mohen , Chao Chen , Kun Zhao
- Applicant: Open Text Corporation
- Applicant Address: CA Waterloo
- Assignee: Open Text Corporation
- Current Assignee: Open Text Corporation
- Current Assignee Address: CA Waterloo
- Agency: Sprinkle IP Law Group
- Main IPC: G06K9/00
- IPC: G06K9/00 ; G06F16/93 ; G06F16/335

Abstract:
Systems and methods for assessing similarity of documents are provided. Embodiments of the systems and methods include extracting a reference document text from a reference document, extracting an archived document text from an archived document, and quantifying the reference document and the archived document. The systems and methods may also include determining a document similarity value of the quantified reference document and the archived document. Determining the document similarity value includes calculating a set of vector similarity values for a set of combinations of a reference document text vector and an archived document text vector, and calculating the document similarity value, including a sum of the plurality of vector similarity values.
Public/Granted literature
- US20200089947A1 METHOD AND SYSTEM FOR ASSESSING SIMILARITY OF DOCUMENTS Public/Granted day:2020-03-19
Information query