Invention Grant
- Patent Title: Detection of document similarity
-
Application No.: US15332842Application Date: 2016-10-24
-
Publication No.: US10769213B2Publication Date: 2020-09-08
- Inventor: Keke Cai , HongLei Guo , Zhili Guo , Feng Jin , Zhong Su
- Applicant: International Business Machines Corporation
- Applicant Address: US NY Armonk
- Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee Address: US NY Armonk
- Agency: Amin, Turocy & Watson, LLP
- Main IPC: G06F16/00
- IPC: G06F16/00 ; G06F16/93 ; G06F16/33 ; G06F16/35

Abstract:
Techniques for detection of document similarity are provided. The computer-implemented method can comprise identifying, by an electronic device operatively coupled to a processing unit, a first pragmatic association of a first segment in a first document portion, the first pragmatic association indicating meaning of the first segment specific to a context of the first segment in the first document portion. The computer-implemented method can also comprise generating a first intermediate document portion from the first document portion by using the first pragmatic association to replace the first segment. The computer-implemented method can further comprise determining a similarity degree between the first document portion and a second document portion by comparing the first intermediate document portion with the second document portion.
Public/Granted literature
- US20180113861A1 DETECTION OF DOCUMENT SIMILARITY Public/Granted day:2018-04-26
Information query