Invention Grant
- Patent Title: Identification of changes between document versions
-
Application No.: US16806438Application Date: 2020-03-02
-
Publication No.: US11630869B2Publication Date: 2023-04-18
- Inventor: Arvind Agarwal , Vitobha Munigala , Mitesh H. Vasa , Shanmukha Guttula , Ankush Gupta , Nicholas Gomez Phan
- Applicant: International Business Machines Corporation
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agency: Ference & Associates LLC
- Main IPC: G06F16/93
- IPC: G06F16/93 ; G06F16/28 ; G06F40/30 ; G06N20/00 ; G06F16/2458

Abstract:
One embodiment provides a method, including: obtaining at least two documents, wherein one of the at least two documents comprises a revision different than another of the at least two documents; identifying, within each of the at least two documents, portions corresponding to groups of text containing a conceptual unit; assigning at least a subset of the identified portions to a category type corresponding to a topic of a given portion, wherein the assigning comprises (i) generating a semantic tag for the identified portions in the subset and (ii) tagging the identified portions in the subset with the semantic tag; and determining changes between the at least two documents, wherein the determining comprises (iii) aligning given portions across the at least two documents based upon a relationship between the given portions across the at least two documents, (iv) identifying semantic differences between the aligned portions, and (v) identifying any remaining unaligned portions.
Public/Granted literature
- US20210271718A1 IDENTIFICATION OF CHANGES BETWEEN DOCUMENT VERSIONS Public/Granted day:2021-09-02
Information query