Invention Grant
- Patent Title: Revealing content reuse using fine analysis
-
Application No.: US16460967Application Date: 2019-07-02
-
Publication No.: US11341761B2Publication Date: 2022-05-24
- Inventor: Nathan Roy Evans , Christopher Miles White , Jonathan Karl Larson , Darren Keith Edge
- Applicant: Microsoft Technology Licensing, LLC
- Applicant Address: US WA Redmond
- Assignee: Microsoft Technology Licensing, LLC
- Current Assignee: Microsoft Technology Licensing, LLC
- Current Assignee Address: US WA Redmond
- Agency: Schwegman Lundberg & Woessner, P.A.
- Main IPC: G06F16/906
- IPC: G06F16/906 ; G06F16/901 ; G06V30/416 ; G06V30/414 ; G06F40/216

Abstract:
Systems and methods for managing content provenance are provided. A network system accesses a document of a plurality of documents to be analyzed. The network system extracts text fragments from the document including a first fragment and a second fragment. A determination is made whether each of the text fragments match an entry in a hash table. Based on a first fragment not matching any entries in the hash table, the network system creates a new entry in the hash table, whereby the first fragment is used to generate a key in the hash table. Based on a second fragment matching an entry of the hash table, the network system associates the document with a key of the matching entry in the hash table, whereby the associating comprising updating the hash table with an identifier of the document.
Public/Granted literature
- US20210004582A1 Revealing Content Reuse Using Fine Analysis Public/Granted day:2021-01-07
Information query