- Patent Title: Systems and methods for detecting matching content in code files
- Application No.: US15018501
- Application Date: 2016-02-08
- Publication No.: US10176186B2
- Publication Date: 2019-01-08
- Inventor: Vishal Barad, Manojkumar Ghanshyamdas Rochani
- Applicant: Tata Consultancy Services Limited
- Applicant Address: IN Mumbai
- Assignee: Tata Consultancy Services Limited
- Current Assignee: Tata Consultancy Services Limited
- Current Assignee Address: IN Mumbai
- Agency: Finnegan, Henderson, Farabow, Garrett & Dunner LLP
- Priority: IN4663/MUM/2015 20151211
- Main IPC: G06F7/00
- IPC: G06F7/00 ; G06F17/30 ; G06F8/75 ; G06F8/70

Abstract:
Methods and systems for detecting matching content in code files are provided. The method involves generating clusters of code files based on the degree of matching characters contained in each of the code files. A first cluster of code files is identified based on the code files having 100% matching hash codes. At least one second cluster is generated, based on a character count, from the code files that are not part of the first cluster and that have a degree of match equal to or greater than a pre-determined percentage match. The identified first cluster and at least one second cluster of code files are reported as having matching content based on the associated degree of match.
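
The two-stage clustering described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not specify the hash function, how the character count is compared, or how clusters are formed, so the use of SHA-256, the character-multiset overlap metric, the greedy grouping, the 80% default threshold, and the names `content_hash`, `char_count_match`, and `cluster_files` are all assumptions made for illustration.

```python
import hashlib
from collections import Counter, defaultdict
from typing import Dict, List, Tuple


def content_hash(text: str) -> str:
    # Assumption: SHA-256 stands in for the unspecified hash code.
    return hashlib.sha256(text.encode("utf-8")).hexdigest()


def char_count_match(a: str, b: str) -> float:
    # Assumed metric: characters shared by both files (as multisets),
    # as a percentage of the longer file's length.
    longest = max(len(a), len(b))
    if longest == 0:
        return 100.0
    shared = sum((Counter(a) & Counter(b)).values())
    return 100.0 * shared / longest


def cluster_files(
    files: Dict[str, str], threshold: float = 80.0
) -> Tuple[List[List[str]], List[List[str]]]:
    # Stage 1: group files whose hashes are identical (100% match).
    by_hash = defaultdict(list)
    for name, text in files.items():
        by_hash[content_hash(text)].append(name)
    exact = [sorted(names) for names in by_hash.values() if len(names) > 1]

    # Stage 2: among the files left over, greedily group those whose
    # character-count match meets the pre-determined threshold.
    clustered = {name for group in exact for name in group}
    remaining = [name for name in files if name not in clustered]
    near: List[List[str]] = []
    used = set()
    for i, name in enumerate(remaining):
        if name in used:
            continue
        group = [name]
        for other in remaining[i + 1:]:
            if other in used:
                continue
            if char_count_match(files[name], files[other]) >= threshold:
                group.append(other)
        if len(group) > 1:
            used.update(group)
            near.append(group)
    return exact, near


# Hypothetical sample data, for illustration only.
sample = {
    "a.py": "print('hi')\n",
    "b.py": "print('hi')\n",
    "c.py": "x = 1\ny = 2\n",
    "d.py": "x = 1\ny = 3\n",
    "e.py": "completely different content here",
}
exact_clusters, near_clusters = cluster_files(sample)
```

Here `a.py` and `b.py` land in the exact-match cluster, `c.py` and `d.py` form a near-match cluster, and `e.py` is reported in neither. A character-count comparison ignores character order, so a production system would likely combine it with further checks; that choice is outside what the abstract states.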
Public/Granted literature
- US20170169045A1 SYSTEMS AND METHODS FOR DETECTING MATCHING CONTENT IN CODE FILES Public/Granted day: 2017-06-15