Systems and methods for detecting matching content in code files
Abstract:
Methods systems for detecting of matching content in code files are provided. The method involves generating clusters of code files based on a degree of matching characters contained in each of the code files. A first cluster of code files is identified based on the code files having 100% matching hash codes and at least one second cluster is generated based on a character count generated for the code files that are not part of the first cluster and having a degree of match equal to or greater than a pre-determined percentage match. Such identified first cluster and at least one second cluster of code files are reported to have matching content based on the associated degree of match.
Public/Granted literature
Information query
Patent Agency Ranking
0/0