Invention Grant
- Patent Title: Source code similarity detection using digital fingerprints
-
Application No.: US16242414Application Date: 2019-01-08
-
Publication No.: US11099842B2Publication Date: 2021-08-24
- Inventor: Firas Alomari
- Applicant: Saudi Arabian Oil Company
- Applicant Address: SA Dhahran
- Assignee: Saudi Arabian Oil Company
- Current Assignee: Saudi Arabian Oil Company
- Current Assignee Address: SA Dhahran
- Agency: Leason Ellis LLP
- Main IPC: G06F8/75
- IPC: G06F8/75 ; G06F16/901 ; G06F8/41

Abstract:
Similarity in source code is identified by searching digital fingerprints representing at least control flow of blocks of programming statements. At least some of the source code is converted into a plurality of respective tokens. Each of the tokens is associated with a plurality of blocks. Tokens are modified by normalizing at least one value in at least one of the blocks and/or by defining at least one abstraction. Thereafter, a representation of control flow is created, and a digital fingerprint representing at control flow of a token is generated. Thereafter, source code within at least one block of a given token is determined and identifiable as being a duplicate of source code stored in a repository by comparing at least one of the generated digital fingerprints and at least one previously generated digital fingerprint.
Public/Granted literature
- US20200218535A1 SOURCE CODE SIMILARITY DETECTION USING DIGITAL FINGERPRINTS Public/Granted day:2020-07-09
Information query