Invention Grant
- Patent Title: Apparatus and method for identifying similarity via dynamic decimation of token sequence N-grams
-
Application No.: US14754869Application Date: 2015-06-30
-
Publication No.: US09910985B2Publication Date: 2018-03-06
- Inventor: Jonathan D. Cohen
- Applicant: The Johns Hopkins University
- Applicant Address: US MD Baltimore
- Assignee: The Johns Hopkins University
- Current Assignee: The Johns Hopkins University
- Current Assignee Address: US MD Baltimore
- Agent Sung T. Kim
- Main IPC: G06F12/14
- IPC: G06F12/14 ; G06F21/56 ; G06F17/27

Abstract:
An apparatus for identifying related code variants or text samples includes processing circuitry configured to execute instructions for receiving query binary code, processing the query binary code to generate one or more query code fingerprints comprising compressed representations of respective functional components of the query binary code, generating token sequence n-grams of the fingerprints, hashing the n-grams, partitioning samples by length to compare selected samples based on length, and identifying similarity via dynamic decimation of token sequence n-grams.
Public/Granted literature
- US20150302197A1 Apparatus and Method for Identifying Similarity Via Dynamic Decimation of Token Sequence N-Grams Public/Granted day:2015-10-22
Information query