Invention Grant
- Patent Title: File format identification system
- Application No.: US17748906
- Application Date: 2022-05-19
- Publication No.: US12105751B2
- Publication Date: 2024-10-01
- Inventor: Marian Radu
- Applicant: CrowdStrike, Inc.
- Applicant Address: US CA Sunnyvale
- Assignee: CrowdStrike, Inc.
- Current Assignee: CrowdStrike, Inc.
- Current Assignee Address: US CA Sunnyvale
- Agency: Lee & Hayes, P.C.
- Main IPC: G06F16/11
- IPC: G06F16/11 ; G06F9/30 ; G06F16/51 ; G06F16/55 ; G06N20/00

Abstract:
A file format identification system can predict file formats associated with binary data. The file format identification system can extract n-grams, such as byte 4-grams, from the binary data. A trained neural network with at least one embedding layer can generate embedding arrays that correspond to the extracted n-grams. A trained file format classifier can compare values in the embedding arrays with patterns of values associated with known file formats. The trained file format classifier can accordingly determine which of the known file formats are most likely to be associated with the binary data.
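As a rough illustration of the first step in the abstract, the sketch below extracts overlapping byte 4-grams from a buffer. The function name `extract_byte_ngrams`, the sliding-window stride of one byte, and the sample PNG signature are assumptions made for illustration; the record does not specify how the patented system windows the data, and the embedding layer and classifier are not modeled here.

```python
from collections import Counter


def extract_byte_ngrams(data: bytes, n: int = 4) -> list[bytes]:
    """Return every overlapping n-byte window of `data`, in order.

    Hypothetical helper: a stride of one byte is an assumption,
    not something stated in the patent record.
    """
    if n <= 0:
        raise ValueError("n must be positive")
    # range() is empty when the buffer is shorter than n, so short
    # inputs simply yield no n-grams.
    return [data[i:i + n] for i in range(len(data) - n + 1)]


# Sample input: the 8-byte PNG file signature.
header = b"\x89PNG\r\n\x1a\n"
grams = extract_byte_ngrams(header)

# Frequency of each 4-gram, a simple stand-in for the features that
# a downstream embedding layer would consume.
counts = Counter(grams)
```

In a fuller pipeline, each n-gram would presumably be mapped to an index into a learned embedding table, and the resulting arrays passed to the classifier.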
Public/Granted literature:
- US20230376526A1 FILE FORMAT IDENTIFICATION SYSTEM (Public/Granted day: 2023-11-23)