Generation and use of trained file classifiers for malware detection

    公开(公告)号:US10304010B2

    公开(公告)日:2019-05-28

    申请号:US15610191

    申请日:2017-05-31

    Inventor: Na Sai

    Abstract: A method includes receiving one or more n-gram vectors for a file as input to a file classifier, where the one or more n-gram vectors indicate occurrences of groups of entropy indicators in a sequence of entropy indicators representing the file. The method also includes generating, based on the one or more n-gram vectors, output including classification data associated with the file, the classification data indicating whether the file includes malware.

    Generation and use of trained file classifiers for malware detection

    公开(公告)号:US10062038B1

    公开(公告)日:2018-08-28

    申请号:US15610228

    申请日:2017-05-31

    Inventor: Na Sai

    CPC classification number: G06N20/00 G06F21/562 G06F2221/033 G06N3/02

    Abstract: A method includes accessing information identifying multiple files and identifying classification data for the multiple files, where the classification data indicates, for a particular file of the multiple files, whether the particular file includes malware. The method also includes generating a sequence of entropy indicators for each of the multiple files, each entropy indicator of the sequence of entropy indicators for the particular file corresponding to a chunk of the particular file. The method further includes generating n-gram vectors for the multiple files, where the n-gram vector for the particular file indicates occurrences of groups of entropy indicators in the sequence of entropy indicators for the particular file. The method also includes generating and storing a file classifier using the n-gram vectors and the classification data as supervised training data.

    Display screen with graphical user interface

    公开(公告)号:USD1004613S1

    公开(公告)日:2023-11-14

    申请号:US29820581

    申请日:2021-12-22

    Abstract: FIG. 1 is a front view of the display screen with graphical user interface showing a first embodiment of the design.
    FIG. 2 is a second embodiment thereof.
    FIG. 3 is a third embodiment thereof.
    FIG. 4 is a fourth embodiment thereof; and,
    FIG. 5 is a fifth embodiment thereof.
    The broken lines shown in the Figures form no part of the claimed design. The broken line with long line segments separated by short line segments represents an outer edge of the display screen. The remaining broken lines represent portions of the graphical user interface.

    Generation and use of trained file classifiers for malware detection

    公开(公告)号:US10068187B1

    公开(公告)日:2018-09-04

    申请号:US15610173

    申请日:2017-05-31

    Inventor: Na Sai

    Abstract: A method includes accessing information identifying multiple files and identifying classification data for the multiple files, where the classification data indicates, for a particular file of the multiple files, whether the particular file includes malware. The method also includes generating n-gram vectors for the multiple files by, for each file, generating an n-gram vector indicating occurrences of character pairs in printable characters representing the file. The method further includes generating and storing a file classifier using the n-gram vectors and the classification data as supervised training data.

    Generation and use of trained file classifiers for malware detection

    公开(公告)号:US09864956B1

    公开(公告)日:2018-01-09

    申请号:US15583565

    申请日:2017-05-01

    Inventor: Na Sai

    CPC classification number: G06N99/005 G06F21/562 G06F2221/033 G06N3/02

    Abstract: A method includes training a file classifier from one or more n-gram feature vectors received from a plurality of binary files as input, where the one or more n-gram vectors represent the occurrences of character pairs in printable characters within the file or characters representing the informational entropy sequence of the file. Another method also includes generating, by the file classifier, output including classification data associated with the file based on the one or more n-gram vectors, where the classification data indicates whether the file includes malware.

Patent Agency Ranking