MULTI-MODAL ELECTRONIC DOCUMENT CLASSIFICATION

    公开(公告)号:WO2020018370A9

    公开(公告)日:2020-01-23

    申请号:PCT/US2019/041590

    申请日:2019-07-12

    Applicant: NETAPP, INC.

    Abstract: A method comprising operating at least one hardware processor for: receiving, as input, a plurality of electronic documents, training a machine learning classifier based, at least on part, on a training set comprising: (i) labels associated with the electronic documents, (ii) raw text from each of said plurality of electronic documents, and (iii) a rasterized version of each of said plurality of electronic documents, and applying said machine learning classifier to classify one or more new electronic documents.

    MULTI-MODAL ELECTRONIC DOCUMENT CLASSIFICATION

    公开(公告)号:WO2020018370A1

    公开(公告)日:2020-01-23

    申请号:PCT/US2019/041590

    申请日:2019-07-12

    Applicant: NETAPP, INC.

    Abstract: A method comprising operating at least one hardware processor for: receiving, as input, a plurality of electronic documents, training a machine learning classifier based, at least on part, on a training set comprising: (i) labels associated with the electronic documents, (ii) raw text from each of said plurality of electronic documents, and (iii) a rasterized version of each of said plurality of electronic documents, and applying said machine learning classifier to classify one or more new electronic documents.

Patent Agency Ranking