Invention Grant
- Patent Title: Methods and systems for document classification using machine learning
- Application No.: US16281501
- Application Date: 2019-02-21
- Publication No.: US10970595B2
- Publication Date: 2021-04-06
- Inventor: Deepti Aggarwal, Jayanta Basak, Siddhartha Nandi
- Applicant: NETAPP, INC.
- Applicant Address: US CA Sunnyvale
- Assignee: NETAPP, INC.
- Current Assignee: NETAPP, INC.
- Current Assignee Address: US CA Sunnyvale
- Agency: Klein, O'Neill & Singh, LLP
- Main IPC: G06F7/02
- IPC: G06F7/02; G06F16/00; G06K9/62; G06F16/38; G06N20/00; G06F16/93; G06F40/284

Abstract:
Methods and systems for document classification are provided. One method includes generating, by a processor, a plurality of topics using content of a plurality of electronic documents, where each topic includes a plurality of words associated with the plurality of electronic documents; reducing, by the processor, the plurality of topics to a subset of topics to represent the plurality of electronic documents based on a parameter indicating a property of each subset topic and separation between the subset topics; automatically generating, by the processor, a tag for each subset topic based on the tag's position within the subset topic, wherein each tag is an attribute of each subset topic; storing, by the processor, the subset of topics with corresponding tags in a model data structure; and updating, by the processor, the model data structure based on one of a new topic and a new tag associated with an electronic document.
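The steps named in the abstract (reduce the topics to a subset, tag each subset topic, store the result, update it) can be illustrated with a small Python sketch. This is not the patented method: the abstract does not say how topics are generated, which property is scored, or how separation is measured. The sketch assumes the topics already exist as ranked word lists, uses a made-up per-topic `quality` score as the property, uses Jaccard distance between word sets as the separation, and takes the word in the first position of each topic as its tag. All function names, topic ids, and values are hypothetical.

```python
def separation(words_a, words_b):
    """Assumed separation measure: Jaccard distance between two word sets."""
    a, b = set(words_a), set(words_b)
    return 1.0 - len(a & b) / len(a | b)


def reduce_topics(topics, quality, k):
    """Greedily keep up to k topics, weighing each candidate's quality
    by its smallest separation from the topics already kept."""
    if not topics:
        return []
    remaining = sorted(topics, key=lambda t: quality[t], reverse=True)
    kept = [remaining.pop(0)]  # start from the highest-quality topic
    while remaining and len(kept) < k:
        best = max(
            remaining,
            key=lambda t: quality[t]
            * min(separation(topics[t], topics[s]) for s in kept),
        )
        kept.append(best)
        remaining.remove(best)
    return kept


def build_model(topics, kept):
    """Store each kept topic with a tag taken from its first-ranked word."""
    return {t: {"words": list(topics[t]), "tag": topics[t][0]} for t in kept}


def update_model(model, topic_id, words=None, tag=None):
    """Add a new topic, or attach a new tag to an existing one."""
    entry = model.setdefault(topic_id, {"words": [], "tag": None})
    if words is not None:
        entry["words"] = list(words)
        if entry["tag"] is None and words:
            entry["tag"] = words[0]
    if tag is not None:
        entry["tag"] = tag
    return model


# Hypothetical topics: each is a list of words ranked by weight.
topics = {
    "t0": ["disk", "volume", "storage", "raid"],
    "t1": ["disk", "storage", "volume", "array"],
    "t2": ["invoice", "payment", "tax", "vendor"],
    "t3": ["network", "latency", "packet", "disk"],
}
# Hypothetical per-topic property scores (for example, a coherence value).
quality = {"t0": 0.9, "t1": 0.85, "t2": 0.6, "t3": 0.8}

kept = reduce_topics(topics, quality, k=3)
model = build_model(topics, kept)
initial_tags = {t: model[t]["tag"] for t in kept}

# Update the stored model with a new topic and with a new tag.
update_model(model, "t4", words=["backup", "snapshot", "restore"])
update_model(model, "t2", tag="finance")
```

In this toy data, `t1` overlaps heavily with `t0`, so the greedy reduction drops it even though its quality score is the second highest. A different property or separation measure would give a different subset.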
Public/Granted literature
- US20190392250A1 METHODS AND SYSTEMS FOR DOCUMENT CLASSIFICATION USING MACHINE LEARNING, Public/Granted day: 2019-12-26