Invention Grant
- Patent Title: Event clustering and classification with document embedding
-
Application No.: US15219401Application Date: 2016-07-26
-
Publication No.: US10762439B2Publication Date: 2020-09-01
- Inventor: Feng Cao , Boliang Chen , Zheng Yu
- Applicant: International Business Machines Corporation
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agency: Scully, Scott, Murphy & Presser, P.C.
- Agent Joseph Petrokaitis
- Main IPC: G06N20/00
- IPC: G06N20/00 ; G06F16/35 ; G06N20/10 ; G06N5/02

Abstract:
Embedding representation for a document is generated based on clustering words in the document. Representative clusters are selected and a weighted sum of the embeddings of the words in the selected clusters is determined as a document embedding. Documents are labeled based on document embeddings. A machine learning algorithm is trained using the documents. The machine learning algorithm predicts a label of a given document based on the given document's document embedding.
Public/Granted literature
- US20180032897A1 EVENT CLUSTERING AND CLASSIFICATION WITH DOCUMENT EMBEDDING Public/Granted day:2018-02-01
Information query