Event clustering and classification with document embedding

Invention Grant

US10762439B2 Event clustering and classification with document embedding 有权

Please log in to see more content

Patent Title: Event clustering and classification with document embedding
Application No.: US15219401

Application Date: 2016-07-26
Publication No.: US10762439B2

Publication Date: 2020-09-01
Inventor: Feng Cao , Boliang Chen , Zheng Yu
Applicant: International Business Machines Corporation
Applicant Address: US NY Armonk
Assignee: International Business Machines Corporation
Current Assignee: International Business Machines Corporation
Current Assignee Address: US NY Armonk
Agency: Scully, Scott, Murphy & Presser, P.C.
Agent Joseph Petrokaitis
Main IPC: G06N20/00
IPC: G06N20/00 ; G06F16/35 ; G06N20/10 ; G06N5/02

Event clustering and classification with document embedding

Abstract:

Embedding representation for a document is generated based on clustering words in the document. Representative clusters are selected and a weighted sum of the embeddings of the words in the selected clusters is determined as a document embedding. Documents are labeled based on document embeddings. A machine learning algorithm is trained using the documents. The machine learning algorithm predicts a label of a given document based on the given document's document embedding.

Public/Granted literature

US20180032897A1 EVENT CLUSTERING AND CLASSIFICATION WITH DOCUMENT EMBEDDING Public/Granted day:2018-02-01

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06N	基于特定计算模型的计算机系统
G06N20/00	机器学习