Event clustering and classification with document embedding
Abstract:
Embedding representation for a document is generated based on clustering words in the document. Representative clusters are selected and a weighted sum of the embeddings of the words in the selected clusters is determined as a document embedding. Documents are labeled based on document embeddings. A machine learning algorithm is trained using the documents. The machine learning algorithm predicts a label of a given document based on the given document's document embedding.
Public/Granted literature
Information query
Patent Agency Ranking
0/0