Method for clustering nodes of a textual network taking into account textual content, computer-readable storage device and system implementing said method

Invention Grant

US10671936B2 Method for clustering nodes of a textual network taking into account textual content, computer-readable storage device and system implementing said method 审中-公开

Please log in to see more content

Patent Title: Method for clustering nodes of a textual network taking into account textual content, computer-readable storage device and system implementing said method
Application No.: US15480948

Application Date: 2017-04-06
Publication No.: US10671936B2

Publication Date: 2020-06-02
Inventor: Charles Bouveyron , Pierre Latouche
Applicant: Universite Paris Descartes , Universite Paris 1 Pantheon-Sorbonne , Centre National de la Recherche Scientifique (CNRS)
Applicant Address: FR FR FR
Assignee: Universite Paris Descartes,Universite Paris | Pantheon-Sorbonne,Centre National de la Recherche Scientifique (CNRS)
Current Assignee: Universite Paris Descartes,Universite Paris | Pantheon-Sorbonne,Centre National de la Recherche Scientifique (CNRS)
Current Assignee Address: FR FR FR
Agency: Lerner, David, Littenberg, Krumholz & Mentlik, LLP
Main IPC: G06N7/00
IPC: G06N7/00 ; H04L12/24 ; G06N5/00

Method for clustering nodes of a textual network taking into account textual content, computer-readable storage device and system implementing said method

Abstract:

The invention relates to a method for clustering nodes of a network, the network comprising nodes associated with message edges of text data, the method comprising an initialization step of determination of a first initial clustering of the nodes, and a step of iterative inference of a generative model of text documents. Edges are modeled with a Stochastic Block Model (SBM) and the sets of documents between and within clusters are modeled according to a generative model of documents. The inference step comprises iteratively modelling the text documents and the underlying topics of their textual content, and updating the clustering as a function of the modelling, until a convergence criterion is fulfilled and an optimized clustering and corresponding optimized values of the parameters of the models are output.

Public/Granted literature

US20180293505A1 METHOD FOR CLUSTERING NODES OF A TEXTUAL NETWORK TAKING INTO ACCOUNT TEXTUAL CONTENT, COMPUTER-READABLE STORAGE DEVICE AND SYSTEM IMPLEMENTING SAID METHOD Public/Granted day:2018-10-11

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06N	基于特定计算模型的计算机系统
G06N7/00	基于特定数学模式的计算机系统