Invention Grant
- Patent Title: Method for clustering nodes of a textual network taking into account textual content, computer-readable storage device and system implementing said method
-
Application No.: US15480948Application Date: 2017-04-06
-
Publication No.: US10671936B2Publication Date: 2020-06-02
- Inventor: Charles Bouveyron , Pierre Latouche
- Applicant: Universite Paris Descartes , Universite Paris 1 Pantheon-Sorbonne , Centre National de la Recherche Scientifique (CNRS)
- Applicant Address: FR FR FR
- Assignee: Universite Paris Descartes,Universite Paris | Pantheon-Sorbonne,Centre National de la Recherche Scientifique (CNRS)
- Current Assignee: Universite Paris Descartes,Universite Paris | Pantheon-Sorbonne,Centre National de la Recherche Scientifique (CNRS)
- Current Assignee Address: FR FR FR
- Agency: Lerner, David, Littenberg, Krumholz & Mentlik, LLP
- Main IPC: G06N7/00
- IPC: G06N7/00 ; H04L12/24 ; G06N5/00

Abstract:
The invention relates to a method for clustering nodes of a network, the network comprising nodes associated with message edges of text data, the method comprising an initialization step of determination of a first initial clustering of the nodes, and a step of iterative inference of a generative model of text documents. Edges are modeled with a Stochastic Block Model (SBM) and the sets of documents between and within clusters are modeled according to a generative model of documents. The inference step comprises iteratively modelling the text documents and the underlying topics of their textual content, and updating the clustering as a function of the modelling, until a convergence criterion is fulfilled and an optimized clustering and corresponding optimized values of the parameters of the models are output.
Public/Granted literature
Information query
IPC分类:
G | 物理 |
G06 | 计算;推算或计数 |
G06N | 基于特定计算模型的计算机系统 |
G06N7/00 | 基于特定数学模式的计算机系统 |