Invention Grant
- Patent Title: Clique based clustering for named entity recognition system
- Patent Title (translated from Chinese): Clique-based clustering for named entity recognition systems
- Application No.: US12167382
- Application Date: 2008-07-03
- Publication No.: US08275608B2
- Publication Date: 2012-09-25
- Inventor: Julien Ah-Pine, Guillaume Jacquet
- Applicant: Julien Ah-Pine, Guillaume Jacquet
- Applicant Address: US CT Norwalk
- Assignee: Xerox Corporation
- Current Assignee: Xerox Corporation
- Current Assignee Address: US CT Norwalk
- Agency: Fay Sharpe LLP
- Main IPC: G06F17/27
- IPC: G06F17/27 ; G10L15/06 ; G06F17/21 ; G06F17/30 ; G06F7/00

Abstract:
A soft clustering method comprises (i) grouping items into non-exclusive cliques based on features associated with the items, and (ii) clustering the non-exclusive cliques using a hard clustering algorithm to generate item groups on the basis of mutual similarity of the features of the items constituting the cliques. In some named entity recognition embodiments illustrated herein as examples, named entities together with contexts are grouped into cliques based on mutual context similarity. Each clique includes a plurality of different named entities having mutual context similarity. The cliques are clustered to generate named entity groups on the basis of mutual similarity of the contexts of the named entities constituting the cliques.
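
The two steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the Jaccard similarity measure, the two thresholds, the use of shared contexts as a clique's profile, the single-link merging used as the hard clustering step, and all function names and toy data are assumptions chosen for illustration only.

```python
from itertools import combinations


def jaccard(a, b):
    """Jaccard similarity of two sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def maximal_cliques(adj):
    """Enumerate maximal cliques with the basic Bron-Kerbosch recursion."""
    cliques = []

    def expand(r, p, x):
        if not p and not x:
            cliques.append(frozenset(r))
            return
        for v in list(p):
            expand(r | {v}, p & adj[v], x & adj[v])
            p = p - {v}
            x = x | {v}

    expand(set(), set(adj), set())
    return cliques


def soft_cluster(features, item_threshold=0.5, clique_threshold=0.5):
    """Step (i): group items into overlapping cliques.
    Step (ii): hard-cluster the cliques (each clique gets exactly one group)."""
    items = sorted(features)
    # Link two items when their contexts are similar enough (assumed measure).
    adj = {i: set() for i in items}
    for a, b in combinations(items, 2):
        if jaccard(features[a], features[b]) >= item_threshold:
            adj[a].add(b)
            adj[b].add(a)
    cliques = [c for c in maximal_cliques(adj) if len(c) >= 2]

    # Assumed clique profile: the contexts shared by every member.
    profiles = [set.intersection(*(features[i] for i in c)) for c in cliques]

    # Hard clustering of cliques via single-link merging (union-find).
    parent = list(range(len(cliques)))

    def find(k):
        while parent[k] != k:
            parent[k] = parent[parent[k]]
            k = parent[k]
        return k

    for i, j in combinations(range(len(cliques)), 2):
        if jaccard(profiles[i], profiles[j]) >= clique_threshold:
            parent[find(i)] = find(j)

    # An item group is the union of the cliques assigned to one cluster.
    groups = {}
    for k, c in enumerate(cliques):
        groups.setdefault(find(k), set()).update(c)
    return cliques, sorted(sorted(g) for g in groups.values())


# Hypothetical named entities with the contexts they were observed in.
contexts = {
    "Washington": {"born_in", "president", "city_of", "visit"},
    "Lincoln": {"born_in", "president"},
    "Jefferson": {"born_in", "president"},
    "Adams": {"born_in", "president", "elected"},
    "Seattle": {"city_of", "visit"},
    "Boston": {"city_of", "visit"},
}

cliques, groups = soft_cluster(contexts)
print(groups)
# [['Adams', 'Jefferson', 'Lincoln', 'Washington'], ['Boston', 'Seattle', 'Washington']]
```

In this toy run the ambiguous entity "Washington" belongs to two cliques whose shared contexts differ, so it ends up in both the person-like group and the place-like group, even though each clique is assigned to exactly one cluster. That is the sense in which hard clustering of non-exclusive cliques yields a soft clustering of the items.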
Public/Granted literature
- US20100004925A1: Clique based clustering for named entity recognition system (Public/Granted day: 2010-01-07)