Invention Grant
- Patent Title: Method, system and apparatus for automatic keyword extraction
- Patent Title (中): 自动关键字提取的方法,系统和设备
-
Application No.: US12614387Application Date: 2009-11-06
-
Publication No.: US08346534B2Publication Date: 2013-01-01
- Inventor: Andras Csomai , Rada Mihalcea
- Applicant: Andras Csomai , Rada Mihalcea
- Applicant Address: US TX Denton
- Assignee: University of North Texas System
- Current Assignee: University of North Texas System
- Current Assignee Address: US TX Denton
- Agency: Chalker Flores, LLP
- Agent Daniel J. Chalker; Edwin S. Flores
- Main IPC: G06F17/27
- IPC: G06F17/27 ; G06F17/21

Abstract:
The present invention provides a method and a system for automatic keyword extraction based on supervised or unsupervised machine learning techniques. Novel linguistically-motivated machine learning features are introduced, including discourse comprehension features based on construction integration theory, numeric features making use of syntactic part-of-speech patterns, and probabilistic features based on analysis of online encyclopedia annotations. The improved keyword extraction methods are combined with word sense disambiguation into a system for automatically generating annotations to enrich text with links to encyclopedic knowledge.
Public/Granted literature
- US20100145678A1 Method, System and Apparatus for Automatic Keyword Extraction Public/Granted day:2010-06-10
Information query