Invention Grant
- Patent Title: Lexical association metric for knowledge-free extraction of phrasal terms
- Application No.: US12814730
- Application Date: 2010-06-14
- Publication No.: US08078452B2
- Publication Date: 2011-12-13
- Inventor: Paul Deane
- Applicant: Paul Deane
- Applicant Address: US NJ Princeton
- Assignee: Educational Testing Service
- Current Assignee: Educational Testing Service
- Current Assignee Address: US NJ Princeton
- Agency: Jones Day
- Main IPC: G06F17/27
- IPC: G06F17/27 ; G06F17/21

Abstract:
A method and system for determining a lexical association of phrasal terms are described. A corpus having a plurality of words is received, and a plurality of contexts including one or more context words proximate to a word in the corpus is determined. An occurrence count for each context is determined, and a global rank is assigned based on the occurrence count. Similarly, a number of occurrences of a word being used in a context is determined, and a local rank is assigned to the word-context pair based on the number of occurrences. A rank ratio is then determined for each word-context pair. A rank ratio is equal to the global rank divided by the local rank for a word-context pair. A mutual rank ratio is determined by multiplying the rank ratios corresponding to a phrase. The mutual rank ratio is used to identify phrasal terms in the corpus.
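The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. Several choices are assumptions made only for illustration: phrases are limited to two-word sequences, the context of a word is its immediate neighbor (written as `("_", right)` or `(left, "_")`), the local rank orders the contexts observed with a given word, rank 1 goes to the highest count, and ties share a rank. The names `dense_ranks` and `mutual_rank_ratios` are hypothetical.

```python
from collections import Counter, defaultdict


def dense_ranks(counts):
    """Map each key to a rank: 1 = highest count, ties share a rank (assumed tie rule)."""
    distinct = sorted(set(counts.values()), reverse=True)
    rank_of_count = {c: r for r, c in enumerate(distinct, start=1)}
    return {key: rank_of_count[c] for key, c in counts.items()}


def mutual_rank_ratios(tokens):
    """Score every adjacent word pair by the product of its two rank ratios."""
    context_counts = Counter()          # context -> occurrences in the corpus
    pair_counts = defaultdict(Counter)  # word -> context -> occurrences

    for left, right in zip(tokens, tokens[1:]):
        # The left word occurs in context "_ right"; the right word in "left _".
        for word, ctx in ((left, ("_", right)), (right, (left, "_"))):
            context_counts[ctx] += 1
            pair_counts[word][ctx] += 1

    # Global rank: contexts ranked by overall occurrence count.
    global_rank = dense_ranks(context_counts)
    # Local rank: for each word, its contexts ranked by word-context count.
    local_rank = {w: dense_ranks(c) for w, c in pair_counts.items()}

    def rank_ratio(word, ctx):
        # Rank ratio = global rank divided by local rank, as in the abstract.
        return global_rank[ctx] / local_rank[word][ctx]

    scores = {}
    for left, right in set(zip(tokens, tokens[1:])):
        # Mutual rank ratio: product of the rank ratios belonging to the phrase.
        scores[(left, right)] = (
            rank_ratio(left, ("_", right)) * rank_ratio(right, (left, "_"))
        )
    return scores


if __name__ == "__main__":
    corpus = "new york is a new york in a new state".split()
    ranked = sorted(mutual_rank_ratios(corpus).items(), key=lambda kv: -kv[1])
    for phrase, score in ranked:
        print(" ".join(phrase), score)
```

Under these assumptions, a pair scores highly when each word's context ranks better for that word than it does across the corpus as a whole. Candidate phrasal terms would then be taken from the top of the sorted list.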
Public/Granted literature
- US20100250238A1 Lexical Association Metric for Knowledge-Free Extraction of Phrasal Terms (Public/Granted day: 2010-09-30)