Invention Grant
- Patent Title: Discovering terms using statistical corpus analysis
-
Application No.: US14722984Application Date: 2015-05-27
-
Publication No.: US10592605B2Publication Date: 2020-03-17
- Inventor: Jitendra Ajmera , Ankur Parikh
- Applicant: International Business Machines Corporation
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agent William H. Hartwell
- Main IPC: G06F16/27
- IPC: G06F16/27 ; G06F17/27 ; G06F16/35 ; G06F16/34 ; G06F16/33

Abstract:
Software that extracts contextually relevant terms from a text sample (or corpus) by performing the following steps: (i) identifying a first term from a corpus, based, at least in part, on a set of initial contextual characteristic(s), where each initial contextual characteristic of the set of initial contextual characteristic(s) relates to the contextual use of at least one category related term of a set of category related term(s) in the corpus; (ii) adding the first term to the set of category related term(s), thereby creating a revised set of category related term(s) and a set of first term contextual characteristic(s), where each first term contextual characteristic of the set of first term contextual characteristic(s) relates to the contextual use of the first term in the corpus; and (iii) identifying a second term from the corpus, based, at least in part, on the set of first term contextual characteristic(s).
Public/Granted literature
- US20160117313A1 DISCOVERING TERMS USING STATISTICAL CORPUS ANALYSIS Public/Granted day:2016-04-28
Information query