Invention Grant
- Patent Title: System and method for determining affixes of words
- Patent Title (中): 用于确定单词词缀的系统和方法
- Application No.: US10658968
- Application Date: 2003-09-09
- Publication No.: US07941310B2
- Publication Date: 2011-05-10
- Inventor: Youngja Park
- Applicant: Youngja Park
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agency: Scully, Scott, Murphy & Presser, P.C.
- Agent: Louis J. Percello, Esq.
- Main IPC: G06F17/28
- IPC: G06F17/28

Abstract:
A computer system and a method for analyzing text in one or more electronic documents are disclosed. The computer system comprises one or more system interfaces and an affix process that determines one or more affixes of one or more words in one or more of the documents and provides the affixes to the system interface. The preferred embodiment of the invention may be used to build a domain-specific morphology lexicon for NLP applications so that they can recognize out-of-vocabulary words. The disclosed procedure exploits the fact that the processes of discovering prefixes and suffixes are not independent. Many words, especially in technical documents, have complex morphological structures, so knowledge of prefixes helps the discovery of suffixes, and vice versa.
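The abstract does not spell out the procedure, so the following is only a minimal Python sketch of the general idea that prefix and suffix discovery can reinforce each other; it is not the patented method. The function names, the `min_support` threshold, the fixed number of rounds, and the toy vocabulary are all assumptions made for illustration. An affix is accepted here when at least `min_support` distinct known word forms can take it and still yield a known form.

```python
from collections import defaultdict


def find_suffixes(forms, min_support=2):
    """Return endings s such that stem and stem+s are both known forms
    for at least min_support distinct stems."""
    support = defaultdict(set)
    for w in forms:
        for i in range(1, len(w)):
            stem, suffix = w[:i], w[i:]
            if stem in forms:
                support[suffix].add(stem)
    return {s for s, stems in support.items() if len(stems) >= min_support}


def find_prefixes(forms, min_support=2):
    """Mirror image of find_suffixes: look for shared word beginnings."""
    support = defaultdict(set)
    for w in forms:
        for i in range(1, len(w)):
            prefix, stem = w[:i], w[i:]
            if stem in forms:
                support[prefix].add(stem)
    return {p for p, stems in support.items() if len(stems) >= min_support}


def strip_prefixes(words, prefixes):
    """Add to the word set every form obtained by removing a known prefix."""
    out = set(words)
    for w in words:
        for p in prefixes:
            if w.startswith(p) and len(w) > len(p):
                out.add(w[len(p):])
    return out


def strip_suffixes(words, suffixes):
    """Add to the word set every form obtained by removing a known suffix."""
    out = set(words)
    for w in words:
        for s in suffixes:
            if w.endswith(s) and len(w) > len(s):
                out.add(w[:-len(s)])
    return out


def discover_affixes(words, rounds=2, min_support=2):
    """Alternate the two searches; each one sees the vocabulary with the
    other kind of affix (found so far) stripped off."""
    vocab = set(words)
    prefixes, suffixes = set(), set()
    for _ in range(rounds):
        prefixes |= find_prefixes(strip_suffixes(vocab, suffixes), min_support)
        suffixes |= find_suffixes(strip_prefixes(vocab, prefixes), min_support)
    return prefixes, suffixes


# Hypothetical toy vocabulary, not taken from the patent.
vocab = ["heat", "cool", "preheat", "precool",
         "mix", "treat", "premixed", "pretreated"]

print(find_suffixes(set(vocab)))   # suffix search alone finds nothing
print(discover_affixes(vocab))     # with prefix knowledge: "pre" and "ed"
```

In this toy vocabulary, "mixed" and "treated" never occur on their own, so a suffix search over the raw words finds no support for "ed". Once "pre" is known, stripping it from "premixed" and "pretreated" exposes those forms and "ed" is discovered, which is the kind of dependence between the two searches that the abstract describes.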
Public/Granted literature
- US20050055200A1 System and method for determining affixes of words (Public/Granted day: 2005-03-10)