Invention Grant
- Patent Title: Language independent stemming
- Patent Title (Chinese): 语言独立的词干 (Language independent stemming)
- Application No.: US11687402
- Application Date: 2007-03-16
- Publication No.: US08015175B2
- Publication Date: 2011-09-06
- Inventor: John Fairweather
- Applicant: John Fairweather
- Agency: Stanley J. Gradisar Attorney At Law LLC
- Agent: Stanley J. Gradisar
- Main IPC: G06F7/00
- IPC: G06F7/00 ; G06F17/30

Abstract:
A stemming framework for combining stemming algorithms together in a multilingual environment to obtain improved stemming behavior over any individual stemming algorithm, together with a new language independent stemming algorithm based on shortest path techniques. The stemmer essentially treats the stemming problem as a simple instance of the shortest path problem where the cost for each path can be computed from its word component and its number of characters. The goal of the stemmer is to find the shortest path to construct the entire word. The stemmer uses dynamic dictionaries constructed as lexical analyzer state transition tables to recognize the various allowable word parts for any given language in order to obtain maximum speed. The stemming framework provides the necessary logic to combine multiple stemmers in parallel and to merge their results to obtain the best behavior. Mapping dictionaries handle irregular plurals, tense, phrase mapping and proper name recognition.
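The shortest-path idea in the abstract can be illustrated with a small sketch. This is not the patented implementation: the word-part sets, the per-part costs, the unknown-character penalty, and the function names `candidate_edges` and `shortest_path_stem` are all assumptions made for illustration, and plain Python sets stand in for the lexical analyzer state transition tables the abstract mentions. Each character position in the word is treated as a node, each recognized word part as a weighted edge, and the cheapest path covering the whole word is found by dynamic programming.

```python
from typing import Iterator, List, Tuple

# Hypothetical word-part dictionaries (assumed for this sketch only).
PREFIXES = {"un", "re"}
ROOTS = {"teach", "do", "help"}
SUFFIXES = {"ing", "s", "ed", "er", "less", "ness"}

# Assumed costs: roots are cheapest, affixes cost more, and any
# character that no dictionary recognizes carries a heavy penalty.
PART_COST = {"prefix": 2.0, "root": 1.0, "suffix": 2.0}
UNKNOWN_CHAR_COST = 10.0


def candidate_edges(word: str, start: int) -> Iterator[Tuple[int, str, str, float]]:
    """Yield (end, kind, text, cost) for every word part matching at `start`."""
    tables = (("prefix", PREFIXES), ("root", ROOTS), ("suffix", SUFFIXES))
    for end in range(start + 1, len(word) + 1):
        piece = word[start:end]
        for kind, table in tables:
            if piece in table:
                yield end, kind, piece, PART_COST[kind]
    # Fallback edge: consume one unrecognized character at a high cost,
    # so every word always has at least one complete path.
    yield start + 1, "unknown", word[start], UNKNOWN_CHAR_COST


def shortest_path_stem(word: str) -> Tuple[str, List[Tuple[str, str]], float]:
    """Return (stem, parts on the cheapest path, total path cost)."""
    n = len(word)
    best = [float("inf")] * (n + 1)   # best[i]: cheapest cost to cover word[:i]
    back = [None] * (n + 1)           # back[i]: (previous position, kind, text)
    best[0] = 0.0
    # Edges only point forward, so one left-to-right pass is sufficient.
    for start in range(n):
        for end, kind, piece, cost in candidate_edges(word, start):
            if best[start] + cost < best[end]:
                best[end] = best[start] + cost
                back[end] = (start, kind, piece)
    # Walk the back-pointers to recover the parts in order.
    parts: List[Tuple[str, str]] = []
    pos = n
    while pos > 0:
        prev, kind, piece = back[pos]
        parts.append((kind, piece))
        pos = prev
    parts.reverse()
    roots = [text for kind, text in parts if kind == "root"]
    # If no root was recognized, fall back to the unmodified word.
    stem = roots[0] if roots else word
    return stem, parts, best[n]


if __name__ == "__main__":
    for w in ("reteaching", "helpless", "xyz"):
        print(w, shortest_path_stem(w))
```

With these assumed dictionaries, "reteaching" decomposes into the prefix "re", the root "teach", and the suffix "ing", and a word with no recognized parts is returned unchanged. A fuller treatment would also constrain the order of parts (prefixes before roots, suffixes after) and consult the mapping dictionaries for irregular forms, neither of which this sketch attempts.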
Public/Granted literature
- US20080228748A1 LANGUAGE INDEPENDENT STEMMING Public/Granted day: 2008-09-18