Invention Grant
- Patent Title: System and method for text categorization based on ontologies
- Application No.: US13872022
- Application Date: 2013-04-26
- Publication No.: US08782051B2
- Publication Date: 2014-07-15
- Inventor: Kirill Chashchin, Sergey Anshukov, Valery Bardin, Simon Kordonsky
- Applicant: Kirill Chashchin, Sergey Anshukov, Valery Bardin, Simon Kordonsky
- Applicant Address: US NY New York
- Assignee: South Eastern Publishers Inc.
- Current Assignee: South Eastern Publishers Inc.
- Current Assignee Address: US NY New York
- Agency: Galvin Patent Law LLC
- Agent: Brian R. Galvin
- Main IPC: G06F17/30
- IPC: G06F17/30; G06F17/27

Abstract:
A system for text categorization based on ontologies, comprising data collector software modules; a categorizer software module; and a database comprising an indexed database of documents and their categorizations, and further comprising a plurality of ontologies, each ontology comprising a plurality of hierarchical taxonomies and each hierarchical taxonomy comprising a plurality of taxons. The data collector software modules receive documents to be classified and submit them to the categorizer software module, and the categorizer performs the following steps to categorize each document: splitting the document into sentences; selecting words or phrases that are present in ontologies stored in the database server; selecting a plurality of subtrees from the ontologies based on the presence of specific subcategories in the document; determining a weight for each subcategory; pruning subcategories having a weight below a threshold; and, for each of the plurality of modified subtrees, computing a conditionality coefficient.
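The categorizer steps listed in the abstract can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration: the toy `ONTOLOGY`, the function names, the naive sentence splitter, and in particular the weight formula (a subcategory's share of all matched terms), which the abstract does not define. The final step, computing a conditionality coefficient, is left out because the abstract gives no definition for it.

```python
import re
from collections import Counter

# Hypothetical toy ontology: taxonomy -> subcategory (taxon) -> terms.
# The patent's actual ontologies are stored in a database and not shown here.
ONTOLOGY = {
    "sports": {
        "football": ["goal", "striker"],
        "tennis": ["serve", "racket"],
    },
    "finance": {
        "banking": ["loan", "deposit"],
    },
}


def split_sentences(text):
    """Split text into sentences with a naive rule: ., ! or ? then whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def match_subcategories(sentences, ontology):
    """Count, per (taxonomy, subcategory), how many ontology terms occur."""
    counts = Counter()
    for sentence in sentences:
        words = re.findall(r"[a-z]+", sentence.lower())
        for taxonomy, subcats in ontology.items():
            for subcat, terms in subcats.items():
                hits = sum(words.count(term) for term in terms)
                if hits:
                    counts[(taxonomy, subcat)] += hits
    return counts


def weigh_and_prune(counts, threshold):
    """Assumed weight: share of all matches. Drop weights below threshold."""
    total = sum(counts.values())
    if total == 0:
        return {}
    weights = {key: n / total for key, n in counts.items()}
    return {key: w for key, w in weights.items() if w >= threshold}


def categorize(text, ontology=ONTOLOGY, threshold=0.2):
    """Run the sketched pipeline: split, match, weigh, prune."""
    sentences = split_sentences(text)
    counts = match_subcategories(sentences, ontology)
    return weigh_and_prune(counts, threshold)
```

With this toy ontology, a text that mentions football terms three times and a banking term once gives football a weight of 0.75 and banking 0.25, so a threshold of 0.5 keeps only the football subcategory.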
Public/Granted literature
- US20130212111A1 SYSTEM AND METHOD FOR TEXT CATEGORIZATION BASED ON ONTOLOGIES, Public/Granted day: 2013-08-15