Invention Grant
- Patent Title: System and method for facts extraction and domain knowledge repository creation from unstructured and semi-structured documents
- Patent Title (中): 从非结构化和半结构化文档创建事实提取和领域知识库的系统和方法
-
Application No.: US12833910Application Date: 2010-07-09
-
Publication No.: US08244661B1Publication Date: 2012-08-14
- Inventor: Edward Komissarchik , Julia Komissarchik
- Applicant: Edward Komissarchik , Julia Komissarchik
- Applicant Address: US CA Hillsborough
- Assignee: Glenbrooks Networks
- Current Assignee: Glenbrooks Networks
- Current Assignee Address: US CA Hillsborough
- Agency: Foley & Lardner LLP
- Main IPC: G06F17/00
- IPC: G06F17/00 ; G06N5/02

Abstract:
Provided are methods and systems that extract facts of unstructured documents and build an oracle for various domains. The present invention addresses the problem of efficient finding and extraction of facts about a particular subject domain from semi-structured and unstructured documents, makes inferences of new facts from the extracted facts and the ways of verification of the facts, thus becoming a source of knowledge about the domain to be effectively queried. The methods and systems can also extract temporal information from unstructured and semi-structured documents, and can find and extract dynamically generated documents from Deep or Dynamic Web.
Information query