Invention Grant
- Patent Title: System and method for facts extraction and domain knowledge repository creation from unstructured and semi-structured documents
- Patent Title (中): 从非结构化和半结构化文档创建事实提取和领域知识库的系统和方法
-
Application No.: US13802411Application Date: 2013-03-13
-
Publication No.: US08682674B1Publication Date: 2014-03-25
- Inventor: Julia Komissarchik , Edward Komissarchik
- Applicant: Glenbrook Networks
- Applicant Address: US CA San Mateo
- Assignee: Glenbrook Networks
- Current Assignee: Glenbrook Networks
- Current Assignee Address: US CA San Mateo
- Agency: Foley & Lardner LLP
- Main IPC: G10L15/00
- IPC: G10L15/00 ; G06F17/27

Abstract:
Provided are methods and systems that extract facts of unstructured documents and build an oracle for various domains. The present invention addresses the problem of efficient finding and extraction of facts about a particular subject domain from semi-structured and unstructured documents, makes inferences of new facts from the extracted facts and the ways of verification of the facts, thus becoming a source of knowledge about the domain to be effectively queried. The methods and systems can also extract temporal information from unstructured and semi-structured documents, and can find and extract dynamically generated documents from Deep or Dynamic Web.
Information query