Invention Grant
US09043197B1 Extracting information from unstructured text using generalized extraction patterns
有权
使用广义提取模式从非结构化文本中提取信息
- Patent Title: Extracting information from unstructured text using generalized extraction patterns
- Patent Title (中): 使用广义提取模式从非结构化文本中提取信息
-
Application No.: US11774428Application Date: 2007-07-06
-
Publication No.: US09043197B1Publication Date: 2015-05-26
- Inventor: Alexandru Marius Pasca , Dekang Lin
- Applicant: Alexandru Marius Pasca , Dekang Lin
- Applicant Address: US CA Mountain View
- Assignee: Google Inc.
- Current Assignee: Google Inc.
- Current Assignee Address: US CA Mountain View
- Agency: Fish & Richardson P.C.
- Main IPC: G06F17/27
- IPC: G06F17/27 ; G06F17/21 ; G06F17/28

Abstract:
Methods, systems, and apparatus, including computer program products, for extracting information from unstructured text. Fact pairs are used to extract basic patterns from a body of text. Patterns are generalized by replacing words with classes of similar words. Generalized patterns are used to extract further fact pairs from the body of text. The process can begin with fact pairs, basic patterns, or generalized patterns.
Information query