Invention Grant
US08010544B2 Inverted indices in information extraction to improve records extracted per annotation
有权
信息提取中的反向索引,以提高每个注释提取的记录
- Patent Title: Inverted indices in information extraction to improve records extracted per annotation
- Patent Title (中): 信息提取中的反向索引,以提高每个注释提取的记录
-
Application No.: US12135070Application Date: 2008-06-06
-
Publication No.: US08010544B2Publication Date: 2011-08-30
- Inventor: Mahesh Tiyyagura
- Applicant: Mahesh Tiyyagura
- Applicant Address: US CA Sunnyvale
- Assignee: Yahoo! Inc.
- Current Assignee: Yahoo! Inc.
- Current Assignee Address: US CA Sunnyvale
- Agency: Hickman Palermo Truong & Becker LLP
- Main IPC: G06F17/30
- IPC: G06F17/30

Abstract:
A method is provided for information extraction from among a multiplicity of documents each having a corresponding document object model (DOM) comprising: computing signatures associated with nodes of a multiplicity of DOMs corresponding to the multiplicity of documents; producing an index that associates computed signatures to each document that has a DOM that has one or more nodes corresponding to such signature; annotating one or more nodes of a DOM that corresponds to the at least one selected document; wherein the one or more annotated nodes respectively correspond to one or more respective signatures included in the index; and matching the signatures that correspond to the annotated nodes with signatures in the index to determine which documents from the multiplicity of documents have one or more DOM nodes that correspond to one or more of the annotated nodes.
Public/Granted literature
- US20090307256A1 INVERTED INDICES IN INFORMATION EXTRACTION TO IMPROVE RECORDS EXTRACTED PER ANNOTATION Public/Granted day:2009-12-10
Information query