Invention Grant
US08255394B2 Apparatus, system, and method for efficient content indexing of streaming XML document content
Lapsed
- Patent Title: Apparatus, system, and method for efficient content indexing of streaming XML document content
- Application No.: US12475999
- Application Date: 2009-06-01
- Publication No.: US08255394B2
- Publication Date: 2012-08-28
- Inventor: James P. Branigan, David P. Charboneau, Simon K. Johnston
- Applicant: James P. Branigan, David P. Charboneau, Simon K. Johnston
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agency: Kunzler Needham Massey & Thorpe
- Main IPC: G06F17/30
- IPC: G06F17/30

Abstract:
An apparatus, system, and method are disclosed for efficient content indexing of streaming XML document content. A forest generator generates an XML pattern forest from a set of structured index path expressions. The XML pattern forest includes trees and twigs generated from structured index path expressions uniquely associated with a namespace indicator for an XML node. The XML node is identified in a stream of at least one XML document. A comparison module compares the XML node to nodes of trees and twigs of the XML pattern forest. A determination module determines a match between the XML node and an index node in one of a tree and a twig of the XML pattern forest. The index node has a path from an ancestor node to the index node that matches the axis steps of at least one of the structured index path expressions. A storage module stores an index entry for the XML node in response to the determined match. The index entry includes an XML document identifier, an XML node name, a namespace indicator for the XML node, and XML node content.
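
The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes that each index path expression is an absolute path made only of child-axis steps, that each step is a (namespace, local name) pair, and that the "pattern forest" can be approximated by a prefix tree in which paths with a common prefix share nodes. The names `build_forest`, `split_tag`, and `index_stream`, the example namespace `urn:example:catalog`, and the tuple layout of an index entry are all hypothetical.

```python
import io
import xml.etree.ElementTree as ET


def build_forest(path_expressions):
    """Merge index paths into a prefix tree (a stand-in for the pattern forest).

    Each path is a list of (namespace, local_name) steps. Paths that share a
    prefix share nodes; the last step of every path is flagged as an index node.
    """
    forest = {}
    for steps in path_expressions:
        level = forest
        for i, step in enumerate(steps):
            node = level.setdefault(step, {"children": {}, "index": False})
            if i == len(steps) - 1:
                node["index"] = True
            level = node["children"]
    return forest


def split_tag(tag):
    """Split ElementTree's '{namespace}local' tag into (namespace, local)."""
    if tag.startswith("{"):
        namespace, local = tag[1:].split("}", 1)
        return namespace, local
    return "", tag


def index_stream(doc_id, stream, forest):
    """Stream one XML document and collect index entries for matching nodes.

    Returns a list of (doc_id, node_name, namespace, content) tuples.
    """
    entries = []
    # stack[d] is the forest node matched at depth d, or None once the
    # current element has left every indexed path.
    stack = []
    for event, elem in ET.iterparse(stream, events=("start", "end")):
        if event == "start":
            step = split_tag(elem.tag)
            if not stack:
                level = forest                    # document root
            elif stack[-1] is None:
                level = {}                        # parent did not match
            else:
                level = stack[-1]["children"]
            stack.append(level.get(step))
        else:
            node = stack.pop()
            if node is not None and node["index"]:
                namespace, local = split_tag(elem.tag)
                content = (elem.text or "").strip()
                entries.append((doc_id, local, namespace, content))
            elem.clear()                          # release memory as we go
    return entries


NS = "urn:example:catalog"  # hypothetical namespace for the example
SAMPLE = (
    b'<c:catalog xmlns:c="urn:example:catalog">'
    b"<c:book><c:title>Streams</c:title><c:price>10</c:price></c:book>"
    b"</c:catalog>"
)

forest = build_forest([[(NS, "catalog"), (NS, "book"), (NS, "title")]])
entries = index_stream("doc-1", io.BytesIO(SAMPLE), forest)
print(entries)  # [('doc-1', 'title', 'urn:example:catalog', 'Streams')]
```

Because the parser tracks only the current position in the prefix tree, each incoming element is checked against all index paths with a single dictionary lookup, and elements outside every indexed path are skipped without further comparison. Relative paths ("twigs") and axes other than child are outside the scope of this sketch.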
Public/Granted literature
- US20100306273A1: Apparatus, system, and method for efficient content indexing of streaming XML document content (Public/Granted day: 2010-12-02)