Invention Grant
- Patent Title: Indexing for regular expressions in text-centric applications
- Patent Title (中): 以文本为中心的应用程序索引正则表达式
-
Application No.: US13585447Application Date: 2012-08-14
-
Publication No.: US08548979B2Publication Date: 2013-10-01
- Inventor: Ting Chen , Rajasekar Krishnamurthy , Shivakumar Vaithyanathan
- Applicant: Ting Chen , Rajasekar Krishnamurthy , Shivakumar Vaithyanathan
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agency: Lieberman & Brandsdorfer, LLC
- Main IPC: G06F7/00
- IPC: G06F7/00 ; G06F17/30

Abstract:
A method, system, and article are provided for evaluating regular expressions over large data collections. A general purpose index is built to handle complex regular expressions at the character level. Characters, character classes, and associated metadata are identified and stored in an index of a collection of documents. Given a regular expression, a query is generated based on the contents of the index. This query is executed over the index to identify a set of documents in the collection of documents over which the regular expression can be evaluated. Based upon the query execution, the identified set of documents is returned for evaluation by the regular expression responsive to execution of the query over the index.
Public/Granted literature
- US20120310948A1 Indexing for Regular Expressions in Text-Centric Applications Public/Granted day:2012-12-06
Information query