Invention Grant
- Patent Title: Search index format optimizations
- Patent Title (中): 搜索索引格式优化
-
Application No.: US12139213Application Date: 2008-06-13
-
Publication No.: US08166041B2Publication Date: 2012-04-24
- Inventor: Chadd Creighton Merrigan , Mihai Petriuc , Raif Khassanov , Artsiom Ivanovich Kokhan
- Applicant: Chadd Creighton Merrigan , Mihai Petriuc , Raif Khassanov , Artsiom Ivanovich Kokhan
- Applicant Address: US WA Redmond
- Assignee: Microsoft Corporation
- Current Assignee: Microsoft Corporation
- Current Assignee Address: US WA Redmond
- Agency: Merchant & Gould, P.C.
- Main IPC: G06F7/00
- IPC: G06F7/00 ; G06F17/30

Abstract:
A search index structure which extends a typical composite index by incorporating an index which is optimized for fast retrieval from storage and which eliminates data which is specific to phrase searching. Other data is represented in a manner which allows it to be calculated rather than stored. Associating variable length entries with logical categories allows their length to be inferred from the category rather than stored. Using delta values between document IDs rather than the ID itself generates a compact, dense symbol set which is efficiently compressed by Huffman encoding or a similar compression method. Using an upper threshold to remove large, and thus rare, delta values from the symbol set prior to encoding further improves the encoding performance.
Public/Granted literature
- US20090313238A1 SEARCH INDEX FORMAT OPTIMIZATIONS Public/Granted day:2009-12-17
Information query