Invention Grant
- Patent Title: Columnar database compression
-
Application No.: US16203650Application Date: 2018-11-29
-
Publication No.: US11036684B2Publication Date: 2021-06-15
- Inventor: Sami Abed , Pedro Barbas , Austin Clifford , Konrad Emanowicz
- Applicant: International Business Machines Corporation
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agent Edward J. Wixted, III
- Main IPC: G06F17/00
- IPC: G06F17/00 ; G06F16/174 ; G06F16/22 ; G06F16/27 ; G06F16/2457 ; G06N20/00

Abstract:
Disclosed is an approach comprising a column partitioned into a plurality of partitions including an empty partition and a plurality of filled partitions each comprising data entries associated with a set of parameters having parameter values, the data entries compressed in accordance with a compression dictionary. The approach comprises receiving forecasted parameter values for an expected set of data entries to be stored in an empty partition; predicting a recurrence frequency of the data entries in the expected set using the forecasted parameter values by evaluating the respective compression dictionaries of the filled partitions with a machine learning algorithm; generating a predictive compression dictionary for the expected set of data entries based on the predicted recurrence frequency of the data entries in the expected set; receiving the expected set of data entries; and compressing at least part of the received expected set of data entries using the predictive compression dictionary.
Public/Granted literature
- US20190095461A1 COLUMNAR DATABASE COMPRESSION Public/Granted day:2019-03-28
Information query