Memory-constrained aggregation using intra-operator pipelining

    公开(公告)号:US10114866B2

    公开(公告)日:2018-10-30

    申请号:US15040501

    申请日:2016-02-10

    Applicant: SAP SE

    Abstract: Disclosed herein are system, method, and computer program product embodiments for constraining the amount of memory used during data aggregation. An embodiment operates by separating input data into a plurality of partitions. The embodiment then inserts portions of the input data into blocks from a free list at a given level of a pipeline. The embodiment then inserts the blocks into buffers for processing at a subsequent level of the pipeline. The embodiment processes the inserted blocks at the subsequent level of the pipeline and concatenates the intermediate results into a final aggregate result.

    Aggregating database entries by hashing

    公开(公告)号:US10055480B2

    公开(公告)日:2018-08-21

    申请号:US14726251

    申请日:2015-05-29

    Applicant: SAP SE

    Abstract: Aggregating input into hashtables using just-in-time compilation of compilable code in response to a database query. Compilable code can be generated that is configured to cause a programmable processor to produce one or more hashmaps based on the input database. The one or more hashmaps can correspond to each individual thread from the input. The compilable code can be configured to cause the one or more processors to insert the hashmaps into a scheduler. Compilable code can be generated that is configured to: aggregate elements from the one or more hashmaps into buckets of elements having the same partition identity; rehash the buckets of elements having the same partition identity to reduce the number of groups within the bucket; facilitate the merger of all non-empty elements from each target-partition into a merged-partition.

    ADAPTIVE DICTIONARY COMPRESSION/DECOMPRESSION FOR COLUMN-STORE DATABASES

    公开(公告)号:US20190155788A1

    公开(公告)日:2019-05-23

    申请号:US16255622

    申请日:2019-01-23

    Applicant: SAP SE

    CPC classification number: G06F16/17 G06F16/221

    Abstract: Innovations for adaptive compression and decompression for dictionaries of a column-store database can reduce the amount of memory used for columns of the database, allowing a system to keep column data in memory for more columns, while delays for access operations remain acceptable. For example, dictionary compression variants use different compression techniques and implementation options, Some dictionary compression variants provide more aggressive compression (reduced memory consumption) but result in slower run-time performance. Other dictionary compression variants provide less aggressive compression (higher memory consumption) but support faster run-time performance. As another example, a compression manager can automatically select a dictionary compression variant for a given column in a column-store database. For different dictionary compression variants, the compression manager predicts run-time performance and compressed dictionary size, given the values of the column, and selects one of the dictionary compression variants.

    AGGREGATING DATABASE ENTRIES BY HASHING
    4.
    发明申请
    AGGREGATING DATABASE ENTRIES BY HASHING 审中-公开
    通过洗涤聚合数据库入口

    公开(公告)号:US20160350394A1

    公开(公告)日:2016-12-01

    申请号:US14726251

    申请日:2015-05-29

    Applicant: SAP SE

    Abstract: Aggregating input into hashtables using just-in-time compilation of compilable code in response to a database query. Compilable code can be generated that is configured to cause a programmable processor to produce one or more hashmaps based on the input database. The one or more hashmaps can correspond to each individual thread from the input. The compilable code can be configured to cause the one or more processors to insert the hashmaps into a scheduler. Compilable code can be generated that is configured to: aggregate elements from the one or more hashmaps into buckets of elements having the same partition identity; rehash the buckets of elements having the same partition identity to reduce the number of groups within the bucket; facilitate the merger of all non-empty elements from each target-partition into a merged-partition.

    Abstract translation: 使用即时汇编可编译代码来响应数据库查询将输入汇总到散列表中。 可以生成可编译代码,该代码被配置为使可编程处理器基于输入数据库产生一个或多个哈希图。 一个或多个hashmaps可以对应于来自输入的每个单独的线程。 可编译代码可以配置为使一个或多个处理器将hashmaps插入调度程序。 可以生成可编译代码,其被配置为:将元素从一个或多个hashmaps聚合到具有相同分区标识的元素的桶中; 重新分配具有相同分区身份的元素桶以减少桶内的组数; 便于将所有非空元素从每个目标分区合并成合并分区。

    Adaptive dictionary compression/decompression for column-store databases

    公开(公告)号:US10824596B2

    公开(公告)日:2020-11-03

    申请号:US16255622

    申请日:2019-01-23

    Applicant: SAP SE

    Abstract: Innovations for adaptive compression and decompression for dictionaries of a column-store database can reduce the amount of memory used for columns of the database, allowing a system to keep column data in memory for more columns, while delays for access operations remain acceptable. For example, dictionary compression variants use different compression techniques and implementation options, Some dictionary compression variants provide more aggressive compression (reduced memory consumption) but result in slower run-time performance. Other dictionary compression variants provide less aggressive compression (higher memory consumption) but support faster run-time performance. As another example, a compression manager can automatically select a dictionary compression variant for a given column in a column-store database. For different dictionary compression variants, the compression manager predicts run-time performance and compressed dictionary size, given the values of the column, and selects one of the dictionary compression variants.

    Dynamic hash table size estimation during database aggregation processing

    公开(公告)号:US10127281B2

    公开(公告)日:2018-11-13

    申请号:US15016978

    申请日:2016-02-05

    Applicant: SAP SE

    Abstract: Disclosed herein are system, method, and computer program product embodiments for database aggregation optimization. An embodiment operates by receiving data from a main memory. Within a cache, a first hash table comprising an aggregate hash of a first portion of the data is generated. A second portion of data is partitioned into one or more of partitions. Within the cache, one or more intermediate hash tables are generated. A first hash table is aggregated based on the one or more intermediate hash tables. At least a portion of the data of the final hash table is provided responsive to a query.

Patent Agency Ranking