Framework for workload prediction and physical database design

    公开(公告)号:US11789920B1

    公开(公告)日:2023-10-17

    申请号:US17705728

    申请日:2022-03-28

    Applicant: SAP SE

    CPC classification number: G06F16/2282 G06F11/3414 G06F16/213 G06F16/256

    Abstract: According to some embodiments, methods and systems may be associated with a cloud computing environment. A workload prediction framework may receive observed workload information associated with a database in the cloud computing environment (e.g., a DataBase as a Service (“DBaaS”)). Based on the observed workload information, a Statement Arrival Rate (“SAR”) prediction may be generated. In addition, a host variable assignment prediction may be generated based on the observed workload information. The workload prediction framework may then use the SAR prediction and the host variable assignment prediction to automatically create a workload prediction for the database. A physical database design advisor (e.g., a table partitioning advisor) may receive the workload prediction and, responsive to the workload prediction, automatically generate a recommended physical layout for the database (e.g., using a cost model, the current physical layout, and an objective function).

    DESIGN AND IMPLEMENTATION OF DATA ACCESS METRICS FOR AUTOMATED PHYSICAL DATABASE DESIGN

    公开(公告)号:US20220269653A1

    公开(公告)日:2022-08-25

    申请号:US17324914

    申请日:2021-05-19

    Applicant: SAP SE

    Abstract: The present disclosure involves systems, software, and computer implemented methods for improved design and implementation of data access metrics for automated physical database design. An example method includes identifying a database workload for which index advisor access counters are to be tracked. Each SQL statement in the database workload is executed. For each SQL statement, attribute sets are determined for which a selection predicate filters a result for an SQL statement. An output cardinality of each selection predicate is determined. A logarithmic counter for an attribute set corresponding to the selection predicate is determined based on the output cardinality of the selection predicate. The determined logarithmic counter is incremented. Respective values for logarithmic counters of the determined attributes are provided to an index advisor. The index advisor determines attribute sets for which to propose an index based on the logarithmic counters of the respective attribute sets.

    Compressing time stamp columns
    4.
    发明授权

    公开(公告)号:US11386104B2

    公开(公告)日:2022-07-12

    申请号:US16661993

    申请日:2019-10-23

    Applicant: SAP SE

    Abstract: Disclosed is a system and method for improving database memory consumption and performance using compression of time stamp columns. A number of time stamps of a time series is received. The time stamps have a start time, and are separated by an equal increment of time that defines an interval. The start time and interval are stored in a dictionary of a column store of a database. An index is generated in the column store of the database, the index having a number of index vectors. Using the index vectors, each time stamp of the number of time stamps can be calculated from the start time stored in the dictionary and the position in the time series based on the interval stored in the dictionary.

    Data access and recommendation system

    公开(公告)号:US11308047B2

    公开(公告)日:2022-04-19

    申请号:US16816511

    申请日:2020-03-12

    Applicant: SAP SE

    Abstract: System, method, and various embodiments for providing a data access and recommendation system are described herein. An embodiment operates by identifying a column access of one or more data values of a first column of a plurality of columns of a table of a database during a sampling period. A count of how many of the one or more data values are accessed during the column access are recorded. A first counter, corresponding to the first column and stored in a distributed hash table, is incremented by the count. The sampling period is determined to have expired. A load recommendation on how to load data values into the first column based on the first counter is computed. The load recommendation for implementation into the database for one or more subsequent column accesses is provided.

    DATA ACCESS AND RECOMMENDATION SYSTEM

    公开(公告)号:US20210286777A1

    公开(公告)日:2021-09-16

    申请号:US16816511

    申请日:2020-03-12

    Applicant: SAP SE

    Abstract: System, method, and various embodiments for providing a data access and recommendation system are described herein. An embodiment operates by identifying a column access of one or more data values of a first column of a plurality of columns of a table of a database during a sampling period. A count of how many of the one or more data values are accessed during the column access are recorded. A first counter, corresponding to the first column and stored in a distributed hash table, is incremented by the count. The sampling period is determined to have expired. A load recommendation on how to load data values into the first column based on the first counter is computed. The load recommendation for implementation into the database for one or more subsequent column accesses is provided.

    Paged Inverted Index
    8.
    发明申请

    公开(公告)号:US20170154061A1

    公开(公告)日:2017-06-01

    申请号:US14954736

    申请日:2015-11-30

    Applicant: SAP SE

    Abstract: Disclosed herein are system and method embodiments for generating a paged inverted index. An embodiment is generated by storing a first data structure and the second data structure in a plurality of pages, where the plurality of pages are stored in the one or more memories. The first data structure is stored in the plurality of pages and includes a plurality of value identifiers, where a value identifier corresponds to an offset. The second data structure stored in the plurality of pages includes a plurality of row positions, wherein a row position is at a location that corresponds to the offset in the first data structure and identifies a position of row in a table that stores data associated with the value ID.

    UNIFIED PERSISTENCE OVERALL FEATURE

    公开(公告)号:US20250165453A1

    公开(公告)日:2025-05-22

    申请号:US18515731

    申请日:2023-11-21

    Applicant: SAP SE

    Abstract: A database system detects an operation to move a first database object from an in-memory store to a persistence store, wherein the first database object is in either a page-loadable or column-loadable format in the in-memory store, and wherein the first database object comprises a plurality of subcomponents. Each subcomponent of the first database object is examined separately from other subcomponents of the plurality of subcomponents. In response to determining that a first subcomponent of the plurality of subcomponents has a datatype and a compression type supported by a unified persistence format, the first subcomponent is stored in a unified persistence format in the persistence store. In response to determining that a second subcomponent of the plurality of subcomponents has either a datatype or a compression type which is not supported by the unified persistence format, the second subcomponent is stored in a serial persistence format in the persistence store.

    Adaptive compression optimization for effective pruning

    公开(公告)号:US12204517B2

    公开(公告)日:2025-01-21

    申请号:US17933903

    申请日:2022-09-21

    Applicant: SAP SE

    Abstract: A database management system is described that can encode data to generate a plurality of data vectors. The database management system can perform the encoding by using a dictionary. The database management system can adaptively reorder the plurality of data vectors to prepare for compression of the plurality of data vectors. During a forward pass of the adaptive reordering, most frequent values of a data vector of the plurality of data vectors can be moved-up in the data vector. During a backward pass of the adaptive reordering, content within a rest range of a plurality of rest ranges can be rearranged within the plurality of data vectors according to frequencies of the content. The reordering according to frequency can further sort the rest range by value. Related apparatuses, systems, methods, techniques, computer programmable products, computer readable media, and articles are also described.

Patent Agency Ranking