Cutoffs for pruning of database queries

    公开(公告)号:US11755581B2

    公开(公告)日:2023-09-12

    申请号:US17822264

    申请日:2022-08-25

    Applicant: Snowflake Inc.

    Abstract: The subject technology receives, during a query compilation process, a query directed to a set of source tables. The subject technology performs, during the query compilation process, a modification of the query for adjusting at least one pruning operation. The subject technology determines, during a pruning process of a second query, the second query directed to a set of files in a database system and including a set of pruning operations on the set of files, whether to perform a pruning cutoff on the set of pruning operations, the pruning process performing a depth first search of a pruner tree structure, the set of files comprising a set of micro-partitions. The subject technology performs the pruning cutoff based on the determining, the pruning cutoff ceasing at least one pruning operation from the set of pruning operations.

    Caching query plans in database systems

    公开(公告)号:US11645281B1

    公开(公告)日:2023-05-09

    申请号:US17823310

    申请日:2022-08-30

    Applicant: Snowflake Inc.

    CPC classification number: G06F16/24542 G06F16/24539

    Abstract: The subject technology receives a query, the query including a set of statements for performing the query. The subject technology populates a compilation context based at least in part the query. The subject technology invokes a compiler to perform a compilation process based on the compilation context. The subject technology performs a lookup operation on a stored plan cache for an exact match based on information from the compilation context. The subject technology, in response to determining an exact match, determines whether the particular query plan requires re-compilation based on a data dependent optimization. The subject technology determines whether a plan cache entry corresponding to the particular query plan includes a data property constraint. The subject technology determines whether the data property constraint still holds based on a set of data properties.

    TRANSIENT MATERIALIZED VIEW REWRITE
    125.
    发明申请

    公开(公告)号:US20230087933A1

    公开(公告)日:2023-03-23

    申请号:US18059125

    申请日:2022-11-28

    Applicant: Snowflake Inc.

    Abstract: Queries executed against a materialized view can execute up to orders of magnitude faster than equivalent queries on a source (or base) table. However, although a query can reference a materialized view directly, a user (e.g., query author) may not know about a relevant materialized view. Moreover, if a source table has multiple materialized views generated, the user may not know which materialized view to reference in the query. Thus, embodiments of the present disclosure provide techniques for automatically rewriting queries directed to a source table to utilize existing materialized views.

    Selecting partitions for reclustering based on distribution of overlapping partitions

    公开(公告)号:US11544244B2

    公开(公告)日:2023-01-03

    申请号:US17654296

    申请日:2022-03-10

    Applicant: Snowflake Inc.

    Abstract: Disclosed herein are embodiments of systems and methods for selecting partitions for reclustering based on distribution of overlapping partitions. In an example, a database platform makes a determination to at least partially recluster a database table that includes data stored across a plurality of partitions. The database platform responsively selects a subset of the partitions. The selecting of the subset includes identifying a point on a domain of a clustering key that corresponds to a local maximum of overlapping partitions, and also includes selecting the subset from among a group of overlapping partitions. The group includes at least one partition that overlaps the identified point on the domain of the clustering key. Each partition in the selected subset is above a reduction goal of overlapping partitions. The database platform at least partially reclusters the selected subset based on the clustering key.

    FEATURE RELEASE AND WORKLOAD CAPTURE IN DATABASE SYSTEMS

    公开(公告)号:US20220358107A1

    公开(公告)日:2022-11-10

    申请号:US17869071

    申请日:2022-07-20

    Applicant: Snowflake Inc.

    Abstract: Systems, methods, and devices for feature release and workload capture in database systems are disclosed. The method includes determining a workload based on one or more client queries to be rerun to test a feature that is unreleased to one or more database clients. The method includes repeatedly executing a test run of the workload to determine a stability factor of the test run. The method includes re-executing, in response to determining the stability factor of the test run, the test run using resources with a different concurrency to confirm the stability factor of the test run. The method includes releasing the feature to the one or more database clients in response to confirming the stability factor of the test run.

    Query plan caching for networked database systems

    公开(公告)号:US11461327B1

    公开(公告)日:2022-10-04

    申请号:US17658535

    申请日:2022-04-08

    Applicant: Snowflake Inc.

    Abstract: The subject technology receives a query, the query including a set of statements for performing the query. The subject technology populates a compilation context based at least in part the query. The subject technology provides the compilation context to a compiler. The subject technology invokes the compiler to perform a compilation process based on the compilation context, the compilation process comprising performing a lookup operation on a stored plan cache for an exact match based on information from the compilation context, the stored plan cache including a set of stored query plans, and determining whether the exact match of a particular query plan is found in the stored plan cache to avoid compiling the query using the compilation context.

Patent Agency Ranking