Incremental refresh of a materialized view

    Publication No.: US11461309B2

    Publication date: 2022-10-04

    Application No.: US17238539

    Application date: 2021-04-23

    Applicant: Snowflake Inc.

    Abstract: Systems, methods, and devices for incrementally refreshing a materialized view are disclosed. A method includes generating a materialized view based on a source table. The method includes merging the source table and the materialized view to generate a merged table to identify whether an update has been executed on the source table that is not reflected in the materialized view. The method includes, in response to detecting an update made to the source table that is not reflected in the materialized view, applying the update to the materialized view.
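
The merge-and-compare step in this abstract can be illustrated with a minimal sketch. It assumes, purely for illustration, that both the source table and the materialized view are in-memory dictionaries keyed by a primary key; the function name `incremental_refresh` and the row layout are hypothetical and are not taken from the patent.

```python
from typing import Dict, Hashable, Tuple

Row = Tuple  # hypothetical row representation: a plain tuple of column values


def incremental_refresh(source: Dict[Hashable, Row],
                        view: Dict[Hashable, Row]) -> Dict[str, int]:
    """Merge source and view by key and apply only the differences to the view."""
    stats = {"inserted": 0, "updated": 0, "deleted": 0}
    # "Merge": walk the union of keys so every row on either side is inspected.
    for key in set(source) | set(view):
        in_source = key in source
        in_view = key in view
        if in_source and not in_view:
            # Row was added to the source after the view was built.
            view[key] = source[key]
            stats["inserted"] += 1
        elif in_view and not in_source:
            # Row was deleted from the source; the view is stale.
            del view[key]
            stats["deleted"] += 1
        elif source[key] != view[key]:
            # Row changed in the source; propagate the new values.
            view[key] = source[key]
            stats["updated"] += 1
    return stats


source = {1: ("a",), 2: ("b2",), 4: ("d",)}
view = {1: ("a",), 2: ("b",), 3: ("c",)}
stats = incremental_refresh(source, view)
```

After the call, `view` equals `source` and only the three stale rows were touched; unchanged rows are never rewritten, which is the point of refreshing incrementally rather than rebuilding the view.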

    Incremental reclustering based on local maxima of partition overlap

    Publication No.: US11442917B2

    Publication date: 2022-09-13

    Application No.: US17243795

    Application date: 2021-04-29

    Applicant: Snowflake Inc.

    Abstract: Disclosed herein are systems and methods for incremental reclustering of database tables based on local maxima of partition overlap. In an embodiment, a database platform makes a determination, based on one or more incremental-reclustering criteria, to incrementally recluster a database table, which has a clustering key and which is stored across a plurality of partitions. In response to making the determination, the database platform selects a subset of the partitions, and at least incrementally reclusters the selected subset. The selecting of the subset includes identifying a local maximum of a quantity of overlapping partitions in the plurality of partitions with respect to a domain of the clustering key of the table, where the overlapping partitions overlap with respect to the clustering key.
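
One way to picture "a local maximum of a quantity of overlapping partitions" is a sweep over the clustering-key domain that tracks how many partitions cover each point. The sketch below is an assumption-laden illustration, not the patented method: partitions are modeled as inclusive `(min_key, max_key)` integer ranges, only the tallest peak is used, and `select_partitions_at_peak` is a hypothetical name.

```python
from typing import List, Tuple

Partition = Tuple[int, int]  # hypothetical: (min_key, max_key), both inclusive


def select_partitions_at_peak(partitions: List[Partition]) -> List[int]:
    """Return indices of the partitions covering the point of deepest overlap."""
    events = []
    for lo, hi in partitions:
        events.append((lo, 0))  # range opens; 0 sorts before 1 at the same key
        events.append((hi, 1))  # range closes
    events.sort()

    depth = 0
    best_depth = 0
    best_key = None
    for key, kind in events:
        if kind == 0:
            depth += 1
            if depth > best_depth:
                # New highest overlap seen so far; remember where it occurs.
                best_depth, best_key = depth, key
        else:
            depth -= 1

    if best_key is None:
        return []
    # The subset to recluster: every partition whose range covers the peak.
    return [i for i, (lo, hi) in enumerate(partitions) if lo <= best_key <= hi]


partitions = [(0, 10), (5, 15), (8, 12), (20, 30), (25, 35)]
subset = select_partitions_at_peak(partitions)
```

Here the first three ranges all cover key 8, so they form the selected subset, while the two ranges around key 25 overlap less deeply and are left alone.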

    INCREMENTAL FEATURE DEVELOPMENT AND WORKLOAD CAPTURE IN DATABASE SYSTEMS

    Publication No.: US20220237161A1

    Publication date: 2022-07-28

    Application No.: US17721938

    Application date: 2022-04-15

    Applicant: Snowflake Inc.

    Abstract: Systems, methods, and devices for incremental feature development and workload capture in database systems are disclosed. The method includes determining a workload based on one or more client queries to be rerun for testing the feature release. The method further includes repeatedly executing a test run of the workload to identify whether there is a performance difference in the test run. The method further includes re-executing, in response to identifying the performance difference, the test run using resources with a different concurrency to identify whether the performance difference still exists. The method further includes flagging the performance difference as a false positive in response to identifying that the performance difference no longer exists when the test run is re-executed using the resources with different concurrency.
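
The re-run logic in this abstract can be sketched as a small decision procedure. Everything concrete here is an assumption made for illustration: the `run` callback returning a `(baseline_seconds, candidate_seconds)` pair, the 10% tolerance, and the three result labels do not come from the patent.

```python
from typing import Callable, Tuple

# Hypothetical callback: runs the workload at a given concurrency and returns
# (baseline_seconds, candidate_seconds) for the old and new feature versions.
RunFn = Callable[[int], Tuple[float, float]]


def differs(baseline: float, candidate: float, tolerance: float) -> bool:
    """True when the candidate timing deviates from baseline by more than tolerance."""
    return abs(candidate - baseline) > tolerance * baseline


def classify(run: RunFn, concurrency: int, alt_concurrency: int,
             tolerance: float = 0.10) -> str:
    baseline, candidate = run(concurrency)
    if not differs(baseline, candidate, tolerance):
        return "no_difference"
    # A difference showed up: repeat the test run with a different concurrency.
    baseline, candidate = run(alt_concurrency)
    if differs(baseline, candidate, tolerance):
        return "confirmed"
    # The difference vanished under different concurrency, so flag it.
    return "false_positive"


timings = {8: (10.0, 14.0), 2: (10.0, 10.2)}
result = classify(lambda c: timings[c], concurrency=8, alt_concurrency=2)
```

With these made-up timings the slowdown appears only at concurrency 8, so `result` is `"false_positive"`.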

    Scalable query processing
    Granted patent

    Publication No.: US11347735B2

    Publication date: 2022-05-31

    Application No.: US16889033

    Application date: 2020-06-01

    Applicant: Snowflake Inc.

    Abstract: Embodiments of the present disclosure may provide a dynamic query execution model. This query execution model may provide acceleration by scaling out parallel parts of a query (also referred to as a fragment) to additional computing resources, for example computing resources leased from a pool of computing resources. Execution of the parts of the query may be coordinated by a parent query coordinator, where the query originated, and a fragment query coordinator.

    PRUNING CUTOFFS FOR DATABASE SYSTEMS

    Publication No.: US20220138202A1

    Publication date: 2022-05-05

    Application No.: US17540945

    Application date: 2021-12-02

    Applicant: Snowflake Inc.

    Abstract: The subject technology receives, during a query compilation process, a query directed to a set of source tables, each source table from the set of source tables being organized into at least one micro-partition and the query including at least one pruning operation. The subject technology performs, during the query compilation process, a modification of the query for adjusting the at least one pruning operation, the modification being based at least in part on a set of statistics collected for previous pruning operations on at least a portion of the set of source tables and a set of heuristics. The subject technology compiles the query including the modification of the query. The subject technology provides the compiled query to an execution node of a database system for execution.

    INCREMENTAL RECLUSTERING OF DATABASE TABLES USING RECLUSTERING-COUNT LEVELS

    Publication No.: US20220067016A1

    Publication date: 2022-03-03

    Application No.: US17511064

    Application date: 2021-10-26

    Applicant: Snowflake Inc.

    Abstract: The subject technology determines whether a table is sufficiently clustered. The subject technology, in response to determining that the table is not sufficiently clustered, selects one or more micro-partitions of the table to be reclustered. The subject technology constructs a data structure for the table. The subject technology extracts minimum and maximum endpoints for each micro-partition in the data structure. The subject technology sorts each of one or more peaks in the data structure based on height. The subject technology sorts overlapping micro-partitions based on width. The subject technology selects based on which micro-partitions are within the tallest peaks of the one or more peaks and further based on which of the overlapping micro-partitions have the widest widths.
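
The height-then-width ordering described in this abstract can be sketched as follows. This is a simplified illustration under stated assumptions: micro-partitions are inclusive `(min, max)` integer ranges, a "peak" is approximated by the number of ranges covering each minimum endpoint, and the `budget` parameter and the name `select_for_reclustering` are hypothetical.

```python
from typing import List, Tuple


def select_for_reclustering(parts: List[Tuple[int, int]], budget: int) -> List[int]:
    """Pick up to `budget` partition indices: tallest peaks first, widest ranges first."""
    if budget <= 0:
        return []
    # Overlap depth can only rise at a minimum endpoint, so probe those keys.
    peaks = []
    for key in sorted({lo for lo, _ in parts}):
        covering = [i for i, (lo, hi) in enumerate(parts) if lo <= key <= hi]
        peaks.append((len(covering), key, covering))
    # Sort peaks by height, tallest first.
    peaks.sort(key=lambda p: -p[0])

    selected: List[int] = []
    for height, _, covering in peaks:
        if height < 2:
            break  # a single partition at this key means no overlap to fix
        # Within a peak, prefer the widest micro-partitions.
        widest_first = sorted(covering,
                              key=lambda i: parts[i][1] - parts[i][0],
                              reverse=True)
        for i in widest_first:
            if i not in selected:
                selected.append(i)
                if len(selected) == budget:
                    return selected
    return selected


parts = [(0, 10), (4, 15), (8, 12), (20, 30), (25, 35)]
chosen = select_for_reclustering(parts, budget=2)
```

In this example the tallest peak sits at key 8 and is covered by the first three ranges; the two widest of those, indices 1 and 0, fill the budget.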

    RESOURCE PROVISIONING IN DATABASE SYSTEMS

    Publication No.: US20210397615A1

    Publication date: 2021-12-23

    Application No.: US17462699

    Application date: 2021-08-31

    Applicant: Snowflake Inc.

    Abstract: Resource provisioning systems and methods are described. In an embodiment, a system includes a plurality of shared storage devices collectively storing database data, an execution platform, and a compute service manager. The compute service manager is configured to determine a task to be executed in response to a trigger event and determine a query plan for executing the task, wherein the query plan comprises a plurality of discrete subtasks. The compute service manager is further configured to assign the plurality of discrete subtasks to one or more nodes of a plurality of nodes of the execution platform, determine whether execution of the task is complete, and in response to determining the execution of the task is complete, store a record in the plurality of shared storage devices indicating the task was completed.

    SCALABLE QUERY PROCESSING
    Patent application

    Publication No.: US20210374135A1

    Publication date: 2021-12-02

    Application No.: US16889033

    Application date: 2020-06-01

    Applicant: Snowflake Inc.

    Abstract: Embodiments of the present disclosure may provide a dynamic query execution model. This query execution model may provide acceleration by scaling out parallel parts of a query (also referred to as a fragment) to additional computing resources, for example computing resources leased from a pool of computing resources. Execution of the parts of the query may be coordinated by a parent query coordinator, where the query originated, and a fragment query coordinator.
