Increasing clustering of partitions of a table

    公开(公告)号:US11100142B2

    公开(公告)日:2021-08-24

    申请号:US17030549

    申请日:2020-09-24

    Applicant: Snowflake Inc.

    Abstract: The subject technology creates one or more partitions based on changes to a table, at least one of the one or more partitions overlapping with respect to values of one or more attributes with at least one of another partition and a previous partition, the creating of the one or more partitions resulting in a decrease in a degree of clustering of the table. The subject technology determines, after creating the one or more partitions, that the degree of clustering of the table is below a clustering threshold, the clustering threshold corresponding to a clustering ratio, the clustering ratio determined by at least a proportion of rows in a layout of the table that satisfy an ordering criteria based at least in part a particular attribute of the one or more attributes. The subject technology increases the degree of clustering of the table by reclustering partitions of the table.

    INCREMENTAL RECLUSTERING BASED ON LOCAL MAXIMA OF PARTITION OVERLAP

    公开(公告)号:US20210248126A1

    公开(公告)日:2021-08-12

    申请号:US17243795

    申请日:2021-04-29

    Applicant: Snowflake Inc.

    Abstract: Disclosed herein are systems and methods for incremental reclustering of database tables based on local maxima of partition overlap. In an embodiment, a database platform makes a determination, based on one or more incremental-reclustering criteria, to incrementally recluster a database table, which has a clustering key and which is stored across a plurality of partitions. In response to making the determination, the database platform selects a subset of the partitions, and at least incrementally reclusters the selected subset. The selecting of the subset includes identifying a local maximum of a quantity of overlapping partitions in the plurality of partitions with respect to a domain of the clustering key of the table, where the overlapping partitions overlap with respect to the clustering key.

    INCREMENTAL REFRESH OF A MATERIALIZED VIEW

    公开(公告)号:US20210240695A1

    公开(公告)日:2021-08-05

    申请号:US17238539

    申请日:2021-04-23

    Applicant: Snowflake Inc.

    Abstract: Systems, methods, and devices for incrementally refreshing a materialized view are disclosed. A method includes generating a materialized view based on a source table. The method includes merging the source table and the materialized view to generate a merged table to identify whether an update has been executed on the source table that is not reflected in the materialized view. The method includes, in response to detecting an update made to the source table that is not reflected in the materialized view, applying the update to the materialized view.

    MAINTAINING STATES OF PARTITIONS OF A TABLE FOR ADJUSTING CLUSTERING RATIOS

    公开(公告)号:US20210216574A1

    公开(公告)日:2021-07-15

    申请号:US17219609

    申请日:2021-03-31

    Applicant: Snowflake Inc.

    Abstract: The subject technology creates partitions based on changes to a table, at least one of the one or more partitions overlapping with respect to values of one or more attributes with at least one of another partition and a previous partition. The subject technology maintains states for the partitions, each state from the plurality of states representing a particular degree of clustering of the table. The subject technology determines a number of overlapping partitions and a depth of the overlapping partitions, and determines a clustering ratio based at least in part on the number of overlapping partitions and the depth. The subject technology reclusters partitions of the table to increase the clustering ratio, the clustering ratio determined by at least a proportion of rows in a layout of the table that satisfy an ordering criteria based at least in part a particular attribute of the one or more attributes.

    Flexible computing
    95.
    发明授权

    公开(公告)号:US11055142B1

    公开(公告)日:2021-07-06

    申请号:US17084861

    申请日:2020-10-30

    Applicant: Snowflake Inc.

    Abstract: Embodiments of the present disclosure may provide dynamic and fair assignment techniques for allocating resources on a demand basis. Assignment control may be separated into at least two components: a local component and a global component. Each component may have an active dialog with each other; the dialog may include two aspects: 1) a demand for computing resources, and 2) a total allowed number of computing resources. The global component may allocate resources from a pool of resources to different local components, and the local components in turn may assign their allocated resources to local competing requests. The allocation may also be throttled or limited at various levels.

    RECLUSTERING OF DATABASE TABLES BASED ON PEAKS AND WIDTHS

    公开(公告)号:US20210200737A1

    公开(公告)日:2021-07-01

    申请号:US17249796

    申请日:2021-03-12

    Applicant: Snowflake Inc.

    Abstract: The subject technology determines whether a table is sufficiently clustered. The subject technology in response to determining the table is not sufficiently clustered, selects one or more micro-partitions of the table to be reclustered. The subject technology constructs a data structure for the table. The subject technology extracts minimum and maximum endpoints for each micro-partition in the data structure. The subject technology sorts each of one or more peaks in the data structure based on height. The subject technology sorts overlapping micro-partitions based on width. The subject technology selects based on which micro-partitions are within the tallest peaks of the one or more peaks and further based on which of the overlapping micro-partitions have the widest widths.

    Data sharing in database systems
    97.
    发明授权

    公开(公告)号:US10997169B2

    公开(公告)日:2021-05-04

    申请号:US17125453

    申请日:2020-12-17

    Applicant: Snowflake Inc.

    Abstract: Using one or more processors, generating a materialized view over a share object, the share object, including data associated with a first account. Using the one or more processors, merging the materialized view and the share object. Using the one or more processors, updating the data associated with the first account subsequent to generating the materialized view. Using the one or more processors, generating a refreshed materialized view with respect to the share object in response to detecting the updated data. Using the one or more processors, sharing the refreshed materialized view to a second account in response to a request from the second account.

    DATA SHARING IN DATABASE SYSTEMS
    99.
    发明申请

    公开(公告)号:US20210103587A1

    公开(公告)日:2021-04-08

    申请号:US17125453

    申请日:2020-12-17

    Applicant: Snowflake Inc.

    Abstract: The subject technology generates, using one or more processors, a materialized view over a share object, the share object including data associated with a first account. The subject technology merges, using the one or more processors, the materialized view and the share object. The subject technology updates, using the one or more processors, the data associated with the first account subsequent to generating the materialized view. The subject technology generates, using the one or more processors, a refreshed materialized view with respect to the share object in response to detecting the updated data. The subject technology shares, using the one or more processors, the refreshed materialized view to a second account in response to a request from the second account.

Patent Agency Ranking