-
公开(公告)号:US11461309B2
公开(公告)日:2022-10-04
申请号:US17238539
申请日:2021-04-23
Applicant: Snowflake Inc.
Inventor: Thierry Cruanes , Benoit Dageville , Prasanna Rajaperumal , Jiaqi Yan
Abstract: Systems, methods, and devices for incrementally refreshing a materialized view are disclosed. A method includes generating a materialized view based on a source table. The method includes merging the source table and the materialized view to generate a merged table to identify whether an update has been executed on the source table that is not reflected in the materialized view. The method includes, in response to detecting an update made to the source table that is not reflected in the materialized view, applying the update to the materialized view.
-
公开(公告)号:US11442917B2
公开(公告)日:2022-09-13
申请号:US17243795
申请日:2021-04-29
Applicant: Snowflake Inc.
Inventor: Jiaqi Yan , Thierry Cruanes , Jeffrey Rosen , William Waddington , Prasanna Rajaperumal , Abdul Munir
Abstract: Disclosed herein are systems and methods for incremental reclustering of database tables based on local maxima of partition overlap. In an embodiment, a database platform makes a determination, based on one or more incremental-reclustering criteria, to incrementally recluster a database table, which has a clustering key and which is stored across a plurality of partitions. In response to making the determination, the database platform selects a subset of the partitions, and at least incrementally reclusters the selected subset. The selecting of the subset includes identifying a local maximum of a quantity of overlapping partitions in the plurality of partitions with respect to a domain of the clustering key of the table, where the overlapping partitions overlap with respect to the clustering key.
-
公开(公告)号:US20220237161A1
公开(公告)日:2022-07-28
申请号:US17721938
申请日:2022-04-15
Applicant: Snowflake Inc.
Inventor: Jiaqi Yan , Qiuye Jin , Shrainik Jain , Stratis Viglas , Allison Lee
IPC: G06F16/21 , G06F11/34 , G06F11/07 , G06F16/215 , G06F11/30 , G06F16/2453
Abstract: Systems, methods, and devices for incremental feature development and workload capture in database systems are disclosed. The method includes determining a workload based on one or more client queries to be rerun for testing the feature release. The method further includes repeatedly executing a test run of the workload to identify whether there is a performance difference in the test run. The method further includes re-execute, in response to identifying the performance difference, the test run using resources with a different concurrency to identify whether the performance difference still exists. The method further includes flagging the performance difference as a false positive in response to identifying that the performance difference no longer exists when the test run is re-executed using the resources with different concurrency.
-
公开(公告)号:US11386059B2
公开(公告)日:2022-07-12
申请号:US17463514
申请日:2021-08-31
Applicant: SNOWFLAKE INC.
Inventor: Allison Lee , Shrainik Jain , Qiuye Jin , Stratis Viglas , Jiaqi Yan
IPC: G06F16/00 , G06F16/21 , G06F11/34 , G06F11/07 , G06F16/215 , G06F11/30 , G06F16/2453
Abstract: A baseline run of a workload and a target run of the workload are compared to identify whether there is a performance regression in the target run, wherein the target run implements a feature that is not implemented in the baseline run. In response to identifying the performance regression, the target run is executed to identify whether the performance regression still exists. The performance regression is flagged as a false positive in response to identifying that the performance regression no longer exists when the target run is executed.
-
公开(公告)号:US11347735B2
公开(公告)日:2022-05-31
申请号:US16889033
申请日:2020-06-01
Applicant: Snowflake Inc.
Inventor: Thierry Cruanes , Igor Demura , Varun Ganesh , Prasanna Rajaperumal , Libo Wang , Jiaqi Yan
IPC: G06F16/2453
Abstract: Embodiments of the present disclosure may provide a dynamic query execution model. This query execution model may provide acceleration by scaling out parallel parts of a query (also referred to as a fragment) to additional computing resources, for example computing resources leased from a pool of computing resources. Execution of the parts of the query may be coordinated by a parent query coordinator, where the query originated, and a fragment query coordinator.
-
公开(公告)号:US20220138202A1
公开(公告)日:2022-05-05
申请号:US17540945
申请日:2021-12-02
Applicant: Snowflake Inc.
Inventor: Bicheng Gao , Srinivas Vemuri , Jiaqi Yan
IPC: G06F16/2455 , G06F11/34 , G06F16/248 , G06F16/22
Abstract: The subject technology receives, during a query compilation process, a query directed to a set of source tables, each source table from the set of source tables being organized into at least one micro-partition and the query including at least one pruning operation. The subject technology performs, during the query compilation process, a modification of the query for adjusting the at least one pruning operation, the modification being based at least in part on a set of statistics collected for previous pruning operations on at least a portion of the set of source tables and a set of heuristics. The subject technology compiles the query including the modification of the query. The subject technology provides the compiled query to an execution node of a database system for execution.
-
公开(公告)号:US20220067016A1
公开(公告)日:2022-03-03
申请号:US17511064
申请日:2021-10-26
Applicant: Snowflake Inc.
Inventor: Jiaqi Yan , Thierry Cruanes , Jeffrey Rosen , William Waddington , Prasanna Rajaperumal , Abdul Munir
Abstract: The subject technology determines whether a table is sufficiently clustered. The subject technology in response to determining the table is not sufficiently clustered, selects one or more micro-partitions of the table to be reclustered. The subject technology constructs a data structure for the table. The subject technology extracts minimum and maximum endpoints for each micro-partition in the data structure. The subject technology sorts each of one or more peaks in the data structure based on height. The subject technology sorts overlapping micro-partitions based on width. The subject technology selects based on which micro-partitions are within the tallest peaks of the one or more peaks and further based on which of the overlapping micro-partitions have the widest widths.
-
公开(公告)号:US20210397615A1
公开(公告)日:2021-12-23
申请号:US17462699
申请日:2021-08-31
Applicant: Snowflake Inc.
Inventor: Jeffrey Rosen , Abdul Munir , Jiaqi Yan , William Waddington , Prasanna Rajaperumal , Thierry Cruanes
IPC: G06F16/2458 , G06F16/2453 , G06F9/50 , G06F16/2455
Abstract: Resource provisioning systems and methods are described. In an embodiment, a system includes a plurality of shared storage devices collectively storing database data, an execution platform, and a compute service manager. The compute service manager is configured to determine a task to be executed in response to a trigger event and determine a query plan for executing the task, wherein the query plan comprises a plurality of discrete subtasks. The compute service manager is further configured to assign the plurality of discrete subtasks to one or more nodes of a plurality of nodes of the execution platform, determine whether execution of the task is complete, and in response to determining the execution of the task is complete, store a record in the plurality of shared storage devices indicating the task was completed.
-
公开(公告)号:US20210374135A1
公开(公告)日:2021-12-02
申请号:US16889033
申请日:2020-06-01
Applicant: Snowflake Inc.
Inventor: Thierry Cruanes , Igor Demura , Varun Ganesh , Prasanna Rajaperumal , Libo Wang , Jiaqi Yan
IPC: G06F16/2453
Abstract: Embodiments of the present disclosure may provide a dynamic query execution model. This query execution model may provide acceleration by scaling out parallel parts of a query (also referred to as a fragment) to additional computing resources, for example computing resources leased from a pool of computing resources. Execution of the parts of the query may be coordinated by a parent query coordinator, where the query originated, and a fragment query coordinator.
-
公开(公告)号:US11163768B1
公开(公告)日:2021-11-02
申请号:US16889042
申请日:2020-06-01
Applicant: Snowflake Inc.
Inventor: Thierry Cruanes , Igor Demura , Varun Ganesh , Prasanna Rajaperumal , Libo Wang , Jiaqi Yan
IPC: G06F16/2453 , G06F9/48 , G06F16/14 , G06F16/16 , G06F16/11
Abstract: Embodiments of the present disclosure may provide a dynamic query execution model with fault tolerance and failure recovery techniques. Embodiments of the present disclosure may utilize checkpoints to map processed output files to their corresponding input files. Therefore, if an error occurs in processing one or more files, the system may only need to reschedule processing of selected file(s).
-
-
-
-
-
-
-
-
-