-
Publication number: US11782890B2
Publication date: 2023-10-10
Application number: US17842642
Filing date: 2022-06-16
Applicant: SNOWFLAKE INC.
Inventor: Allison Lee , Shrainik Jain , Qiuye Jin , Stratis Viglas , Jiaqi Yan
IPC: G06F16/00 , G06F16/21 , G06F11/34 , G06F11/07 , G06F16/215 , G06F11/30 , G06F16/2453
CPC classification number: G06F16/217 , G06F11/0772 , G06F11/3072 , G06F11/3428 , G06F11/3452 , G06F16/215 , G06F16/24549
Abstract: A system to repeatedly execute a test run of a workload using resources of a cloud environment to determine whether there is a performance difference in the test run. The system to, in response to determining that there is no performance difference, identify one or more sets of decreased resources of the cloud environment. The system to re-execute the test run using the one or more sets of decreased resources of the cloud environment to determine whether there is a performance difference in the test run that is attributed to the one or more sets of decreased resources of the cloud environment. The system to determine minimum resources of the cloud environment to repeatedly execute the test run using the minimum resources without existence of a performance difference in response to re-executing the test run using the one or more sets of decreased resources of the cloud environment.
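The resource-reduction loop in this abstract can be sketched as follows. This is only an illustration under stated assumptions: `run_test`, `candidate_sizes`, `baseline`, and `tolerance` are hypothetical names, and treating "performance difference" as a runtime exceeding the baseline by a relative tolerance is an assumption, not something the abstract specifies.

```python
def find_minimum_resources(run_test, candidate_sizes, baseline, tolerance=0.05):
    """Return the smallest resource size that shows no performance difference.

    run_test(size) -> elapsed time of the test run on `size` resources (hypothetical).
    baseline       -> elapsed time of the stable run on full resources.
    tolerance      -> assumed relative slowdown that counts as a "difference".
    """
    minimum = None
    # Try progressively smaller resource sets, largest first.
    for size in sorted(candidate_sizes, reverse=True):
        elapsed = run_test(size)
        if elapsed > baseline * (1 + tolerance):
            # Slowdown attributed to the decreased resources: stop shrinking.
            break
        minimum = size
    return minimum


# Hypothetical workload: runtime doubles once fewer than 4 units are available.
def fake_run(size):
    return 10.0 if size >= 4 else 20.0


result = find_minimum_resources(fake_run, [8, 4, 2, 1], baseline=10.0)
```

Here `result` is the smallest size whose run stayed within the assumed tolerance of the baseline.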
-
Publication number: US11755581B2
Publication date: 2023-09-12
Application number: US17822264
Filing date: 2022-08-25
Applicant: Snowflake Inc.
Inventor: Bicheng Gao , Srinivas Vemuri , Jiaqi Yan
IPC: G06F16/2455 , G06F11/34 , G06F16/248 , G06F16/22 , G06F16/2453
CPC classification number: G06F16/24554 , G06F11/3409 , G06F16/2282 , G06F16/248 , G06F16/24534
Abstract: The subject technology receives, during a query compilation process, a query directed to a set of source tables. The subject technology performs, during the query compilation process, a modification of the query for adjusting at least one pruning operation. The subject technology determines, during a pruning process of a second query, the second query directed to a set of files in a database system and including a set of pruning operations on the set of files, whether to perform a pruning cutoff on the set of pruning operations, the pruning process performing a depth first search of a pruner tree structure, the set of files comprising a set of micro-partitions. The subject technology performs the pruning cutoff based on the determining, the pruning cutoff ceasing at least one pruning operation from the set of pruning operations.
-
Publication number: US11645281B1
Publication date: 2023-05-09
Application number: US17823310
Filing date: 2022-08-30
Applicant: Snowflake Inc.
Inventor: Thierry Cruanes , Xuelai Cui , Sangyong Hwang , Allison Waingold Lee , Boyung Lee , Nicola Dan Onose , William Waddington , Jiaqi Yan , Li Yan , Yongsik Yoon
IPC: G06F16/2453
CPC classification number: G06F16/24542 , G06F16/24539
Abstract: The subject technology receives a query, the query including a set of statements for performing the query. The subject technology populates a compilation context based at least in part on the query. The subject technology invokes a compiler to perform a compilation process based on the compilation context. The subject technology performs a lookup operation on a stored plan cache for an exact match based on information from the compilation context. The subject technology, in response to determining an exact match, determines whether the particular query plan requires re-compilation based on a data dependent optimization. The subject technology determines whether a plan cache entry corresponding to the particular query plan includes a data property constraint. The subject technology determines whether the data property constraint still holds based on a set of data properties.
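A minimal sketch of the lookup described in this abstract, assuming the cache key is derived from the compilation context and that a data property constraint can be modeled as a predicate over a dictionary of data properties. The class and method names are hypothetical and do not come from the patent.

```python
class PlanCache:
    """Toy plan cache: exact-match lookup plus an optional data property constraint."""

    def __init__(self):
        self._entries = {}  # key -> (plan, constraint or None)

    def put(self, key, plan, constraint=None):
        # `constraint` is a callable taking the current data properties and
        # returning True while the data dependent optimization is still valid.
        self._entries[key] = (plan, constraint)

    def lookup(self, key, data_properties):
        entry = self._entries.get(key)
        if entry is None:
            return None  # no exact match: the caller compiles the query
        plan, constraint = entry
        if constraint is not None and not constraint(data_properties):
            return None  # constraint no longer holds: the caller re-compiles
        return plan


cache = PlanCache()
# Hypothetical plan that is only valid while the table stays small.
cache.put("q1", "broadcast-join-plan", lambda props: props["row_count"] < 1000)
hit = cache.lookup("q1", {"row_count": 10})
stale = cache.lookup("q1", {"row_count": 5000})
```

In this sketch a stale entry is reported as a miss; a fuller version might also evict or refresh it.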
-
Publication number: US11625367B1
Publication date: 2023-04-11
Application number: US17934369
Filing date: 2022-09-22
Applicant: Snowflake Inc.
Inventor: Istvan Cseri , Benoit Dageville , Ganeshan Ramachandran Iyer , Yucan Liu , Jiaqi Yan
Abstract: Techniques for schema mismatch detection and evolution are described. When data is being uploaded into a source table, schema of the data to be uploaded can be compared with the schema for the source table. If a schema mismatch is detected, the schema of the source table can be modified, and the upload can be continued without data loss.
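The detect-then-evolve flow in this abstract might look like the sketch below. It assumes schemas are simple column-to-type mappings and that a mismatch means the incoming data carries columns the table lacks. Both are illustrative assumptions, and `evolve_schema` is a hypothetical helper name.

```python
def evolve_schema(table_schema, incoming_schema):
    """Compare schemas and return (evolved_schema, added_columns).

    Only the "new column" kind of mismatch is handled; a conflicting type
    for an existing column is rejected rather than guessed at.
    """
    added = {}
    for column, col_type in incoming_schema.items():
        if column not in table_schema:
            added[column] = col_type  # mismatch: column missing from the table
        elif table_schema[column] != col_type:
            raise ValueError(f"type conflict on column {column!r}")
    # Existing columns are kept, so previously loaded rows are not affected.
    return {**table_schema, **added}, added


schema, added = evolve_schema(
    {"id": "INT"},
    {"id": "INT", "name": "STRING"},
)
```

After the call, `schema` contains both columns and the upload can proceed against the widened table.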
-
Publication number: US20230087933A1
Publication date: 2023-03-23
Application number: US18059125
Filing date: 2022-11-28
Applicant: Snowflake Inc.
Inventor: Shrainik Jain , Allison Waingold Lee , Boyung Lee , Prasanna Rajaperumal , Jiaqi Yan , Yongsik Yoon
IPC: G06F16/2453
Abstract: Queries executed against a materialized view can execute up to orders of magnitude faster than equivalent queries on a source (or base) table. However, although a query can reference a materialized view directly, a user (e.g., query author) may not know about a relevant materialized view. Moreover, if a source table has multiple materialized views generated, the user may not know which materialized view to reference in the query. Thus, embodiments of the present disclosure provide techniques for automatically rewriting queries directed to a source table to utilize existing materialized views.
-
Publication number: US20230028008A1
Publication date: 2023-01-26
Application number: US17930165
Filing date: 2022-09-07
Applicant: Snowflake Inc.
Inventor: Thierry Cruanes , Igor Demura , Varun Ganesh , Prasanna Rajaperumal , Libo Wang , Jiaqi Yan
IPC: G06F16/2453 , G06F9/48 , G06F16/14 , G06F16/16 , G06F16/11
Abstract: Embodiments of the present disclosure may provide a dynamic query execution model with fault tolerance and failure recovery techniques. Embodiments of the present disclosure may utilize checkpoints to map processed output files to their corresponding input files. Therefore, if an error occurs in processing one or more files, the system may only need to reschedule processing of selected file(s).
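The checkpoint idea in this abstract can be illustrated with a small sketch: each checkpoint records which input files produced a given output file, so that recovery only reschedules inputs with no recorded output. The `CheckpointLog` class and its methods are hypothetical names used for illustration.

```python
class CheckpointLog:
    """Maps each processed output file to the input files it was built from."""

    def __init__(self):
        self._outputs = {}  # output file -> tuple of input files

    def record(self, output_file, input_files):
        self._outputs[output_file] = tuple(input_files)

    def processed_inputs(self):
        return {f for inputs in self._outputs.values() for f in inputs}

    def files_to_reschedule(self, all_inputs):
        # Only inputs without a checkpointed output need to be processed again.
        done = self.processed_inputs()
        return [f for f in all_inputs if f not in done]


log = CheckpointLog()
log.record("out-1", ["a.csv", "b.csv"])
# Suppose processing of c.csv failed before its output was checkpointed.
pending = log.files_to_reschedule(["a.csv", "b.csv", "c.csv"])
```

Here `pending` contains only the file whose processing was never checkpointed.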
-
Publication number: US11544244B2
Publication date: 2023-01-03
Application number: US17654296
Filing date: 2022-03-10
Applicant: Snowflake Inc.
Inventor: Jiaqi Yan , Thierry Cruanes , Jeffrey Rosen , William Waddington , Prasanna Rajaperumal , Abdul Munir
Abstract: Disclosed herein are embodiments of systems and methods for selecting partitions for reclustering based on distribution of overlapping partitions. In an example, a database platform makes a determination to at least partially recluster a database table that includes data stored across a plurality of partitions. The database platform responsively selects a subset of the partitions. The selecting of the subset includes identifying a point on a domain of a clustering key that corresponds to a local maximum of overlapping partitions, and also includes selecting the subset from among a group of overlapping partitions. The group includes at least one partition that overlaps the identified point on the domain of the clustering key. Each partition in the selected subset is above a reduction goal of overlapping partitions. The database platform at least partially reclusters the selected subset based on the clustering key.
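One way to picture the selection step is a sweep over partition key ranges. The sketch below assumes each partition is a closed `(min_key, max_key)` interval on the clustering key and picks the global peak of overlap, which is one particular local maximum. The reduction-goal filter mentioned in the abstract is not modeled, and the function names are hypothetical.

```python
def peak_overlap_point(partitions):
    """Return (point, depth) where the most partitions overlap."""
    events = []
    for lo, hi in partitions:
        # Opens sort before closes at the same key, so ranges that merely
        # touch at an endpoint are counted as overlapping there.
        events.append((lo, 0, 1))
        events.append((hi, 1, -1))
    events.sort()

    depth = best_depth = 0
    best_point = None
    for key, _, delta in events:
        depth += delta
        if depth > best_depth:
            best_depth, best_point = depth, key
    return best_point, best_depth


def partitions_overlapping(partitions, point):
    """Candidate group: every partition whose range covers `point`."""
    return [p for p in partitions if p[0] <= point <= p[1]]


parts = [(0, 10), (5, 15), (8, 12), (20, 30)]
point, depth = peak_overlap_point(parts)
group = partitions_overlapping(parts, point)
```

The subset to recluster would then be chosen from `group`; how that choice relates to the reduction goal is left open here.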
-
Publication number: US20220358107A1
Publication date: 2022-11-10
Application number: US17869071
Filing date: 2022-07-20
Applicant: Snowflake Inc.
Inventor: Jiaqi Yan , Qiuye Jin , Shrainik Jain , Stratis Viglas , Allison Lee
IPC: G06F16/21 , G06F11/34 , G06F11/07 , G06F16/215 , G06F11/30 , G06F16/2453
Abstract: Systems, methods, and devices for feature release and workload capture in database systems are disclosed. The method includes determining a workload based on one or more client queries to be rerun to test a feature that is unreleased to one or more database clients. The method includes repeatedly executing a test run of the workload to determine a stability factor of the test run. The method includes re-executing, in response to determining the stability factor of the test run, the test run using resources with a different concurrency to confirm the stability factor of the test run. The method includes releasing the feature to the one or more database clients in response to confirming the stability factor of the test run.
-
Publication number: US11461327B1
Publication date: 2022-10-04
Application number: US17658535
Filing date: 2022-04-08
Applicant: Snowflake Inc.
Inventor: Thierry Cruanes , Xuelai Cui , Sangyong Hwang , Allison Waingold Lee , Boyung Lee , Nicola Dan Onose , William Waddington , Jiaqi Yan , Li Yan , Yongsik Yoon
IPC: G06F16/2453
Abstract: The subject technology receives a query, the query including a set of statements for performing the query. The subject technology populates a compilation context based at least in part on the query. The subject technology provides the compilation context to a compiler. The subject technology invokes the compiler to perform a compilation process based on the compilation context, the compilation process comprising performing a lookup operation on a stored plan cache for an exact match based on information from the compilation context, the stored plan cache including a set of stored query plans, and determining whether the exact match of a particular query plan is found in the stored plan cache to avoid compiling the query using the compilation context.
-
Publication number: US11461325B2
Publication date: 2022-10-04
Application number: US17333358
Filing date: 2021-05-28
Applicant: Snowflake Inc.
Inventor: Thierry Cruanes , Igor Demura , Varun Ganesh , Prasanna Rajaperumal , Libo Wang , Jiaqi Yan
Abstract: Embodiments of the present disclosure may provide a dynamic query execution model with fault tolerance and failure recovery techniques. Embodiments of the present disclosure may utilize checkpoints to map processed output files to their corresponding input files. Therefore, if an error occurs in processing one or more files, the system may only need to reschedule processing of selected file(s).