SCALABLE QUERY PROCESSING
    22.
    发明申请

    公开(公告)号:US20220414097A1

    公开(公告)日:2022-12-29

    申请号:US17823572

    申请日:2022-08-31

    Applicant: Snowflake Inc.

    Abstract: Embodiments of the present disclosure may provide a dynamic query execution model. This query execution model may provide acceleration by scaling out parallel parts of a query (also referred to as a fragment) to additional computing resources, for example computing resources leased from a pool of computing resources. Execution of the parts of the query may be coordinated by a parent query coordinator, where the query originated, and a fragment query coordinator.

    Merge small file consolidation
    23.
    发明授权

    公开(公告)号:US11537613B1

    公开(公告)日:2022-12-27

    申请号:US17514084

    申请日:2021-10-29

    Applicant: Snowflake Inc.

    Abstract: The subject technology receives a query plan corresponding to a query. The subject technology executes the query based at least in part on the query plan, the executing including: filtering a first set of files that are to be modified by a merge statement, performing a split operation to send information related to a second set of files to a scan set builder operation in a first portion of the query plan and scan back operation in a second portion of the query plan, performing the scan set builder operation to remove the second set of files from the first set of files, performing a table scan operation based on a third set of files, and performing a first union all operation to combine the first set of data with a second set of data as a first set of combined data.

    SYSTEM AND METHOD FOR DISJUNCTIVE JOINS

    公开(公告)号:US20220391390A1

    公开(公告)日:2022-12-08

    申请号:US17879615

    申请日:2022-08-02

    Applicant: SNOWFLAKE INC.

    Abstract: Joining data using a disjunctive operator is described. An example computer-implemented method can include generating a query plan for a query, wherein there is a join operator expression for each of a plurality of disjunctive predicates and each join operator expression includes at least a conjunctive predicate and a disjunctive operator. The method may also include generating a bloom filter for each of the plurality of disjunctive operators. The method may further include evaluating each of the plurality of join operator expressions using a corresponding one of the plurality of disjunctive operators and bloom filter for each of the plurality of disjunctive predicates to generate a result set.

    Resource provisioning in database systems

    公开(公告)号:US11514064B2

    公开(公告)日:2022-11-29

    申请号:US17663248

    申请日:2022-05-13

    Applicant: Snowflake Inc.

    Abstract: Resource provisioning systems and methods are described. In an embodiment, a system includes a plurality of shared storage devices collectively storing database data, an execution platform, and a compute service manager. The compute service manager is configured to determine a task to be executed in response to a trigger event and determine a query plan for executing the task, wherein the query plan comprises a plurality of discrete subtasks. The compute service manager is further configured to assign the plurality of discrete subtasks to one or more nodes of a plurality of nodes of the execution platform, determine whether execution of the task is complete, and in response to determining the execution of the task is complete, store a record in the plurality of shared storage devices indicating the task was completed.

    Materialized views over external tables in database systems

    公开(公告)号:US11507571B2

    公开(公告)日:2022-11-22

    申请号:US16385720

    申请日:2019-04-16

    Applicant: Snowflake Inc.

    Abstract: Systems, methods, and devices for generating a materialized view over an external table. A method includes connecting a database platform to an external table such that the database platform has read access for the external table and does not have write access for the external table. The method includes generating, by the database platform, a materialized view over the external table. The method includes receiving a notification that a modification has been made to the external table, the modification comprising one or more of an addition of a file, a deletion of a file, or an update to a file in a source location for the external table. The method includes, in response to the external table being modified, refreshing the materialized view such that the materialized view comprises an accurate representation of the external table.

    Data pruning based on metadata
    28.
    发明授权

    公开(公告)号:US11494337B2

    公开(公告)日:2022-11-08

    申请号:US17508705

    申请日:2021-10-22

    Applicant: SNOWFLAKE INC.

    Abstract: A system and method for pruning data based on metadata. The method may include receiving a query with a plurality of predicates and identifying one or more applicable files that includes database data satisfying at least one of the plurality of predicates. The identifying the one or more applicable files including reading metadata stored in a metadata store that is separate from the database data. The method further includes pruning inapplicable files comprising database data that does not satisfy at least one of the plurality of predicates to create a reduced set of files and reading the reduced set of files to execute the query.

    RESOURCE PROVISIONING IN DATABASE SYSTEMS

    公开(公告)号:US20220269676A1

    公开(公告)日:2022-08-25

    申请号:US17663248

    申请日:2022-05-13

    Applicant: Snowflake Inc.

    Abstract: Resource provisioning systems and methods are described. In an embodiment, a system includes a plurality of shared storage devices collectively storing database data, an execution platform, and a compute service manager. The compute service manager is configured to determine a task to be executed in response to a trigger event and determine a query plan for executing the task, wherein the query plan comprises a plurality of discrete subtasks. The compute service manager is further configured to assign the plurality of discrete subtasks to one or more nodes of a plurality of nodes of the execution platform, determine whether execution of the task is complete, and in response to determining the execution of the task is complete, store a record in the plurality of shared storage devices indicating the task was completed.

Patent Agency Ranking