Selecting between hydration-based scanning and stateless scale-out scanning to improve query performance

    Publication No.: US11593367B1

    Publication date: 2023-02-28

    Application No.: US17489532

    Filing date: 2021-09-29

    Abstract: When a query is received by a stateful data processing service, the service determines, for each table scan (and associated operations) of a query, whether to select the table scan for execution by a stateless data processing service. The selected table scans are sent to the stateless data processing service for execution, and results are received by the stateful data processing service. The stateful data processing service may also execute other table scans of the query locally, against a local data cache. If the data is not present in the local data cache, then the stateful data processing service will copy the table data into the local data cache before executing the table scan. A query result based on the remote and/or local table scans may then be returned to the client.
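The per-scan routing described in this abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the class `StatefulService`, the `select_for_remote` policy callback, the in-memory `remote_store`, and the `log` list are all hypothetical names introduced here, and the abstract does not say how a scan is selected.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple

Row = dict
Predicate = Callable[[Row], bool]


@dataclass
class StatefulService:
    # Hypothetical stand-in for the backing store that both services can read.
    remote_store: Dict[str, List[Row]]
    # Hypothetical policy: True means "send this table scan to the stateless service".
    select_for_remote: Callable[[str], bool]
    local_cache: Dict[str, List[Row]] = field(default_factory=dict)
    log: List[str] = field(default_factory=list)

    def _scan_stateless(self, table: str, predicate: Predicate) -> List[Row]:
        # Stateless scale-out scan: reads the store directly, keeps nothing locally.
        self.log.append(f"remote:{table}")
        return [r for r in self.remote_store[table] if predicate(r)]

    def _scan_local(self, table: str, predicate: Predicate) -> List[Row]:
        # Hydration-based scan: copy the table into the local cache on a miss,
        # then execute the scan against the cached copy.
        if table not in self.local_cache:
            self.log.append(f"hydrate:{table}")
            self.local_cache[table] = list(self.remote_store[table])
        self.log.append(f"local:{table}")
        return [r for r in self.local_cache[table] if predicate(r)]

    def run_query(self, scans: List[Tuple[str, Predicate]]) -> Dict[str, List[Row]]:
        # Decide per table scan, then collect remote and local results together.
        results: Dict[str, List[Row]] = {}
        for table, predicate in scans:
            if self.select_for_remote(table):
                results[table] = self._scan_stateless(table, predicate)
            else:
                results[table] = self._scan_local(table, predicate)
        return results
```

In this sketch a second query against a table that was scanned locally reuses the cached copy, so hydration is paid only once per table.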

    Selectively assigning join operations for remote data processing

    Publication No.: US11216457B1

    Publication date: 2022-01-04

    Application No.: US15389318

    Filing date: 2016-12-22

    Abstract: Join operations may be selectively assigned from local data processing to remote data processing. Aggregation queries directed to distributed data sets may have query plans generated that include join operations. Some join operations may be executed locally while others may be assigned for remote execution at a remote data processing engine. Join operations may be identified and evaluated for remote execution according to a processing constraint for the remote data processing engine. For those join operations that satisfy the processing constraint for the remote data processing engine, the query plan may be modified to assign the join operation for remote execution. A result for the aggregation query may be generated according to the modified query plan and the result sent in response to the aggregation query.
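The plan-modification step in this abstract can be illustrated with a short sketch. It assumes, purely for illustration, that the processing constraint is an upper bound on an estimated row count; the abstract does not specify the constraint, and `JoinOp`, `estimated_build_rows`, and `assign_remote_joins` are hypothetical names.

```python
from dataclasses import dataclass, replace
from typing import List


@dataclass(frozen=True)
class JoinOp:
    name: str
    estimated_build_rows: int  # hypothetical cost estimate attached by the planner
    location: str = "local"    # every join starts out assigned to local execution


def assign_remote_joins(plan: List[JoinOp], max_remote_rows: int) -> List[JoinOp]:
    """Return a modified plan in which joins that satisfy the (assumed)
    constraint are assigned to the remote engine; the rest stay local."""
    return [
        replace(op, location="remote")
        if op.estimated_build_rows <= max_remote_rows
        else op
        for op in plan
    ]
```

Returning a new list of frozen operations, rather than mutating the plan in place, keeps the original plan available if the remote assignment has to be abandoned.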

    Executing queries for structured data and not-structured data

    Publication No.: US10713247B2

    Publication date: 2020-07-14

    Application No.: US15476273

    Filing date: 2017-03-31

    Abstract: Queries received at a query engine may be executed for structured data and not-structured data. A query execution plan may be generated for the query that includes stateless operations to apply the query to the not-structured data at remote query processing engines. The remote query processing engines may perform the stateless operations and return results to the query engine. The query engine may generate a result for the query based on the results received from the remote query processing engines as well as results determined as part of applying the query to structured data. The result to the query may be returned.
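The split between stateless remote operations and local processing described in this abstract might look like the sketch below. It assumes the not-structured data is a set of raw text objects with `user,amount` lines and that the query is a filtered sum; both are illustrative choices, and `remote_engine` and `query_engine_total` are hypothetical names.

```python
from typing import Dict, Iterable, List


def remote_engine(raw_lines: Iterable[str], min_amount: int) -> List[Dict]:
    # Stateless operation: parse and filter one raw object. Nothing is kept
    # between calls, so any remote engine could handle any object.
    rows = []
    for line in raw_lines:
        user, amount_text = line.strip().split(",")
        amount = int(amount_text)
        if amount >= min_amount:
            rows.append({"user": user, "amount": amount})
    return rows


def query_engine_total(
    structured_rows: List[Dict],
    raw_objects: List[List[str]],
    min_amount: int,
) -> int:
    # Apply the query to the structured data locally.
    local = [r for r in structured_rows if r["amount"] >= min_amount]
    # Apply the same query to each not-structured object via the remote engines.
    remote = [row for obj in raw_objects for row in remote_engine(obj, min_amount)]
    # Combine local and remote results into a single query result.
    return sum(r["amount"] for r in local + remote)
```

Because each call to `remote_engine` depends only on its arguments, the objects could be processed in any order or in parallel without changing the combined result.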
