-
公开(公告)号:US20240265010A1
公开(公告)日:2024-08-08
申请号:US18221735
申请日:2023-07-13
Applicant: Databricks, Inc.
Inventor: Saksham Garg , Bogdan Ionut Ghit , Christopher Stevens , Christian Stuart
IPC: G06F16/2453 , G06F16/25 , G06F16/28
CPC classification number: G06F16/24539 , G06F16/24542 , G06F16/256 , G06F16/285
Abstract: A multi-cluster computing system which includes a query result caching system is presented. The multi-cluster computing system may include a data processing service and client devices communicatively coupled over a network. The data processing service may include a control layer and a data layer. The control layer may be configured to receive and process requests from the client devices and manage resources in the data layer. The data layer may be configured to include instances of clusters of computing resources for executing jobs. The data layer may include a data storage system, which further includes a remote query result cache Store. The query result cache store may include a cloud storage query result cache which stores data associated with results of previously executed requests. As such, when a cluster encounters a previously executed request, the cluster may efficiently retrieve the cached result of the request from the in-memory query result cache or the cloud storage query result cache.
-
公开(公告)号:US20240394271A1
公开(公告)日:2024-11-28
申请号:US18614380
申请日:2024-03-22
Applicant: Databricks, Inc.
Inventor: Bogdan Ionut Ghit , Juliusz Sompolski , Shi Xin , Bart Samwel
IPC: G06F16/2458 , G06F11/34 , G06F16/242 , G06F16/25
Abstract: The system is configured to: 1) receive a client request; 2) determine executor(s) to generate a response to the user request; 3) provide each of the executor(s) with an indication; 4) receive for each indication a response including an output of either a cloud output or an in-line output to generate a group of in-line outputs and a group of cloud outputs; 5) determine whether the group of in-line outputs comprises all outputs; and 6) in response to the group of in-line outputs not comprising all the outputs for the client request: a) convert the group of in-line outputs to a converted group of cloud outputs; b) generate metadata for the converted group of cloud outputs and the group of cloud outputs; and c) provide response to the client request including the metadata for the converted group of cloud outputs and the group of cloud outputs.
-
公开(公告)号:US11960494B1
公开(公告)日:2024-04-16
申请号:US17841946
申请日:2022-06-16
Applicant: Databricks, Inc.
Inventor: Bogdan Ionut Ghit , Juliusz Sompolski , Shi Xin , Bart Samwel
IPC: G06F16/2458 , G06F11/34 , G06F16/242 , G06F16/25
CPC classification number: G06F16/2471 , G06F11/3419 , G06F16/244 , G06F16/256
Abstract: The system is configured to: 1) receive a client request; 2) determine executor(s) to generate a response to the user request; 3) provide each of the executor(s) with an indication; 4) receive for each indication a response including an output of either a cloud output or an in-line output to generate a group of in-line outputs and a group of cloud outputs; 5) determine whether the group of in-line outputs comprises all outputs; and 6) in response to the group of in-line outputs not comprising all the outputs for the client request: a) convert the group of in-line outputs to a converted group of cloud outputs; b) generate metadata for the converted group of cloud outputs and the group of cloud outputs; and c) provide response to the client request including the metadata for the converted group of cloud outputs and the group of cloud outputs.
-
公开(公告)号:US12189625B2
公开(公告)日:2025-01-07
申请号:US18222343
申请日:2023-07-14
Applicant: Databricks, Inc.
Inventor: Bogdan Ionut Ghit , Saksham Garg , Christian Stuart , Christopher Stevens
IPC: G06F16/24 , G06F16/2453 , G06F16/25 , G06F16/28
Abstract: A multi-cluster computing system which includes a query result caching system is presented. The multi-cluster computing system may include a data processing service and client devices communicatively coupled over a network. The data processing service may include a control layer and a data layer. The control layer may be configured to receive and process requests from the client devices and manage resources in the data layer. The data layer may be configured to include instances of clusters of computing resources for executing jobs. The data layer may include a data storage system, which further includes a remote query result cache Store. The query result cache store may include a cloud storage query result cache which stores data associated with results of previously executed requests. As such, when a cluster encounters a previously executed request, the cluster may efficiently retrieve the cached result of the request from the in-memory query result cache or the cloud storage query result cache.
-
公开(公告)号:US20240265011A1
公开(公告)日:2024-08-08
申请号:US18222343
申请日:2023-07-14
Applicant: Databricks, Inc.
Inventor: Saksham Garg , Bogdan Ionut Ghit , Christopher Stevens , Christian Stuart
IPC: G06F16/2453
CPC classification number: G06F16/24539
Abstract: A multi-cluster computing system which includes a query result caching system is presented. The multi-cluster computing system may include a data processing service and client devices communicatively coupled over a network. The data processing service may include a control layer and a data layer. The control layer may be configured to receive and process requests from the client devices and manage resources in the data layer. The data layer may be configured to include instances of clusters of computing resources for executing jobs. The data layer may include a data storage system, which further includes a remote query result cache Store. The query result cache store may include a cloud storage query result cache which stores data associated with results of previously executed requests. As such, when a cluster encounters a previously executed request, the cluster may efficiently retrieve the cached result of the request from the in-memory query result cache or the cloud storage query result cache.
-
-
-
-