-
Publication No.: US12026532B2
Publication date: 2024-07-02
Application No.: US18229450
Filing date: 2023-08-02
Applicant: Google LLC
Inventor: Craig D. Chambers , Ashish Raniwala , Frances J. Perry , Stephen R. Adams , Robert R. Henry , Robert Bradshaw , Nathan Weizenbaum
IPC: G06F9/46 , G06F8/30 , G06F8/34 , G06F8/41 , G06F9/38 , G06F9/44 , G06F9/445 , G06F9/455 , G06F9/48 , G06F16/25 , G06F21/57 , G06F21/62 , G06F9/30 , G06F9/448 , G06F16/2453
CPC classification number: G06F9/45504 , G06F8/314 , G06F8/34 , G06F8/433 , G06F9/38 , G06F9/3851 , G06F9/3885 , G06F9/44 , G06F9/445 , G06F9/45533 , G06F9/4843 , G06F21/577 , G06F21/62 , G06F21/6218 , G06F9/30 , G06F9/4494 , G06F16/24532 , G06F16/24547 , G06F2221/034
Abstract: A data parallel pipeline may specify multiple parallel data objects that contain multiple elements and multiple parallel operations that operate on the parallel data objects. Based on the data parallel pipeline, a dataflow graph of deferred parallel data objects and deferred parallel operations corresponding to the data parallel pipeline may be generated and one or more graph transformations may be applied to the dataflow graph to generate a revised dataflow graph that includes one or more of the deferred parallel data objects and deferred, combined parallel data operations. The deferred, combined parallel operations may be executed to produce materialized parallel data objects corresponding to the deferred parallel data objects.
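Illustration (not from the patent text): a minimal Python sketch of the deferred-evaluation idea in the abstract above. The names Deferred, parallel_do, fuse, and materialize are hypothetical, and the only graph transformation shown is the fusion of a linear chain of element-wise operations; fan-out, grouping, and other transformations are not covered.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Deferred:
    """A deferred parallel data object: one node of the dataflow graph (hypothetical)."""
    source: Optional[list] = None        # set only on input nodes
    parent: Optional["Deferred"] = None  # upstream deferred object, if any
    fns: tuple = ()                      # element-wise operations applied to the parent

    def parallel_do(self, fn):
        # Record the operation in the graph; nothing is executed yet.
        return Deferred(parent=self, fns=(fn,))


def fuse(node):
    """Graph transformation: collapse a chain of element-wise ops into one combined op."""
    if node.parent is None:
        return node  # input node, nothing to fuse
    parent = fuse(node.parent)
    if parent.parent is not None:
        # Parent is itself an operation node: merge its functions with ours.
        return Deferred(parent=parent.parent, fns=parent.fns + node.fns)
    return Deferred(parent=parent, fns=node.fns)


def materialize(node):
    """Execute the (possibly combined) operations to produce concrete data."""
    if node.parent is None:
        return list(node.source)
    out = []
    for x in materialize(node.parent):
        for fn in node.fns:
            x = fn(x)
        out.append(x)
    return out
```

In this sketch, two chained parallel_do calls become a single node with two functions after fuse, so the data is traversed once when materialize runs.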
-
Publication No.: US20230376332A1
Publication date: 2023-11-23
Application No.: US18229450
Filing date: 2023-08-02
Applicant: Google LLC
Inventor: Craig D. Chambers , Ashish Raniwala , Frances J. Perry , Stephen R. Adams , Robert R. Henry , Robert Bradshaw , Nathan Weizenbaum
IPC: G06F9/455 , G06F21/62 , G06F9/38 , G06F8/41 , G06F9/48 , G06F8/30 , G06F8/34 , G06F9/44 , G06F9/445 , G06F21/57
CPC classification number: G06F9/45504 , G06F21/62 , G06F9/38 , G06F9/3851 , G06F9/3885 , G06F8/433 , G06F9/4843 , G06F8/314 , G06F8/34 , G06F9/44 , G06F9/45533 , G06F9/445 , G06F21/577 , G06F21/6218 , G06F16/24532
Abstract: A data parallel pipeline may specify multiple parallel data objects that contain multiple elements and multiple parallel operations that operate on the parallel data objects. Based on the data parallel pipeline, a dataflow graph of deferred parallel data objects and deferred parallel operations corresponding to the data parallel pipeline may be generated and one or more graph transformations may be applied to the dataflow graph to generate a revised dataflow graph that includes one or more of the deferred parallel data objects and deferred, combined parallel data operations. The deferred, combined parallel operations may be executed to produce materialized parallel data objects corresponding to the deferred parallel data objects.
-
Publication No.: US10025620B2
Publication date: 2018-07-17
Application No.: US14989867
Filing date: 2016-01-07
Applicant: Google LLC
Inventor: Robert Bradshaw , Craig D. Chambers , Ezra Cooper , Ashish Raniwala , Frances J. Perry
Abstract: One example method includes identifying synchronous code including instructions specifying a computing operation to be performed on a set of data; transforming the synchronous code into a pipeline application including one or more pipeline objects; identifying a first input data set on which to execute the pipeline application; executing the pipeline application on a first input data set to produce a first output data set; after executing the pipeline application on the first input data set, identifying a second input data set on which to execute the pipeline application; determining a set of differences between the first input data set and second input data set; and executing the pipeline application on the set of differences to produce a second output data set.
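Illustration (not from the patent text): a minimal Python sketch of re-running a pipeline only on the differences between two input data sets. It assumes the pipeline is element-wise, so each output entry depends on exactly one input record; run_pipeline and run_incremental are hypothetical names, and the record-length computation is a stand-in for real work.

```python
def run_pipeline(records):
    """Hypothetical element-wise pipeline: maps each record to its length."""
    return {r: len(r) for r in records}


def run_incremental(first_input, first_output, second_input):
    """Produce the output for second_input by processing only the differences."""
    added = set(second_input) - set(first_input)
    removed = set(first_input) - set(second_input)
    # Reuse earlier results for records that are still present.
    second_output = {k: v for k, v in first_output.items() if k not in removed}
    # Run the pipeline only on records that were not seen before.
    second_output.update(run_pipeline(added))
    return second_output
```

Under the element-wise assumption, the incremental result matches a full re-run on the second input; pipelines with aggregations would need a more careful treatment of the differences.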
-
Publication No.: US20180314545A1
Publication date: 2018-11-01
Application No.: US16030183
Filing date: 2018-07-09
Applicant: Google LLC
Inventor: Robert Bradshaw , Craig D. Chambers , Ezra Cooper , Ashish Raniwala , Frances J. Perry
CPC classification number: G06F9/466 , G06F8/453 , G06F9/3867 , G06F9/3885 , G06F9/455 , G06F9/5066 , G06F11/3006 , G06F17/30463 , G06F17/30569
Abstract: One example method includes identifying synchronous code including instructions specifying a computing operation to be performed on a set of data; transforming the synchronous code into a pipeline application including one or more pipeline objects; identifying a first input data set on which to execute the pipeline application; executing the pipeline application on a first input data set to produce a first output data set; after executing the pipeline application on the first input data set, identifying a second input data set on which to execute the pipeline application; determining a set of differences between the first input data set and second input data set; and executing the pipeline application on the set of differences to produce a second output data set.
-
Publication No.: US20240338235A1
Publication date: 2024-10-10
Application No.: US18748889
Filing date: 2024-06-20
Applicant: Google LLC
Inventor: Craig D. Chambers , Ashish Raniwala , Frances J. Perry , Stephen R. Adams , Robert R. Henry , Robert Bradshaw , Nathan Weizenbaum
IPC: G06F9/455 , G06F8/30 , G06F8/34 , G06F8/41 , G06F9/30 , G06F9/38 , G06F9/44 , G06F9/445 , G06F9/448 , G06F9/48 , G06F16/2453 , G06F21/57 , G06F21/62
CPC classification number: G06F9/45504 , G06F8/314 , G06F8/34 , G06F8/433 , G06F9/38 , G06F9/3851 , G06F9/3885 , G06F9/44 , G06F9/445 , G06F9/45533 , G06F9/4843 , G06F21/577 , G06F21/62 , G06F21/6218 , G06F9/30 , G06F9/4494 , G06F16/24532 , G06F16/24547 , G06F2221/034
Abstract: A data parallel pipeline may specify multiple parallel data objects that contain multiple elements and multiple parallel operations that operate on the parallel data objects. Based on the data parallel pipeline, a dataflow graph of deferred parallel data objects and deferred parallel operations corresponding to the data parallel pipeline may be generated and one or more graph transformations may be applied to the dataflow graph to generate a revised dataflow graph that includes one or more of the deferred parallel data objects and deferred, combined parallel data operations. The deferred, combined parallel operations may be executed to produce materialized parallel data objects corresponding to the deferred parallel data objects.
-
Publication No.: US10558501B2
Publication date: 2020-02-11
Application No.: US15395225
Filing date: 2016-12-30
Applicant: Google LLC
Inventor: Robert Bradshaw , Rafael de Jesús Fernández Moctezuma , Daniel Mills , Samuel Green McVeety , Samuel Carl Whittle , Andrei Maksimenka , Cosmin Ionel Arad , Mark Brian Shields , Harris Samuel Nover , Manuel Alfred Fahndrich , Jeffrey Paul Gardner , Mikhail Shmulyan , Reuven Lax , Ahmet Altay , Craig D. Chambers
Abstract: A job that receives as input a data stream is executed. For the job, it is iteratively determined a backlog growth over a first period of time; a backlog volume; and whether to adjust a quantity of processing resources. For each iteration for which the quantity of processing resources allocated to the job are determined to be adjusted, adjusting the quantity of processing resources allocated to the job. For each iteration for which the quantity of processing resources allocated to the job are determined not to be adjusted, maintaining the quantity of processing resources allocated to the job.
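Illustration (not from the patent text): a minimal Python sketch of one iteration of the decision described in the abstract above, taking backlog growth and backlog volume as inputs and returning either an adjusted or an unchanged worker count. The function name, the thresholds, and the one-worker step size are all assumptions made for the example.

```python
def next_worker_count(workers, backlog_growth, backlog_volume,
                      high_volume=1000.0, low_volume=100.0,
                      min_workers=1, max_workers=64):
    """Decide the worker count for the next iteration (hypothetical policy).

    backlog_growth: change in backlog over the measurement window
                    (positive means the job is falling behind).
    backlog_volume: amount of unprocessed input currently queued.
    """
    if backlog_growth > 0 or backlog_volume > high_volume:
        # Falling behind or too much queued work: add a worker, up to the cap.
        return min(workers + 1, max_workers)
    if backlog_growth <= 0 and backlog_volume < low_volume:
        # Keeping up with little queued work: release a worker, down to the floor.
        return max(workers - 1, min_workers)
    # Otherwise maintain the current allocation.
    return workers
```

A scheduler would call such a function once per iteration with fresh measurements; a production policy would likely add smoothing to avoid oscillating between counts.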
-
Publication No.: US10133592B2
Publication date: 2018-11-20
Application No.: US15483044
Filing date: 2017-04-10
Applicant: Google LLC
Inventor: Craig D. Chambers , Ashish Raniwala , Frances J. Perry , Stephen R. Adams , Robert R. Henry , Robert Bradshaw , Nathan Weizenbaum
IPC: G06F9/44 , G06F9/45 , G06F17/30 , G06F9/455 , G06F21/62 , G06F9/38 , G06F8/41 , G06F9/48 , G06F8/30 , G06F8/34 , G06F9/445 , G06F21/57 , G06F9/30 , G06F9/448
Abstract: A data parallel pipeline may specify multiple parallel data objects that contain multiple elements and multiple parallel operations that operate on the parallel data objects. Based on the data parallel pipeline, a dataflow graph of deferred parallel data objects and deferred parallel operations corresponding to the data parallel pipeline may be generated and one or more graph transformations may be applied to the dataflow graph to generate a revised dataflow graph that includes one or more of the deferred parallel data objects and deferred, combined parallel data operations. The deferred, combined parallel operations may be executed to produce materialized parallel data objects corresponding to the deferred parallel data objects.
-
Publication No.: US11755351B2
Publication date: 2023-09-12
Application No.: US17834256
Filing date: 2022-06-07
Applicant: Google LLC
Inventor: Craig D. Chambers , Ashish Raniwala , Frances J. Perry , Stephen R. Adams , Robert R. Henry , Robert Bradshaw , Nathan Weizenbaum
IPC: G06F9/38 , G06F16/2453 , G06F21/57 , G06F9/445 , G06F9/455 , G06F21/62 , G06F8/41 , G06F9/48 , G06F8/30 , G06F8/34 , G06F9/44 , G06F9/448 , G06F9/30
CPC classification number: G06F9/45504 , G06F8/314 , G06F8/34 , G06F8/433 , G06F9/38 , G06F9/3851 , G06F9/3885 , G06F9/44 , G06F9/445 , G06F9/45533 , G06F9/4843 , G06F21/577 , G06F21/62 , G06F21/6218 , G06F9/30 , G06F9/4494 , G06F16/24532 , G06F16/24547 , G06F2221/034
Abstract: A data parallel pipeline may specify multiple parallel data objects that contain multiple elements and multiple parallel operations that operate on the parallel data objects. Based on the data parallel pipeline, a dataflow graph of deferred parallel data objects and deferred parallel operations corresponding to the data parallel pipeline may be generated and one or more graph transformations may be applied to the dataflow graph to generate a revised dataflow graph that includes one or more of the deferred parallel data objects and deferred, combined parallel data operations. The deferred, combined parallel operations may be executed to produce materialized parallel data objects corresponding to the deferred parallel data objects.
-
Publication No.: US11726893B2
Publication date: 2023-08-15
Application No.: US17465397
Filing date: 2021-09-02
Applicant: Google LLC
Inventor: Mariana Sharp , Craig D. Chambers , Jelena Pjesivac-Grbovic , Kym Hines , Sunghwan Ihm , Danijela Mijailovic , Ben Walters
CPC classification number: G06F11/3409 , G06F9/46 , G06F11/3006 , G06F9/44505
Abstract: Aspects of the disclosure relate to evaluating the rollout of a change to a data processing framework (DPF). A first set of jobs is selected out of a plurality of jobs. The first set of jobs is processed by the DPF with the proposed change enabled and the other jobs of the plurality of jobs by the DPF without the change enabled. A performance cost value is determined for each of the jobs in the first set of jobs processed by the DPF with the proposed change enabled and for each of the other jobs of the plurality of jobs processed by the DPF with the proposed change disabled. Based on the performance cost values of the jobs in the first set of jobs and the performance cost values of the other jobs of the plurality of jobs, a determination is made whether the proposed change is beneficial to the DPF.
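Illustration (not from the patent text): a minimal Python sketch of the rollout evaluation described in the abstract above. A first set of jobs is run with the change enabled, the remaining jobs without it, and the per-job performance cost values are compared. The abstract does not say how the costs are compared, so the mean comparison, the random selection, and the names evaluate_change and cost_fn are assumptions.

```python
import random
from statistics import mean


def evaluate_change(jobs, cost_fn, fraction=0.5, seed=0):
    """Return True if the change looks beneficial (lower mean cost when enabled).

    cost_fn(job, enabled) is a hypothetical callback that runs the job with the
    change enabled or disabled and returns its performance cost value.
    """
    if len(jobs) < 2:
        raise ValueError("need at least two jobs to form both groups")
    shuffled = list(jobs)
    random.Random(seed).shuffle(shuffled)
    # Keep at least one job in each group.
    k = min(max(1, int(len(shuffled) * fraction)), len(shuffled) - 1)
    first_set, others = shuffled[:k], shuffled[k:]
    enabled_costs = [cost_fn(job, True) for job in first_set]
    disabled_costs = [cost_fn(job, False) for job in others]
    return mean(enabled_costs) < mean(disabled_costs)
```

A real evaluation would probably use a statistical test rather than a bare comparison of means, since per-job costs can vary widely.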
-
Publication No.: US20200225991A1
Publication date: 2020-07-16
Application No.: US16781467
Filing date: 2020-02-04
Applicant: Google LLC
Inventor: Robert Bradshaw , Rafael de Jesús Fernández Moctezuma , Daniel Mills , Samuel Green McVeety , Samuel Carl Whittle , Andrei Maksimenka , Cosmin Ionel Arad , Mark Brian Shields , Harris Samuel Nover , Manuel Alfred Fahndrich , Jeffrey Paul Gardner , Mikhail Shmulyan , Reuven Lax , Ahmet Altay , Craig D. Chambers
IPC: G06F9/50
Abstract: A job that receives as input a data stream is executed. For the job, it is iteratively determined a backlog growth over a first period of time; a backlog volume; and whether to adjust a quantity of processing resources. For each iteration for which the quantity of processing resources allocated to the job are determined to be adjusted, adjusting the quantity of processing resources allocated to the job. For each iteration for which the quantity of processing resources allocated to the job are determined not to be adjusted, maintaining the quantity of processing resources allocated to the job.