-
Publication No.: US11307894B1
Publication Date: 2022-04-19
Application No.: US16659798
Application Date: 2019-10-22
Applicant: Pure Storage, Inc.
Inventor: Ivan Jibaja, Stefan Dorsett, Prashant Jaikumar, Roy Kim, Curtis Pullen
Abstract: Executing a big data analytics pipeline in a storage system that includes compute resources and shared storage resources, including: receiving, from a data producer, a dataset; storing, within the storage system, the dataset; allocating processing resources to an analytics application; and executing the analytics application on the processing resources, including ingesting the dataset from the storage system.
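The four steps named in this abstract (receive, store, allocate, execute with ingestion from storage) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the patent: the `StorageSystem` class, its method names, the integer "compute units", and the `count_errors` application are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class StorageSystem:
    """Hypothetical storage system with shared storage and a pool of compute units."""
    total_compute: int
    datasets: dict = field(default_factory=dict)   # shared storage: name -> records
    allocated: dict = field(default_factory=dict)  # application name -> compute units

    def receive(self, name, records):
        # Receive a dataset from a data producer and store it in shared storage.
        self.datasets[name] = list(records)

    def allocate(self, app_name, units):
        # Allocate processing resources to an analytics application.
        free = self.total_compute - sum(self.allocated.values())
        if units > free:
            raise RuntimeError("not enough free compute units")
        self.allocated[app_name] = self.allocated.get(app_name, 0) + units

    def execute(self, app_name, app, dataset_name):
        # Run the application only if it holds resources; the application
        # ingests the dataset directly from shared storage.
        if self.allocated.get(app_name, 0) == 0:
            raise RuntimeError("no resources allocated to " + app_name)
        return app(self.datasets[dataset_name])


def count_errors(records):
    # Hypothetical analytics application: count records containing "ERROR".
    return sum(1 for r in records if "ERROR" in r)


system = StorageSystem(total_compute=4)
system.receive("logs", ["INFO start", "ERROR disk", "ERROR net"])
system.allocate("error-counter", units=2)
result = system.execute("error-counter", count_errors, "logs")
```

The sketch keeps storage and compute in one object only to mirror the abstract's wording that the storage system includes both; it says nothing about how the claimed system actually schedules work.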
-
Publication No.: US11263095B1
Publication Date: 2022-03-01
Application No.: US17010565
Application Date: 2020-09-02
Applicant: PURE STORAGE, INC.
Inventor: Ivan Jibaja, Curtis Pullen, Prashant Jaikumar, Stefan Dorsett, Gaurav Jain, Neil Vachharajani, Srinivas Chellappa
Abstract: Providing for high availability in a data analytics pipeline without replicas, including: creating a data analytics pipeline, wherein each component of the data analytics pipeline is deployed within a container; creating a failover container; detecting that a component within the data analytics pipeline has failed; and responsive to detecting that the component within the data analytics pipeline has failed, deploying the component within the data analytics pipeline that has failed in the failover container.
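The failover flow in this abstract can be sketched as follows: each component runs in its own container, one empty failover container is created up front, and a failed component is redeployed into it. This is only an illustrative sketch under stated assumptions; the `Container` and `Pipeline` classes, the `healthy` flag used for failure detection, and the component names are hypothetical and not drawn from the patent.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Container:
    """Hypothetical container record; `component` is None when the container is empty."""
    name: str
    component: Optional[str] = None
    healthy: bool = True


class Pipeline:
    def __init__(self, components):
        # Deploy each pipeline component within its own container.
        self.containers = [Container(f"c{i}", comp) for i, comp in enumerate(components)]
        # Create a single empty failover container (a spare, not a replica).
        self.failover = Container("failover")

    def detect_failed(self):
        # Assumed failure detection: a container marked unhealthy that holds a component.
        return [c for c in self.containers if c.component is not None and not c.healthy]

    def recover(self):
        # Redeploy a failed component into the failover container, if it is free.
        moved = []
        for failed in self.detect_failed():
            if self.failover.component is not None:
                break  # the spare is already in use
            self.failover.component = failed.component
            failed.component = None
            moved.append(self.failover.component)
        return moved


pipeline = Pipeline(["collector", "indexer", "search"])
pipeline.containers[1].healthy = False  # simulate a failure of the indexer
moved = pipeline.recover()
```

Because the spare holds no component until a failure occurs, no duplicate copy of any component runs in steady state, which is the sense of "without replicas" assumed in this sketch.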
-
Publication No.: US12008404B2
Publication Date: 2024-06-11
Application No.: US17721175
Application Date: 2022-04-14
Applicant: PURE STORAGE, INC.
Inventor: Ivan Jibaja, Prashant Jaikumar, Stefan Dorsett, Curtis Pullen, Roy Kim
CPC classification number: G06F9/5011, G06F9/4856, G06F9/505, G06F16/2272, G06F16/258
Abstract: Executing a big data analytics pipeline in a storage system that includes compute resources and shared storage resources, including: receiving, from a data producer, a dataset; storing, within the storage system, the dataset; allocating processing resources to an analytics application; and executing the analytics application on the processing resources, including ingesting the dataset from the storage system.
-
Publication No.: US11860820B1
Publication Date: 2024-01-02
Application No.: US16374175
Application Date: 2019-04-03
Applicant: PURE STORAGE, INC.
Inventor: Ivan Jibaja, Curtis Pullen, Stefan Dorsett, Srinivas Chellappa, Prashant Jaikumar
CPC classification number: G06F16/156, G06F9/45558, G06F16/1734, G06F16/1858, G06F2009/45595
Abstract: Processing data through a storage system in a data pipeline including receiving, by the storage system, a dataset from a collector on a data producer, wherein the dataset is disaggregated from metadata for the dataset by the collector; storing the dataset on the storage system; receiving, by the storage system from a data indexer, a request for data from the dataset, wherein the request for the data comprises the metadata gathered by the collector on the data producer; servicing, by the storage system, the request for the data by locating the data using the metadata gathered by the collector on the data producer and received in the request for the data; and receiving, from the data indexer, indexed data indexed using the metadata gathered by the collector on the data producer.
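The data flow described in this abstract can be sketched in a few lines: a collector separates the dataset from its metadata, the storage system stores the dataset, an indexer requests the data by presenting that metadata, and the resulting index is sent back to storage. The sketch below is an assumption-laden illustration, not the patented method; the `Storage` class, the `collect` and `index` functions, and the metadata fields (`host`, `path`, `offset`, `length`) are all hypothetical.

```python
class Storage:
    """Hypothetical storage system holding raw datasets and indexed data."""

    def __init__(self):
        self.blobs = {}    # path -> raw dataset bytes
        self.indexed = {}  # path -> index built by the indexer

    def store(self, path, data):
        self.blobs[path] = data

    def read(self, metadata):
        # Service a request by locating the data using the metadata in the request.
        blob = self.blobs[metadata["path"]]
        start = metadata["offset"]
        return blob[start:start + metadata["length"]]

    def store_index(self, path, entries):
        # Receive indexed data back from the indexer.
        self.indexed[path] = entries


def collect(host, path, text):
    # Collector on the data producer: keep the dataset and its metadata separate.
    data = text.encode("utf-8")
    metadata = {"host": host, "path": path, "offset": 0, "length": len(data)}
    return data, metadata


def index(storage, metadata):
    # Indexer: fetch the data using the collector's metadata, then build a
    # word -> host index tagged with that same metadata.
    text = storage.read(metadata).decode("utf-8")
    entries = {word: metadata["host"] for word in text.split()}
    storage.store_index(metadata["path"], entries)
    return entries


storage = Storage()
data, meta = collect("web-01", "/logs/app.log", "disk full")
storage.store(meta["path"], data)
entries = index(storage, meta)
```

In this sketch the indexer never has to rediscover where the data came from, since the collector's metadata travels with the request; that is the only property of the abstract the example tries to show.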
-
Publication No.: US11714728B2
Publication Date: 2023-08-01
Application No.: US17669076
Application Date: 2022-02-10
Applicant: PURE STORAGE, INC.
Inventor: Ivan Jibaja, Curtis Pullen, Prashant Jaikumar, Stefan Dorsett, Gaurav Jain, Neil Vachharajani, Srinivas Chellappa
CPC classification number: G06F11/2023, G06F11/2094, G06F2201/85
Abstract: Providing for high availability in a data analytics pipeline without replicas, including: creating a data analytics pipeline, wherein each component of the data analytics pipeline is deployed within a container; creating a failover container; detecting that a component within the data analytics pipeline has failed; and responsive to detecting that the component within the data analytics pipeline has failed, deploying the component within the data analytics pipeline that has failed in the failover container.
-
Publication No.: US10452444B1
Publication Date: 2019-10-22
Application No.: US15883333
Application Date: 2018-01-30
Applicant: Pure Storage, Inc.
Inventor: Ivan Jibaja, Stefan Dorsett, Prashant Jaikumar, Roy Kim, Curtis Pullen
Abstract: Executing a big data analytics pipeline in a storage system that includes compute resources and shared storage resources, including: receiving, from a data producer, a dataset; storing, within the storage system, the dataset; allocating processing resources to an analytics application; and executing the analytics application on the processing resources, including ingesting the dataset from the storage system.