-
公开(公告)号:US11210140B1
公开(公告)日:2021-12-28
申请号:US16888135
申请日:2020-05-29
Applicant: PURE STORAGE, INC.
Inventor: Brian Gold , Emily Potyraj , Ivan Jibaja , Igor Ostrovsky , Roy Kim
IPC: G06N99/00 , G06F9/50 , G06F3/06 , G06N20/00 , G06F16/245 , G06F9/48 , G06N3/063 , G06N3/08 , G06T1/20 , G06T1/60 , G06F16/958 , G06F16/248
Abstract: Data transformation offloading in an artificial intelligence infrastructure that includes one or more storage systems and one or more graphical processing unit (‘GPU’) servers, including: storing, within the storage system, a dataset; identifying, in dependence upon one or more machine learning models to be executed on the GPU servers, one or more transformations to apply to the dataset; and generating, by the storage system in dependence upon the one or more transformations, a transformed dataset.
-
公开(公告)号:US10275285B1
公开(公告)日:2019-04-30
申请号:US16046337
申请日:2018-07-26
Applicant: PURE STORAGE, INC.
Inventor: Brian Gold , Emily Watkins , Ivan Jibaja , Igor Ostrovsky , Roy Kim
Abstract: Data transformation caching in an artificial intelligence infrastructure that includes one or more storage systems and one or more graphical processing unit (‘GPU’) servers, including: identifying, in dependence upon one or more machine learning models to be executed on the GPU servers, one or more transformations to apply to a dataset; generating, in dependence upon the one or more transformations, a transformed dataset; storing, within one or more of the storage systems, the transformed dataset; receiving a plurality of requests to transmit the transformed dataset to one or more of the GPU servers; and responsive to each request, transmitting, from the one or more storage systems to the one or more GPU servers without re-performing the one or more transformations on the dataset, the transformed dataset.
-
公开(公告)号:US11803338B2
公开(公告)日:2023-10-31
申请号:US17538262
申请日:2021-11-30
Applicant: PURE STORAGE, INC.
Inventor: Brian Gold , Emily Potyraj , Ivan Jibaja , Igor Ostrovsky , Roy Kim
IPC: G06F3/06 , G06N20/00 , G06F16/245 , G06F16/178 , G06Q30/0242 , G06F9/48 , G06F9/50 , G06N3/063 , G06N3/08 , G06T1/20 , G06T1/60 , G06F16/958 , G06F16/248
CPC classification number: G06F3/0679 , G06F3/0604 , G06F3/067 , G06F3/0608 , G06F3/0646 , G06F3/0649 , G06F9/4881 , G06F9/5027 , G06F16/1794 , G06F16/245 , G06N3/063 , G06N3/08 , G06N20/00 , G06Q30/0243 , G06T1/20 , G06T1/60 , G06F16/248 , G06F16/972 , G06T2200/28
Abstract: Executing a machine learning model in an artificial intelligence infrastructure that includes one or more storage systems and one or more graphical processing unit (‘GPU’) servers, including: receiving, by a graphical processing unit (‘GPU’) server, a dataset transformed by a storage system that is external to the GPU server; and executing, by the GPU server, one or more machine learning algorithms using the transformed dataset as input.
-
公开(公告)号:US10452444B1
公开(公告)日:2019-10-22
申请号:US15883333
申请日:2018-01-30
Applicant: Pure Storage, Inc.
Inventor: Ivan Jibaja , Stefan Dorsett , Prashant Jaikumar , Roy Kim , Curtis Pullen
Abstract: Executing a big data analytics pipeline in a storage system that includes compute resources and shared storage resources, including: receiving, from a data producer, a dataset; storing, within the storage system, the dataset; allocating processing resources to an analytics application; and executing the analytics application on the processing resources, including ingesting the dataset from the storage system.
-
公开(公告)号:US10360214B2
公开(公告)日:2019-07-23
申请号:US16045814
申请日:2018-07-26
Applicant: PURE STORAGE, INC.
Inventor: Brian Gold , Emily Watkins , Ivan Jibaja , Igor Ostrovsky , Roy Kim
Abstract: Ensuring reproducibility in an artificial intelligence infrastructure that includes one or more storage systems and one or more graphical processing unit (‘GPU’) servers, including: identifying, by a unified management plane, one or more transformations applied to a dataset by the artificial intelligence infrastructure, wherein applying the one or more transformations to the dataset causes the artificial intelligence infrastructure to generate a transformed dataset; storing, within the one or more storage systems, information describing the dataset, the one or more transformations applied to the dataset, and the transformed dataset; identifying, by the unified management plane, one or more machine learning models executed by the artificial intelligence infrastructure using the transformed dataset as input; and storing, within the one or more storage systems, information describing one or more machine learning models executed using the transformed dataset as input.
-
公开(公告)号:US11307894B1
公开(公告)日:2022-04-19
申请号:US16659798
申请日:2019-10-22
Applicant: Pure Storage, Inc.
Inventor: Ivan Jibaja , Stefan Dorsett , Prashant Jaikumar , Roy Kim , Curtis Pullen
Abstract: Executing a big data analytics pipeline in a storage system that includes compute resources and shared storage resources, including: receiving, from a data producer, a dataset; storing, within the storage system, the dataset; allocating processing resources to an analytics application; and executing the analytics application on the processing resources, including ingesting the dataset from the storage system.
-
公开(公告)号:US10671434B1
公开(公告)日:2020-06-02
申请号:US16040846
申请日:2018-07-20
Applicant: PURE STORAGE, INC.
Inventor: Brian Gold , Emily Watkins , Ivan Jibaja , Igor Ostrovsky , Roy Kim
Abstract: Data transformation offloading in an artificial intelligence infrastructure that includes one or more storage systems and one or more graphical processing unit (‘GPU’) servers, including: storing, within the storage system, a dataset; identifying, in dependence upon one or more machine learning models to be executed on the GPU servers, one or more transformations to apply to the dataset; and generating, by the storage system in dependence upon the one or more transformations, a transformed dataset.
-
公开(公告)号:US11768636B2
公开(公告)日:2023-09-26
申请号:US18146807
申请日:2022-12-27
Applicant: PURE STORAGE, INC.
Inventor: Brian Gold , Emily Watkins , Ivan Jibaja , Igor Ostrovsky , Roy Kim
IPC: H04L67/12 , G06F3/06 , G06N20/00 , G06F16/245 , G06F16/178 , G06Q30/0242 , G06F9/48 , G06F9/50 , G06N3/063 , G06N3/08 , G06T1/20 , G06T1/60 , G06F16/958 , G06F16/248
CPC classification number: G06F3/0679 , G06F3/0604 , G06F3/067 , G06F3/0608 , G06F3/0646 , G06F3/0649 , G06F9/4881 , G06F9/5027 , G06F16/1794 , G06F16/245 , G06N3/063 , G06N3/08 , G06N20/00 , G06Q30/0243 , G06T1/20 , G06T1/60 , G06F16/248 , G06F16/972 , G06T2200/28
Abstract: Generating a transformed dataset for use by a machine learning model in an artificial intelligence infrastructure that includes one or more storage systems and one or more graphical processing unit (‘GPU’) servers, including: storing, within one or more storage systems, a transformed dataset generated by applying one or more transformations to a dataset that are identified based on one or more expected input formats of data received as input data by one or more machine learning models to be executed on one or more servers; and transmitting, from the one or more storage systems to the one or more servers without reapplying the one or more transformations on the dataset, the transformed dataset including data in the one or more expected formats of data to be received as input data by the one or more machine learning models.
-
公开(公告)号:US11556280B2
公开(公告)日:2023-01-17
申请号:US16888402
申请日:2020-05-29
Applicant: PURE STORAGE, INC.
Inventor: Brian Gold , Emily Watkins , Ivan Jibaja , Igor Ostrovsky , Roy Kim
IPC: G06F3/06 , G06N20/00 , G06F16/245 , G06F16/178 , G06Q30/02 , G06F9/48 , G06F9/50 , G06N3/063 , G06N3/08 , G06T1/20 , G06T1/60 , G06F16/958 , G06F16/248
Abstract: Data transformation caching in an artificial intelligence infrastructure that includes one or more storage systems and one or more graphical processing unit (‘GPU’) servers, including: identifying, in dependence upon one or more machine learning models to be executed on the GPU servers, one or more transformations to apply to a dataset; generating, in dependence upon the one or more transformations, a transformed dataset; storing, within one or more of the storage systems, the transformed dataset; receiving a plurality of requests to transmit the transformed dataset to one or more of the GPU servers; and responsive to each request, transmitting, from the one or more storage systems to the one or more GPU servers without re-performing the one or more transformations on the dataset, the transformed dataset.
-
公开(公告)号:US12008404B2
公开(公告)日:2024-06-11
申请号:US17721175
申请日:2022-04-14
Applicant: PURE STORAGE, INC.
Inventor: Ivan Jibaja , Prashant Jaikumar , Stefan Dorsett , Curtis Pullen , Roy Kim
CPC classification number: G06F9/5011 , G06F9/4856 , G06F9/505 , G06F16/2272 , G06F16/258
Abstract: Executing a big data analytics pipeline in a storage system that includes compute resources and shared storage resources, including: receiving, from a data producer, a dataset; storing, within the storage system, the dataset; allocating processing resources to an analytics application; and executing the analytics application on the processing resources, including ingesting the dataset from the storage system.
-
-
-
-
-
-
-
-
-