System and method for executing data processing tasks using resilient distributed datasets (RDDs) in a storage device

Invention Grant

US10176092B2 System and method for executing data processing tasks using resilient distributed datasets (RDDs) in a storage device 有权

Please log in to see more content

Patent Title: System and method for executing data processing tasks using resilient distributed datasets (RDDs) in a storage device
Application No.: US15710722

Application Date: 2017-09-20
Publication No.: US10176092B2

Publication Date: 2019-01-08
Inventor: Joao Alcantara , Vladimir Alves , Ricardo Cassia , Vincent Lazo
Applicant: NGD Systems, Inc.
Applicant Address: US CA Irvine
Assignee: NGD Systems, Inc.
Current Assignee: NGD Systems, Inc.
Current Assignee Address: US CA Irvine
Agency: Lewis Roca Rothgerber Christie LLP
Main IPC: G06F12/02
IPC: G06F12/02 ; G06F17/30 ; G06F9/50

System and method for executing data processing tasks using resilient distributed datasets (RDDs) in a storage device

Abstract:

A system and method of providing enhanced data processing and analysis in an infrastructure for distributed computing and large-scale data processing. This infrastructure uses the Apache Spark framework to divide an application into a large number of small fragments of work, each of which may be performed on one of a large number of compute nodes. The work may involve Spark transformations, operations, and actions, which may be used to categorize and analyze large amounts of data in distributed systems. This infrastructure includes a cluster with a driver node and a plurality of worker nodes. The worker nodes may be, or may include, intelligent solid state drives capable of executing data processing functions under the Apache Spark framework. The use of intelligent solid state drives reduces the need to exchange data with a central processing unit (CPU) in a server.

Public/Granted literature

US20180081798A1 SYSTEM AND METHOD FOR EXECUTING DATA PROCESSING TASKS USING RESILIENT DISTRIBUTED DATASETS (RDDS) IN A STORAGE DEVICE Public/Granted day:2018-03-22

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06F	电数字数据处理（基于特定计算模型的计算机系统入G06N）
G06F12/00	安装在筛选装置之上的在存储器系统或体系结构内的存取、寻址或分配（来自记录载体的数字输入，或者到记录载体上去的数字输出，例如到磁盘存储单元，G06F3/06）
G06F12/02	.寻址或地址分配；地址的重新分配（程序地址排序入G06F9/00；在数字存储器中选择地址的设计入G11C8/00）