OPTIMIZING DISTRIBUTED DATA ANALYTICS FOR SHARED STORAGE
    2.
    发明申请
    OPTIMIZING DISTRIBUTED DATA ANALYTICS FOR SHARED STORAGE 有权
    优化用于共享存储的分布式数据分析

    公开(公告)号:US20150334203A1

    公开(公告)日:2015-11-19

    申请号:US14814445

    申请日:2015-07-30

    Applicant: NetApp, Inc.

    Abstract: Methods, systems, and computer executable instructions for performing distributed data analytics are provided. In one exemplary embodiment, a method of performing a distributed data analytics job includes collecting application-specific information in a processing node assigned to perform a task to identify data necessary to perform the task. The method also includes requesting a chunk of the necessary data from a storage server based on location information indicating one or more locations of the data chunk and prioritizing the request relative to other data requests associated with the job. The method also includes receiving the data chunk from the storage server in response to the request and storing the data chunk in a memory cache of the processing node which uses a same file system as the storage server.

    Abstract translation: 提供了用于执行分布式数据分析的方法,系统和计算机可执行指令。 在一个示例性实施例中,执行分布式数据分析作业的方法包括在分配用于执行任务以识别执行任务所需的数据的处理节点中收集特定于应用的信息。 该方法还包括基于指示数据块的一个或多个位置的位置信息和相对于与该作业相关联的其它数据请求对该请求进行优先级排序从存储服务器请求一组必要数据。 该方法还包括响应于请求从存储服务器接收数据块,并将数据块存储在使用与存储服务器相同的文件系统的处理节点的存储器高速缓存中。

Patent Agency Ranking