-
公开(公告)号:GB2523238A
公开(公告)日:2015-08-19
申请号:GB201422580
申请日:2014-12-18
Applicant: IBM
Inventor: TAN WEI , MENG XIAOQIAO , ZHANG ZHE , WANG GUOHUI
IPC: G06F17/30
Abstract: A method comprises receiving a request from an analytical node for a set of data for a defined job, and identifying in networked storage a strict subset of the data for the job. The subset of data is loaded to the analytical node based on the sequence in which the data are projected to be accessed in the job. In an embodiment, the request includes a specification for the job, and the specification is analyzed to identify the subset of data. In one embodiment, the subset of data is identified by identifying another job having a relationship to the defined job, and identifying the data used for that other job. In an embodiment, the networked computing environment is a cloud computing environment, and the defined job is an analytics job.