DISTRIBUTED DATA SET INDEXING
    1.
    Invention application

    Publication No.: WO2018148059A1

    Publication date: 2018-08-16

    Application No.: PCT/US2018/015919

    Filing date: 2018-01-30

    Abstract: An apparatus including a processor to receive search criteria including a data value for a search within a data field; in response to the receipt of the query instructions, and for each data cell within a super cell, perform the specified search by comparing the data value to ranges of values indicated in a corresponding cell index to determine whether the data cell includes a data record meeting the search criteria, and in response to a determination that the data cell includes such a data record, use a unique values index in the cell index to search the data records of the data cell to identify one or more data records meeting the search criteria; and in response to identifying at least one data record meeting the search criteria, provide an indication that at least the data cell includes at least one data record meeting the search criteria.
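
The two-stage lookup described in this abstract (a range check against the cell index, then a unique values index to find the records) can be sketched as follows. This is a minimal illustration, not the patented implementation: `CellIndex`, `build_cell_index`, and `search_super_cell` are hypothetical names, a super cell is assumed to be a dict of non-empty record lists, and the search is assumed to be an exact match on one field.

```python
from dataclasses import dataclass, field


@dataclass
class CellIndex:
    """Hypothetical per-cell index: a value range plus a unique-values map."""
    min_value: int
    max_value: int
    # Maps each distinct field value to the positions of the records holding it.
    unique_values: dict = field(default_factory=dict)


def build_cell_index(records, field_name):
    """Build the index for one data cell (assumes the cell is non-empty)."""
    positions = {}
    for pos, record in enumerate(records):
        positions.setdefault(record[field_name], []).append(pos)
    return CellIndex(min(positions), max(positions), positions)


def search_super_cell(super_cell, value):
    """Return {cell_id: matching records} for every cell that has a match.

    super_cell maps a cell id to a (records, CellIndex) pair.
    """
    hits = {}
    for cell_id, (records, index) in super_cell.items():
        # Stage 1: the range check rules out cells that cannot hold the value.
        if not (index.min_value <= value <= index.max_value):
            continue
        # Stage 2: the unique-values index locates the matching records.
        positions = index.unique_values.get(value, [])
        if positions:
            hits[cell_id] = [records[p] for p in positions]
    return hits
```

A value that falls inside a cell's range but is absent from its unique values index yields no hit for that cell, which is why the second stage is still needed after the range check.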

    DISTRIBUTED DATA SET STORAGE AND RETRIEVAL
    2.
    Invention application
    Status: under examination (published)

    Publication No.: WO2017019794A1

    Publication date: 2017-02-02

    Application No.: PCT/US2016/044309

    Filing date: 2016-07-27

    Abstract: An apparatus includes a processor component caused to: retrieve metadata of organization of data within a data set, and map data of organization of data blocks within a data file; receive indications of which node devices are available to perform a processing task with a data set portion; and in response to the data set including partitioned data, compare the quantities of available node devices and of the node devices last involved in storing the data set. In response to a match, for each map data map entry: retrieve a hashed identifier for a data sub-block, and a size for each of the data sub-blocks within the corresponding data block; divide the hashed identifier by the quantity of available node devices; compare the modulo value to a designation assigned to each of the available node devices; and provide a pointer to the available node device assigned the matching designation.
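
The modulo-based assignment in this abstract can be sketched as below. The function name `assign_sub_blocks`, the shape of a map entry (a dict with a `sub_blocks` list of `(hashed_id, size)` pairs), and the use of list positions as node designations are all assumptions made for illustration. The abstract only covers the case where the node counts match, so the mismatch branch here is a placeholder.

```python
def assign_sub_blocks(map_entries, available_nodes, last_node_count):
    """Point each data sub-block at the node whose designation equals
    hashed_id modulo the number of available nodes."""
    if len(available_nodes) != last_node_count:
        # Placeholder: the abstract does not describe this case.
        raise ValueError("node count differs from the count used when storing")

    # Assumption: designations are 0..n-1, in the order the nodes are listed.
    designations = dict(enumerate(available_nodes))

    pointers = []
    for entry in map_entries:
        for hashed_id, size in entry["sub_blocks"]:
            modulo = hashed_id % len(available_nodes)
            pointers.append((hashed_id, size, designations[modulo]))
    return pointers
```

Because the designation depends only on the hashed identifier and the node count, sub-blocks that share a hashed identifier always land on the same node.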

    DISTRIBUTED COLUMNAR DATA SET STORAGE AND RETRIEVAL

    Publication No.: WO2021101798A

    Publication date: 2021-05-27

    Application No.: PCT/US2020/060379

    Filing date: 2020-11-13

    Abstract: An apparatus includes a processor to: instantiate collection threads, data buffers of a queue, and aggregation threads; within each collection thread, assemble a row group from a subset of the multiple rows, reorganize the data values from row-wise to columnar organization, and store the row group within a data buffer of the queue; operate the buffer queue as a FIFO buffer; within each aggregation thread, retrieve multiple row groups from multiple data buffers of the queue, assemble a data set part from the multiple row groups, transmit, to storage device(s) via a network, the data set part; and in response to each instance of retrieval of a row group from a data buffer of the buffer queue for use within an aggregation thread, analyze a level of availability of at least storage space within the node device to determine whether to dynamically adjust the quantity of data buffers of the buffer queue.
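
The collection and aggregation threads in this abstract form a producer/consumer pipeline around a FIFO queue. The sketch below uses Python's standard `queue.Queue` and `threading.Thread`; all function names and parameters are hypothetical, the network transmission is replaced by appending to a list, and the dynamic adjustment of the buffer count is left out.

```python
import queue
import threading


def to_columnar(rows):
    """Reorganize a non-empty list of row dicts into a dict of column lists."""
    return {name: [row[name] for row in rows] for name in rows[0]}


def collect(rows, group_size, buffers):
    """Collection thread body: build columnar row groups and enqueue them."""
    for start in range(0, len(rows), group_size):
        buffers.put(to_columnar(rows[start:start + group_size]))


def aggregate(buffers, groups_per_part, sent_parts, lock):
    """Aggregation thread body: combine row groups into data set parts."""
    part = []
    while True:
        row_group = buffers.get()
        if row_group is None:  # sentinel: no more row groups will arrive
            break
        part.append(row_group)
        if len(part) == groups_per_part:
            with lock:
                sent_parts.append(part)  # stand-in for network transmission
            part = []
    if part:  # flush a final, partially filled part
        with lock:
            sent_parts.append(part)


def run_pipeline(row_subsets, group_size=2, groups_per_part=2,
                 buffer_count=4, aggregator_count=1):
    buffers = queue.Queue(maxsize=buffer_count)  # bounded FIFO of data buffers
    sent_parts, lock = [], threading.Lock()
    collectors = [threading.Thread(target=collect,
                                   args=(rows, group_size, buffers))
                  for rows in row_subsets]
    aggregators = [threading.Thread(target=aggregate,
                                    args=(buffers, groups_per_part,
                                          sent_parts, lock))
                   for _ in range(aggregator_count)]
    for thread in collectors + aggregators:
        thread.start()
    for thread in collectors:
        thread.join()
    for _ in aggregators:
        buffers.put(None)  # one sentinel per aggregation thread
    for thread in aggregators:
        thread.join()
    return sent_parts
```

The bounded `maxsize` makes collection threads block when every buffer is full, which is the back-pressure that a dynamically sized buffer queue would tune.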

    DISTRIBUTED DATA SET ENCRYPTION AND DECRYPTION

    Publication No.: WO2018231266A1

    Publication date: 2018-12-20

    Application No.: PCT/US2017/052486

    Filing date: 2017-09-20

    Abstract: An apparatus includes a processor component of a first node device caused to receive data block encryption data and an indication of size of an encrypted data block distributed to the first node device for decryption, and in response to the data set being of encrypted data: receive an indication of the quantity of sub-blocks within the encrypted data block, and a hashed identifier for each data sub-block; use the data block encryption data to decrypt the encrypted data block to regenerate data set portions from the data sub-blocks; analyze the hashed identifier of each data sub-block to determine whether all data set portions are distributed to the first node device for processing; and in response to a determination that at least one data set portion is to be distributed to a second node device for processing, transmit the at least one data set portion to the second node device.
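
The decrypt-then-route flow in this abstract can be sketched roughly as follows. Several things here are assumptions rather than details from the abstract: the per-sub-block sizes used to split the decrypted block, the modulo rule used to pick a target node, and every name in the code. The XOR routine is a toy stand-in for a real cipher and is not secure.

```python
def xor_decrypt(data: bytes, key: bytes) -> bytes:
    """Toy stand-in for a real cipher; NOT secure, for illustration only."""
    return bytes(b ^ key[i % len(key)] for i, b in enumerate(data))


def decrypt_and_route(encrypted_block, key, sub_blocks, local_node,
                      node_count, decrypt=xor_decrypt):
    """Decrypt one block, split it into data set portions, and decide
    which portions stay local and which go to other nodes.

    sub_blocks is a list of (hashed_id, size) pairs in storage order.
    Returns (portions kept locally, {target node: portions to forward}).
    """
    plain = decrypt(encrypted_block, key)
    keep, forward = [], {}
    offset = 0
    for hashed_id, size in sub_blocks:
        portion = plain[offset:offset + size]
        offset += size
        target = hashed_id % node_count  # assumed routing rule
        if target == local_node:
            keep.append(portion)
        else:
            forward.setdefault(target, []).append(portion)
    return keep, forward
```

An empty `forward` mapping corresponds to the case where all data set portions are to be processed by the first node device.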

    DISTRIBUTED DATA SET INDEXING
    5.
    Invention publication

    Publication No.: EP3828693A1

    Publication date: 2021-06-02

    Application No.: EP20214943.1

    Filing date: 2018-01-30

    Abstract: An apparatus including a processor to receive search criteria including a data value for a search within a data field; in response to the receipt of the query instructions, and for each data cell within a super cell, perform the specified search by comparing the data value to ranges of values indicated in a corresponding cell index to determine whether the data cell includes a data record meeting the search criteria, and in response to a determination that the data cell includes such a data record, use a unique values index in the cell index to search the data records of the data cell to identify one or more data records meeting the search criteria; and in response to identifying at least one data record meeting the search criteria, provide an indication that at least the data cell includes at least one data record meeting the search criteria.
