-
Publication No.: US20190026042A1
Publication Date: 2019-01-24
Application No.: US15653249
Filing Date: 2017-07-18
Applicant: VMware, Inc.
Inventor: Abhishek Gupta, Jorge Guerra Delgado, Sandeep Rangaswamy, Srinath Premachandran, Pascal Renauld
Abstract: Techniques for enabling deduplication-aware load balancing in a distributed storage system are provided. In one set of embodiments, a node of the distributed storage system can receive an I/O (Input/Output) request pertaining to a data block of a storage object stored on a local storage component of the node. The node can further determine whether the I/O request requires insertion of a new entry into a deduplication hash table associated with the local storage component or deletion of an existing entry from the deduplication hash table. If the I/O request requires insertion of a new hash table entry, the node can add an identifier of the data block into a probabilistic data structure associated with the local storage component, where the probabilistic data structure is configured to maintain information regarding distinct data blocks that are likely present in the local storage component. Alternatively, if the I/O request requires deletion of an existing hash table entry, the node can remove the identifier of the data block from the probabilistic data structure.
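The bookkeeping described in this abstract can be sketched as follows. The abstract does not name the probabilistic data structure; a counting Bloom filter is assumed here only because it supports both insertion and removal. The names `CountingBloomFilter`, `handle_io`, and the reference-counted `dedup_table` dictionary are hypothetical and not taken from the patent.

```python
import hashlib


class CountingBloomFilter:
    """Approximate set of block identifiers that also supports removal.

    Assumption: the patent's probabilistic structure is not specified in the
    abstract; a counting Bloom filter is used here purely for illustration.
    """

    def __init__(self, num_counters=1024, num_hashes=3):
        self.counters = [0] * num_counters
        self.num_hashes = num_hashes

    def _slots(self, block_id):
        # Derive num_hashes counter positions from salted SHA-256 digests.
        for salt in range(self.num_hashes):
            digest = hashlib.sha256(f"{salt}:{block_id}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % len(self.counters)

    def add(self, block_id):
        for slot in self._slots(block_id):
            self.counters[slot] += 1

    def remove(self, block_id):
        for slot in self._slots(block_id):
            if self.counters[slot] > 0:
                self.counters[slot] -= 1

    def probably_contains(self, block_id):
        return all(self.counters[slot] > 0 for slot in self._slots(block_id))


def handle_io(dedup_table, summary, op, block_id):
    """Touch the summary only when the dedup hash table gains or loses an entry."""
    if op == "write":
        if block_id not in dedup_table:
            dedup_table[block_id] = 0
            summary.add(block_id)  # a new hash table entry was inserted
        dedup_table[block_id] += 1
    elif op == "delete" and block_id in dedup_table:
        dedup_table[block_id] -= 1
        if dedup_table[block_id] == 0:
            del dedup_table[block_id]
            summary.remove(block_id)  # an existing hash table entry was deleted
```

A second write of the same block only bumps the reference count, so the summary keeps tracking distinct blocks rather than total writes.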
-
Publication No.: US10102057B2
Publication Date: 2018-10-16
Application No.: US14716756
Filing Date: 2015-05-19
Applicant: VMware, Inc.
Inventor: Christos Karamanolis, Wenguang Wang, Kiran Joshi, Sandeep Rangaswamy
Abstract: Exemplary methods, apparatuses, and systems include a first layer of a virtual storage area network (VSAN) module receiving a write request from a data compute node. The write request includes data to be written and the VSAN module is distributed across a plurality of computers to provide an aggregate object store using storage attached to each of the plurality of computers. The first layer of the VSAN module calculates a checksum for the data to be written and passes the data to be written and the checksum to a second layer of the VSAN module. The second layer of the VSAN module calculates a first verification checksum for the data to be written. The data and the checksum are written to persistent storage in response to determining the first verification checksum matches the checksum passed by the first layer of the VSAN module.
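A minimal sketch of the two-layer check described above, assuming CRC-32 as the checksum (the abstract does not say which checksum is used) and a plain dictionary standing in for persistent storage. The function names `upper_layer_write` and `lower_layer_write` are hypothetical.

```python
import zlib


class ChecksumMismatch(Exception):
    """Raised when the data changed between the two layers."""


def lower_layer_write(store, key, data, checksum):
    # Second layer: recompute a verification checksum and persist the data
    # together with the checksum only if it matches the one passed down.
    verification = zlib.crc32(data)
    if verification != checksum:
        raise ChecksumMismatch(f"expected {checksum:#010x}, got {verification:#010x}")
    store[key] = (data, checksum)


def upper_layer_write(store, key, data):
    # First layer: compute the checksum as early as possible, then pass both
    # the data and the checksum to the next layer.
    checksum = zlib.crc32(data)
    lower_layer_write(store, key, data, checksum)
```

Computing the checksum in the first layer means corruption introduced anywhere between the layers is caught before the write reaches storage.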
-
Publication No.: US20170262465A1
Publication Date: 2017-09-14
Application No.: US15068988
Filing Date: 2016-03-14
Applicant: VMware, Inc.
Inventor: Edward J. Goggin, Sandeep Rangaswamy, Kiran Joshi, Srinath Premachandran, Gopala Suryanarayana, Radu Berinde, Wenguang Wang, Jorge Guerra Delgado, Jin Zhang, Radhika Vullikanti
CPC classification number: G06F16/1748, H04L67/1004, H04L67/1097
Abstract: In accordance with the present disclosure, files may be deduplicated in a distributed storage system having a plurality of storage volumes. A uniqueness metric for each file may indicate a degree of deduplication of the respective data files in the given storage volume. The uniqueness metric may be used to identify files for rebalancing in the distributed storage system. The uniqueness metric may be efficiently calculated with enough accuracy using a sampling methodology.
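One way to picture a sampled uniqueness metric is sketched below. The abstract does not define the metric, so this sketch assumes it is the fraction of a file's blocks that no other file in the volume references, estimated from a random sample. The function name `estimate_uniqueness` and the `volume_refcounts` mapping (block hash to reference count) are hypothetical.

```python
import random


def estimate_uniqueness(file_blocks, volume_refcounts, sample_size, rng=None):
    """Estimate the share of a file's blocks that are not shared in the volume.

    file_blocks: list of block hashes belonging to the file.
    volume_refcounts: assumed mapping of block hash -> number of references.
    """
    if not file_blocks:
        return 0.0
    rng = rng or random.Random()
    k = min(sample_size, len(file_blocks))
    sample = rng.sample(file_blocks, k)
    # A block with at most one reference is stored only for this file.
    unique = sum(1 for h in sample if volume_refcounts.get(h, 0) <= 1)
    return unique / k
```

Under this assumed definition, a file with a value near 1.0 frees almost its full size when moved off the volume, while a value near 0.0 means most of its blocks would remain because other files still reference them.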
-
Publication No.: US20220398025A1
Publication Date: 2022-12-15
Application No.: US17347491
Filing Date: 2021-06-14
Applicant: VMware, Inc.
Inventor: Aditya Kotwal, Venkata Ramanan, Sandeep Rangaswamy, Brian Caulfield
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for executing concurrent writes to a data store. One of the systems includes a data store comprising a plurality of storage segments, wherein each storage segment comprises a plurality of blocks; and an allocator system comprising: a plurality of threads, and a plurality of bitmaps each corresponding to a respective storage segment of the data store, wherein the allocator system is configured to perform operations comprising: assigning a respective bitmap to each thread of the plurality of threads; and executing, by each thread of the plurality of threads, one or more write requests to one or more blocks of the storage segment corresponding to the thread using the bitmap assigned to the thread, wherein executing a write request by a thread includes updating the bitmap assigned to the thread.
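The allocator arrangement above can be sketched as follows, assuming one thread per storage segment and in-memory lists standing in for the data store. The names `SegmentBitmap`, `worker`, `run_concurrent_writes`, and the constant `BLOCKS_PER_SEGMENT` are hypothetical.

```python
import threading

BLOCKS_PER_SEGMENT = 8  # illustrative size, not from the patent


class SegmentBitmap:
    """Tracks which blocks of one storage segment are in use."""

    def __init__(self):
        self.bits = [False] * BLOCKS_PER_SEGMENT

    def allocate(self):
        # Return the index of the first free block and mark it used.
        for index, used in enumerate(self.bits):
            if not used:
                self.bits[index] = True
                return index
        return None  # segment is full


def worker(bitmap, segment, payloads):
    # The bitmap is assigned to this thread alone, so no lock is taken here.
    for payload in payloads:
        block = bitmap.allocate()
        if block is None:
            break
        segment[block] = payload


def run_concurrent_writes(requests_per_thread):
    count = len(requests_per_thread)
    bitmaps = [SegmentBitmap() for _ in range(count)]
    segments = [[None] * BLOCKS_PER_SEGMENT for _ in range(count)]
    threads = [
        threading.Thread(target=worker, args=(bitmaps[i], segments[i], requests_per_thread[i]))
        for i in range(count)
    ]
    for thread in threads:
        thread.start()
    for thread in threads:
        thread.join()
    return bitmaps, segments
```

Because each thread updates only its own bitmap, writers in this sketch never contend on shared allocation state.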
-
Publication No.: US11507553B2
Publication Date: 2022-11-22
Application No.: US15927019
Filing Date: 2018-03-20
Applicant: VMware, Inc.
Inventor: Abhishek Gupta, Rob T. Johnson, Srinath Premachandran, Richard P. Spillane, Sandeep Rangaswamy, Jorge Guerra Delgado, Kapil Chowksey, Wenguang Wang
IPC: G06F16/22, G06F16/17, G06F16/23, G06F16/2453, G06F16/2455
Abstract: Exemplary methods, apparatuses, and systems include a file system process inserting a first key/value pair and a second key/value pair into a first tree. The second key is a duplicate of the first key and the value of the second key/value pair is an operation changing the value. In response to a request for a range of key/value pairs, the process reads the second key/value pair and inserts it in a second tree. The process reads the first pair and determines, while inserting the first pair in the second tree, that the second key is a duplicate of the first key. The file system process determines an updated value of the first value by applying the operation in the second value to the first value. The file system process updates the second key/value pair in the second tree with the updated value and returns the requested range of key/value pairs.
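The range-query merge described in this abstract can be sketched as below. The first tree is modeled as a list of pairs read newest-first and the second tree as a dictionary; both are simplifications. The `Op` wrapper, the `range_query` function, and the composition of stacked operations are assumptions added for illustration.

```python
class Op:
    """A value that is an operation to apply to an older value (hypothetical wrapper)."""

    def __init__(self, fn):
        self.fn = fn


def range_query(first_tree, lo, hi):
    """Resolve duplicate keys while copying a key range into a second tree.

    first_tree: list of (key, value) pairs, newest entries first; a value may
    be an Op that modifies the older value stored under the same key.
    """
    second_tree = {}
    for key, value in first_tree:
        if not (lo <= key <= hi):
            continue
        if key not in second_tree:
            second_tree[key] = value  # first time this key is seen
            continue
        pending = second_tree[key]
        if not isinstance(pending, Op):
            continue  # a newer plain value already overrides this older entry
        if isinstance(value, Op):
            # Two stacked operations: apply the older one first, then the newer.
            second_tree[key] = Op(lambda v, newer=pending.fn, older=value.fn: newer(older(v)))
        else:
            # Duplicate key found: apply the pending operation to the older value.
            second_tree[key] = pending.fn(value)
    return sorted(second_tree.items(), key=lambda item: item[0])
```

For example, with `("b", Op(lambda v: v + 1))` read before `("b", 5)`, the query returns `("b", 6)` for that key.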
-
Publication No.: US11487731B2
Publication Date: 2022-11-01
Application No.: US16931219
Filing Date: 2020-07-16
Applicant: VMware, Inc.
Inventor: Abhishek Gupta, Richard P. Spillane, Robert T. Johnson, Srinath Premachandran, Jorge Guerra Delgado, Kapil Chowksey, Sandeep Rangaswamy
Abstract: Embodiments described herein are related to a method of scanning a B-tree. For example, a method comprises receiving a scan request to scan a B-tree having a plurality of levels, each level comprising one or more nodes, wherein for each of one or more levels of the plurality of levels, nodes are grouped into groups, where nodes of any given group are stored across sequential disk blocks. The method further comprises generating a queue for each level of the B-tree. For each queue, the method further comprises loading into memory a next group of nodes based upon determining a storage location of a node of the next group of nodes.
-
Publication No.: US11182344B2
Publication Date: 2021-11-23
Application No.: US15068988
Filing Date: 2016-03-14
Applicant: VMware, Inc.
Inventor: Edward J Goggin, Sandeep Rangaswamy, Kiran Joshi, Srinath Premachandran, Gopala Suryanarayana, Radu Berinde, Wenguang Wang, Jorge Guerra Delgado, Jin Zhang, Radhika Vullikanti
IPC: G06F16/00, G06F16/174, H04L29/08
Abstract: In accordance with the present disclosure, files may be deduplicated in a distributed storage system having a plurality of storage volumes. A uniqueness metric for each file may indicate a degree of deduplication of the respective data files in the given storage volume. The uniqueness metric may be used to identify files for rebalancing in the distributed storage system. The uniqueness metric may be efficiently calculated with enough accuracy using a sampling methodology.
-
Publication No.: US10983909B2
Publication Date: 2021-04-20
Application No.: US16252488
Filing Date: 2019-01-18
Applicant: VMware, Inc.
Inventor: Abhishek Gupta, Robert T. Johnson, Richard P. Spillane, Sandeep Rangaswamy, Jorge Guerra Delgado, Kapil Chowksey, Srinath Premachandran
IPC: G06F12/0804, G06F16/22, G06F7/16, G06F16/2455
Abstract: Certain aspects provide systems and methods for performing an operation on a Bε-tree. A method comprises writing a message associated with the operation to a first slot in a first buffer of a first non-leaf node of the Bε-tree in an append-only manner, wherein a first filter associated with the first slot is used for query operations associated with the first slot. The method further comprises determining that the first buffer is full and, upon determining to flush the message to a non-leaf child node, flushing the message in an append-only manner to a second slot in a second buffer of the non-leaf child node, wherein a second filter associated with the second slot is used for query operations associated with the second slot. The method further comprises, upon determining to flush the message to a leaf node, flushing the message to the leaf node in a sorted manner.
-
Publication No.: US10824610B2
Publication Date: 2020-11-03
Application No.: US16134564
Filing Date: 2018-09-18
Applicant: VMware, Inc.
Inventor: Robert T Johnson, Ittai Abraham, Abhishek Gupta, Richard P Spillane, Srinath Premachandran, Jorge Guerra Delgado, Sandeep Rangaswamy, Kapil Chowksey
Abstract: A buffer tree structure includes, at each internal node, a buffer having a compacted portion and an uncompacted portion. Insertion of data elements to the buffer tree can occur in units called packets. A packet is initially stored in the uncompacted portion of a receiving node's buffer. When a compaction trigger condition exists, packet compaction is performed including a data element compaction operation. A buffer-emptying (flush) operation pushes the compacted packets to children nodes.
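A toy sketch of the buffer layout described above, with a single internal node and two list-based children. The trigger condition (a packet-count threshold), the single `pivot` key, and the rule that a later write to the same key replaces an earlier one are all assumptions; the abstract does not specify them. The class name `InternalNode` is hypothetical.

```python
class InternalNode:
    """One internal node whose buffer has an uncompacted and a compacted portion."""

    def __init__(self, pivot, compaction_threshold=2):
        self.pivot = pivot  # keys below the pivot go to child 0, others to child 1
        self.uncompacted = []  # packets exactly as they arrived
        self.compacted = []  # sorted, de-duplicated (key, value) elements
        self.children = [[], []]  # stand-ins for child nodes
        self.compaction_threshold = compaction_threshold

    def insert_packet(self, packet):
        # A new packet always lands in the uncompacted portion first.
        self.uncompacted.append(list(packet))
        # Assumed trigger condition: too many uncompacted packets.
        if len(self.uncompacted) >= self.compaction_threshold:
            self.compact()

    def compact(self):
        merged = dict(self.compacted)
        for packet in self.uncompacted:  # oldest first, so later writes win
            merged.update(packet)
        self.compacted = sorted(merged.items())
        self.uncompacted = []

    def flush(self):
        # Buffer-emptying: push compacted elements down to the children.
        for key, value in self.compacted:
            self.children[0 if key < self.pivot else 1].append((key, value))
        self.compacted = []
```

Keeping arriving packets uncompacted makes insertion cheap, and the sorting and de-duplication cost is paid only when the trigger fires.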
-
Publication No.: US20200089788A1
Publication Date: 2020-03-19
Application No.: US16134564
Filing Date: 2018-09-18
Applicant: VMware, Inc.
Inventor: Robert T. Johnson, Ittai Abraham, Abhishek Gupta, Richard P. Spillane, Srinath Premachandran, Jorge Guerra Delgado, Sandeep Rangaswamy, Kapil Chowksey
IPC: G06F17/30
Abstract: A buffer tree structure includes, at each internal node, a buffer having a compacted portion and an uncompacted portion. Insertion of data elements to the buffer tree can occur in units called packets. A packet is initially stored in the uncompacted portion of a receiving node's buffer. When a compaction trigger condition exists, packet compaction is performed including a data element compaction operation. A buffer-emptying (flush) operation pushes the compacted packets to children nodes.