-
公开(公告)号:US20230385157A1
公开(公告)日:2023-11-30
申请号:US18325853
申请日:2023-05-30
Applicant: Cloudera, Inc.
Inventor: Prashant Pogde , Siddharth Wagle , Siyao Meng , Nandakumar Vadivelu , Sadanand Shenoy
IPC: G06F11/14 , G06F16/11 , G06F16/13 , G06F16/182 , G06F16/178
CPC classification number: G06F11/1458 , G06F16/122 , G06F16/134 , G06F16/1844 , G06F16/178 , G06F2201/84
Abstract: Snapshot or point-in-time image functionality improves the use of object-based datastores. An example system includes an object-based datastore and a metadata datastore associated with the object-based datastore. Instances of the metadata datastore are created as snapshot images of the object-based datastore. Comparison of snapshot images is important for database analytics, disaster recovery, data protection, and more. Example techniques provide comparison of snapshot images (as metadata datastore instances) and remain robust and accurate in view of compactions performed by the metadata datastore. An example technique includes generating and updating a graph-based data structure that captures relationships between metadata files in the metadata datastore, particularly between pre-compaction files and post-compaction files. The example technique further includes referencing the graph-based data structure to accelerate snapshot image comparison based on determining whether files of a source snapshot image were compacted into files of a destination snapshot image, and/or vice versa.
-
公开(公告)号:US20230385304A1
公开(公告)日:2023-11-30
申请号:US18325825
申请日:2023-05-30
Applicant: Cloudera, Inc.
Inventor: Prashant Pogde , Siddharth Wagle , Nandakumar Vadivelu , Mukul Kumar Singh
CPC classification number: G06F16/27 , G06F11/1458 , G06F2201/84
Abstract: Snapshot or point-in-time image functionality improves the use of object-based datastores. An example system includes an object-based datastore and a metadata datastore associated with the object-based datastore. Instances of the metadata datastore each identify the distributed data blocks that respectively store the objects included in the object-based datastore. An instance of the metadata datastore is used as a snapshot image of a set of objects of the object-based datastore based on identifying the distributed data blocks that were mapped to the set of objects during a point in time. A different instance of the metadata datastore is dynamically updated to reflect present mappings of distributed data blocks to objects in the object-based datastore. Different instances of the metadata datastore are used to read objects from previous timepoints, restore the object-based datastore to a previous snapshot image, enable intelligent deletion of objects of the object-based data store, and more.
-