LIGHTWEIGHT DATA RECONSTRUCTION BASED ON BACKUP DATA

    公开(公告)号:US20230315589A1

    公开(公告)日:2023-10-05

    申请号:US18132903

    申请日:2023-04-10

    Abstract: An information management system allows a user to search through a secondary copy of data, such as a backup, archive, or snapshot without first retrieving the secondary copy of data. Instead, the system constructs lightweight data that can be displayed to a user as a representation of the search results. Lightweight data may include metadata or other information that identifies data included in the secondary copy of data. The lightweight data may be perceived as being the secondary copy of data and allow a user to browse through search results. Once the user identifies a search result that is of interest, information in the lightweight data can be used to retrieve the secondary copy of data. Because lightweight data may have a smaller file size than the file size of the secondary copy of data, the latency of performing a search may be reduced.

    SYSTEMS AND METHODS FOR EXPORTING AND/OR IMPORTING DATA

    公开(公告)号:US20230315581A1

    公开(公告)日:2023-10-05

    申请号:US17707048

    申请日:2022-03-29

    CPC classification number: G06F11/1464 G06F11/1469 G06F2201/84

    Abstract: Systems and methods for exporting data from a cloud server and/or importing data to a cloud server are disclosed. The method for exporting data includes receiving a plurality of event records. Each event record includes information about an update to application data of an application. The computer-implemented method further including storing the plurality of event records in near real-time in a predefined storage location and in response to a client export request, creating one or more backup files based on the stored event records in the predefined storage location, and allowing export of the one or more backup files to a local client storage location.

    Efficient recovery in continuous data protection environments

    公开(公告)号:US11775399B1

    公开(公告)日:2023-10-03

    申请号:US17656687

    申请日:2022-03-28

    CPC classification number: G06F11/1469 G06F2201/84

    Abstract: A computer-implemented method, a computer system and a computer program product efficiently select restore points in a continuous data protection environment. The method includes receiving log entries that include restore points that correspond to data stored on nodes in the continuous data protection environment. The method also includes identifying an interesting restore point from the log entries. The method further includes grouping the interesting restore point for recovery based on one or more of a confidence score and a restore time. In addition, the method includes loading the group of interesting restore points on available nodes in the continuous data protection environment. The method also includes determining whether the data corresponding to each interesting restore point in the group is valid using a validation function. Lastly, the method includes discarding the interesting restore point when the data corresponding to the interesting restore point is not valid.

    ADAPTIVE DATA MOVER RESOURCE ALLOCATION IN SCALABLE DATA PROTECTION ENVIRONMENT

    公开(公告)号:US20230305929A1

    公开(公告)日:2023-09-28

    申请号:US17656386

    申请日:2022-03-24

    CPC classification number: G06F11/1453 G06F11/1464 G06F2201/84

    Abstract: One example method includes obtaining respective information concerning each asset in a group of assets, performing a deduplication check to identify an entity that will perform deduplication of backups of the assets, based on the information obtained concerning the group of assets, and based on an outcome of the deduplication check, sizing one or more proxy instances that will be needed to create the backups, spawning the proxy instances, and using the proxy instances to create the backups of the data assets.

    Manifest-based snapshots in distributed computing environments

    公开(公告)号:US11768739B2

    公开(公告)日:2023-09-26

    申请号:US16943674

    申请日:2020-07-30

    Applicant: Cloudera, Inc.

    CPC classification number: G06F11/1464 G06F16/27 G06F11/1456 G06F2201/84

    Abstract: Scalable architectures, systems, and services are provided herein for creating manifest-based snapshots in distributed computing environments. In some embodiments, responsive to receiving a request to create a snapshot of a data object, a master node identifies multiple slave nodes on which a data object is stored in the cloud-computing platform and creates a snapshot manifest representing the snapshot of the data object. The snapshot manifest comprises a file including a listing of multiple file names in the snapshot manifest and reference information for locating the multiple files in the distributed database system. The snapshot can be created without disrupting I/O operations, e.g., in an online mode by various region servers as directed by the master node. Additionally, a log roll approach to creating the snapshot is also disclosed in which log files are marked. The replaying of log entries can reduce the probability of causal consistency in the snapshot.

Patent Agency Ranking