Abstract:
Embodiments are directed to efficiently backing up portions of data and to performing a scoped data recovery. In an embodiment, a computer system preloads data images with data corresponding to various different software applications or operating systems. The computer system chunks the data images, so that each data image is divided into multiple data chunks, and where each data chunk is represented by a hash value. The computer system then receives, from a user, portions of delta data representing data differences between the received user data and the data in the preloaded data images. The computer system also generates a logical backup representation that includes the data chunk hash values for the preloaded data images as well as the delta data received from the user. This logical representation allows restoration of the user's data using only the hash values and the delta data.
Abstract:
Embodiments are directed to locating and restoring backed up items using a custom schema and to efficiently transferring recovery data. In an embodiment, a computer system defines a schema that provides data search and retrieval among backup data sets. The schema stores searchable attributes for each database item and leverages a file system to store file system metadata for the data items of the backup sets. The computer system receives a request to find data items among the backup data sets and accesses the schema to determine, from the stored searchable attributes, which recovery points among the backup data sets include the requested data items. The computer system also restores the requested data items from the determined recovery point within the backup data sets.
Abstract:
Data can be protected at a production server in a virtually continuous fashion, without necessarily imposing severe constraints on the source application(s). For example, a production server can create an application-consistent backup of one or more volumes, the backups corresponding to a first instance in time. A volume filter driver can monitor data changes using an in-memory bitmap, while a log file and/or update sequence number journal can keep track of which files have been modified. The volume updates are also consistent for an instance (later) in time. At the next replication cycle, such as every few minutes (however configured), the volume filter driver passes each in-memory bitmap to the physical disk on the production server. The production server then sends the updates to the backup server, which thus stores application-consistent backups for the volume for multiple instances of time.
Abstract:
Aspects of the subject matter described herein relate to a storage architecture. In aspects, an address provided by a data source is translated into a logical storage address of virtual storage. This logical storage address is translated into an identifier that may be used to store data on or retrieve data from a storage system. The address space of the virtual storage is divided into chunks that may be streamed to the storage system.
Abstract:
A computer-implemented method includes creating a first snapshot of at least one virtual machine at a first time. The first snapshot is created at a computing device of a cluster of computing devices configured to share the at least one virtual machine. As an example, each computing device in the cluster may modify the shared virtual machine via a direct input/output (I/O) transaction, bypassing a file-system stack. The first snapshot is transmitted to a backup device. The method includes creating a second snapshot of the at least one virtual machine at a second time and determining a set of changed data blocks associated with a difference between the second snapshot and the first snapshot. The set of changed blocks is transmitted to the backup device.
Abstract:
A backup and restore system may present recovered backup data as local resources and in a read and write manner so that the recovered backup data may be accessed using an application that created the data. In some embodiments, the recovered data may be read only data, but a differencing virtual hard disk (VHD), Common Internet File System protocol, or differencing driver may be used to present a read/write version of the data. Additionally, the data may be presented using iSCSI or other technologies to present the data as local data, even though the data may be remotely located. The backup and restore system may additionally use VHD differencing technologies to create multiple backups.
Abstract:
Methods and systems are disclosed for backup using a metadata virtual hard drive (VHD) and a differential VHD. A particular system includes a memory device that stores files and a backup agent configured to detect changes to the files. The system includes a translation module configured to translate the detected changes into VHD formatted changes and a VHD module configured to write the VHD formatted changes to a metadata VHD stored at the memory device. The VHD module is also configured to generate a differential VHD based on the metadata VHD. A network interface at the system is configured to transmit the metadata VHD and the differential VHD to a remote storage device.
Abstract:
A computer-implemented method includes creating a first snapshot of at least one virtual machine at a first time. The first snapshot is created at a computing device of a cluster of computing devices configured to share the at least one virtual machine. As an example, each computing device in the cluster may modify the shared virtual machine via a direct input/output (I/O) transaction, bypassing a file-system stack. The first snapshot is transmitted to a backup device. The method includes creating a second snapshot of the at least one virtual machine at a second time and determining a set of changed data blocks associated with a difference between the second snapshot and the first snapshot. The set of changed blocks is transmitted to the backup device.
Abstract:
Embodiments are directed to locating and restoring backed up items using a custom schema and to efficiently transferring recovery data. In an embodiment, a computer system defines a schema that provides data search and retrieval among backup data sets. The schema stores searchable attributes for each database item and leverages a file system to store file system metadata for the data items of the backup sets. The computer system receives a request to find data items among the backup data sets and accesses the schema to determine, from the stored searchable attributes, which recovery points among the backup data sets include the requested data items. The computer system also restores the requested data items from the determined recovery point within the backup data sets.