Dynamic checkpointing systems and methods

    公开(公告)号:US09652338B2

    公开(公告)日:2017-05-16

    申请号:US14571383

    申请日:2014-12-16

    CPC classification number: G06F11/1484

    Abstract: A method for determining a delay in a dynamic, event driven, checkpoint interval. In one embodiment, the method includes the steps of determining the number of network bits to be transferred; determining the target bit transfer rate; calculating the next cycle delay as the number of bits to be transferred divided by the target bit transfer rate. In another aspect, the invention relates to a method for delaying a checkpoint interval. In one embodiment, the method includes the steps of monitoring the transfer of a prior batch of network data and delaying a subsequent checkpoint until the transfer of a prior batch of network data has reached a certain predetermined level of completion. In another embodiment, the predetermined level of completion is 100%.

    REAL-TIME FAULT-TOLERANT CHECKPOINTING

    公开(公告)号:US20220066887A1

    公开(公告)日:2022-03-03

    申请号:US17003808

    申请日:2020-08-26

    Abstract: In part, the disclosure relates to a real-time fault tolerant system. The system may include a first computing device, a second computing, and a hardware interconnect. The first computing device may include one or more memory devices, one or more processors, a first network interface operable to receive device data and transmit output data over a time-slot-based bus, wherein the output data is generated from processing device data, and a first real-time checkpoint engine. The second computing device may include similar components or the same components as the first computing device. The hardware interconnect is operable to permit data exchange between the first computing device and the second computing device. Checkpoints may be generated by checkpoint engines during lower-priority communication time slots allocated on the time slot-based bus to avoid interfering with any real-time communications to or from the first and second computing devices.

    Fault tolerant systems and methods for cache flush coordination

    公开(公告)号:US11263136B2

    公开(公告)日:2022-03-01

    申请号:US16900915

    申请日:2020-06-13

    Abstract: In part, the disclosure relates to a method of performing a checkpoint process in an active-active computer system including a first node and a second node, wherein each node includes an active checkpoint cache, flush cache, and data storage. In various embodiments, flush operations are coordinated between nodes. The method includes receiving a request for a checkpoint operation at the first node; pausing activity at the first node; notifying the second node of the impending checkpoint operation; performing the checkpoint operation, wherein data associated with the checkpoint operation includes the active checkpoint cache and the flush cache; merging the active checkpoint cache into the flush cache; and resuming activity at the first node. The method may also include each node informing the other node of the completion of cache flush operations.

    FAULT TOLERANT SYSTEMS AND METHODS FOR CACHE FLUSH COORDINATION

    公开(公告)号:US20210034523A1

    公开(公告)日:2021-02-04

    申请号:US16900915

    申请日:2020-06-13

    Abstract: In part, the disclosure relates to a method of performing a checkpoint process in an active-active computer system including a first node and a second node, wherein each node includes an active checkpoint cache, flush cache, and data storage. In various embodiments, flush operations are coordinated between nodes. The method includes receiving a request for a checkpoint operation at the first node; pausing activity at the first node; notifying the second node of the impending checkpoint operation; performing the checkpoint operation, wherein data associated with the checkpoint operation includes the active checkpoint cache and the flush cache; merging the active checkpoint cache into the flush cache; and resuming activity at the first node. The method may also include each node informing the other node of the completion of cache flush operations.

    COMPUTER DUPLICATION AND CONFIGURATION MANAGEMENT SYSTEMS AND METHODS

    公开(公告)号:US20210034483A1

    公开(公告)日:2021-02-04

    申请号:US16900914

    申请日:2020-06-13

    Abstract: In part, the disclosure relates to systems and methods to rapidly copy the computer operating system, drivers and applications from a source computer to a target computer using a duplication engine. Once the copy is complete the source computer will resume execution, and the target computer will first alter its configuration (also referred to as a role or personality) and then resume execution conforming to its new configuration as indicated by a profile stored in protected or specialized memory. The profile can be value, a file, or other memory structure and is protected in the sense that the profile (and or the region of memory where it is stored) must not be overwritten by a state transfer from the source computer to the target computer.

    SYSTEMS AND METHODS FOR APPLYING CHECKPOINTS ON A SECONDARY COMPUTER IN PARALLEL WITH TRANSMISSION

    公开(公告)号:US20210034465A1

    公开(公告)日:2021-02-04

    申请号:US16900913

    申请日:2020-06-13

    Abstract: The disclosure relates to a method of checkpointing. The method may include determining, by the primary computer, when to initiate a checkpoint point operation; dividing, at the primary computer, checkpoint data into two or more groups, wherein each group includes one or more pages of memory; transmitting a first group to the secondary computer; upon receiving, by the secondary computer, the first group, correlating memory pages in the first group with pages in memory on the secondary computer; determining, at the secondary computer, which bytes of memory pages of the first group differ from the correlated pages stored in memory in the secondary computer; and applying data from the first group by swapping differences between the memory pages of the first group and the correlated memory pages stored in the secondary computer. Where at least some of these multiple operations are performed in parallel during a subset of the overall checkpoint operation. The simultaneous performance of various memory manage checkpoint operations is advantageous in various fault tolerant systems. The differences may be N-byte differences such as 8-byte differences.

    Method of delaying checkpoints by inspecting network packets

    公开(公告)号:US09760442B2

    公开(公告)日:2017-09-12

    申请号:US14571391

    申请日:2014-12-16

    Abstract: A method of delaying checkpointing in a virtual machine system. In one embodiment, the method includes the steps of examining a network frame to determine if it is a deferrable frame and if the frame is a deferrable frame, delaying a checkpoint associated with the frame. In another embodiment, the deferrable frame is one of a group comprising: an IP packet tagged with the ‘more fragments’ attribute; TCP data segments that lack the PSH flag and carry no flags other than ‘ACK’; and TCP segments that contain no data and carry only the ‘ACK’ flag; and any frame originating from or destined to a designated network address or port number. In still another embodiment, the method includes the step of concatenating the delays due to deferrable frames. In still yet another embodiment, the method further includes setting an upper limit to the amount of delay that can be generated.

    System and method for writing checkpointing data
    19.
    发明授权
    System and method for writing checkpointing data 有权
    用于写入检查点数据的系统和方法

    公开(公告)号:US09251002B2

    公开(公告)日:2016-02-02

    申请号:US14107536

    申请日:2013-12-16

    Abstract: In part, the invention relates to a system and method for writing checkpointing data to a computer having a standby virtual machine for each checkpointed component on a computer having an active virtual machine. In one embodiment, the checkpointing data is processed on a per virtual machine basis. This is performed in a way that allows checkpointing data packets from multiple sources to be transferred asynchronously, subsequently reassembled into a coherent checkpoint message, and applied asynchronously.

    Abstract translation: 部分地,本发明涉及一种用于在具有活动虚拟机的计算机上为每个检查点组件写入具有备用虚拟机的计算机的检查点数据的系统和方法。 在一个实施例中,基于每个虚拟机处理检查点数据。 这是以允许来自多个源的数据包的检查点异步传输的方式执行的,随后重新组合成一个相关的检查点消息,并异步应用。

    System and Method for Writing Checkpointing Data
    20.
    发明申请
    System and Method for Writing Checkpointing Data 有权
    编写检查点数据的系统和方法

    公开(公告)号:US20140201574A1

    公开(公告)日:2014-07-17

    申请号:US14107536

    申请日:2013-12-16

    Abstract: In part, the invention relates to a system and method for writing checkpointing data to a computer having a standby virtual machine for each checkpointed component on a computer having an active virtual machine. In one embodiment, the checkpointing data is processed on a per virtual machine basis. This is performed in a way that allows checkpointing data packets from multiple sources to be transferred asynchronously, subsequently reassembled into a coherent checkpoint message, and applied asynchronously.

    Abstract translation: 部分地,本发明涉及一种用于在具有活动虚拟机的计算机上为每个检查点组件写入具有备用虚拟机的计算机的检查点数据的系统和方法。 在一个实施例中,基于每个虚拟机处理检查点数据。 这是以允许来自多个源的数据包的检查点异步传输的方式执行的,随后重新组合成一个相关的检查点消息,并异步应用。

Patent Agency Ranking