OPPORTUNISTIC FAILOVER IN A HIGH AVAILABILITY CLUSTER

    公开(公告)号:US20180329789A1

    公开(公告)日:2018-11-15

    申请号:US15590929

    申请日:2017-05-09

    Applicant: VMware, Inc.

    Inventor: Jingliang Shang

    CPC classification number: G06F11/203 G06F2201/805 G06F2201/825 H04L67/1097

    Abstract: Failover processing of a service in a cluster of nodes is disclosed. A node brings up a failover instance of a service in response to an indication of a failure in an initial instance of the service. The failover service executes in the background so as not to be visible to users and thus does not receive user requests. The background service is terminated in response to the failure indication being a false positive. The background service is executed in the foreground in response to detecting that the initial instance of the service has failed and processes user requests.

    Optimized lock detection in a change block tracker

    公开(公告)号:US10078641B1

    公开(公告)日:2018-09-18

    申请号:US14985211

    申请日:2015-12-30

    CPC classification number: G06F16/128 G06F11/1448 G06F11/3041 G06F2201/825

    Abstract: A method for optimizing lock detection in a change block tracker (CBT). A volume subject to the backup constantly receives writes. Therefore, it is important to divide writes that occurred before the lock and the write that occurred after the lock. The CBT is installed into kernel at the installation of the backup agent (application). The detection of changes is started at the completion of a previous snapshot. Snapshot creation request (SCR) is generated. Lock request is received by the CBT. Once the lock request is intercepted, the current changes are moved to frozen changes. The CBT processes the SCR and creates a collection of changes—i.e., snapshot with a Name. The CBT moves the current changes to the snapshot Name.

    FIRST-FAILURE DATA CAPTURE DURING LOCKSTEP PROCESSOR INITIALIZATION

    公开(公告)号:US20180089032A1

    公开(公告)日:2018-03-29

    申请号:US15278180

    申请日:2016-09-28

    Inventor: Ying -Yeung Li

    Abstract: Techniques are provided for performing automated operations to enable first-failure data capture functionality during initialization of multiple lockstep processors. Following a hardware reset of two lockstep processors, an indication is received of one or more crosscheck errors regarding the operation of the two lockstep processors. In response to the crosscheck errors, crosscheck first-failure data capture (FFDC) data is saved to one or more memory areas that are persistent across a hardware reset, and it is determined whether a predefined reset threshold has been satisfied. Responsive to determining that the predefined reset threshold has been satisfied, the crosscheck FFDC data from the one or more persistent memory areas is analyzed and one or more crosscheck initialization codes are responsively generated. An additional hardware reset is initiated.

    METHOD AND DEVICE FOR PROCESSING DISTRIBUTED TRANSACTION

    公开(公告)号:US20170308566A1

    公开(公告)日:2017-10-26

    申请号:US15495399

    申请日:2017-04-24

    Abstract: An objective of the present application is to provide a method and device for processing a distributed transaction. Compared with the prior art, in a distributed system, a first device in the present application is a group formed by multiple nodes. The first device acquires a writer lock adding request, sent by an SQL compiler, about a target table of a to-be-processed transaction, and performs writer lock adding processing on the target table on the basis of the writer lock adding request, so as to avoid collision problems in multi-transaction concurrence and to ensure isolation of transactions. If the writer lock adding processing is successful, to-be-processed version information of the target table is sent to the SQL compiler, so as to ensure that the SQL compiler operates a correct version of the target table later. Further, by restarting child nodes which do not respond to the writer lock adding request or migrating service of the child nodes to other physical machines and then updating locking state information of the child nodes, the present application effectively recovers errors and ensures in-group consistency.

Patent Agency Ranking