INCREASING COORDINATION SERVICE RELIABITY
    141.
    发明申请
    INCREASING COORDINATION SERVICE RELIABITY 有权
    增加协调服务的可靠性

    公开(公告)号:US20160224432A1

    公开(公告)日:2016-08-04

    申请号:US14609323

    申请日:2015-01-29

    Abstract: The present invention extends to methods, systems, and computer program products for increasing coordination service reliability. A coordination service maintains state (e.g., using replication) for one or more software components (e.g., applications). Tokens can be used to identify incarnations of a member set within the coordination service. When a member starts and has no token, the member attempts to learn the token from a majority other members. If no such token exists, the member requests a new token. Aspects of the invention can be used to detect and compensate for lost state within the coordination service, including state lost due to storage device failures (which may be referred to as “silent data loss”). Detecting and compensating for silent data loss makes the coordination service more reliable and can essentially guarantee that the coordination service notifies clients when data is lost and ceases processing when incorrect state may exist.

    Abstract translation: 本发明扩展到用于增加协调服务可靠性的方法,系统和计算机程序产品。 协调服务维护一个或多个软件组件(例如,应用程序)的状态(例如,使用复制)。 令牌可用于识别协调服务中成员集合的化身。 当成员启动并没有令牌时,成员尝试从大多数其他成员学习令牌。 如果不存在这样的令牌,则成员请求一个新的令牌。 可以使用本发明的方面来检测和补偿协调服务中的丢失状态,包括由于存储设备故障导致的状态丢失(其可以被称为“无声数据丢失”)。 检测和补偿无声数据丢失使得协调服务更可靠,并且可以基本上保证协调服务在数据丢失时通知客户端,并且当存在不正确的状态时停止处理。

    ONLINE RESTORATION OF A SWITCH SNAPSHOT
    142.
    发明申请
    ONLINE RESTORATION OF A SWITCH SNAPSHOT 有权
    在线恢复开关SNAPSHOT

    公开(公告)号:US20160188422A1

    公开(公告)日:2016-06-30

    申请号:US14588106

    申请日:2014-12-31

    Abstract: One embodiment of the present invention provides a switch. The switch includes one or more ports, a persistent storage module, a restoration module, and a retrieval module. The persistent storage module stores configuration information associated with the switch in a data structure, which includes one or more columns for attribute values of the configuration information, in a local persistent storage. The restoration module instantiates a restoration database instance in the persistent storage from an image of the persistent storage. The retrieval module retrieves attribute values from a data structure in a current database instance and the restoration database instance in the persistent storage. The restoration module then applies the differences between attribute values of the restoration database instance and the current database instance in the persistent storage to switch modules of the switch, and operates the restoration database instance as the current database instance in the persistent storage.

    Abstract translation: 本发明的一个实施例提供一种开关。 交换机包括一个或多个端口,持久存储模块,恢复模块和检索模块。 持久性存储模块在本地持久存储器中存储与交换机相关联的配置信息,其中数据结构包括用于配置信息的属性值的一个或多个列。 恢复模块从持久存储器的图像中实例化永久存储器中的恢复数据库实例。 检索模块从当前数据库实例中的数据结构和永久存储中的恢复数据库实例检索属性值。 然后,恢复模块将恢复数据库实例的属性值与当前数据库实例之间的差异应用于永久存储器中以切换交换机的模块,并将恢复数据库实例作为持久存储器中的当前数据库实例进行操作。

    Detecting data loss during site switchover
    144.
    发明授权
    Detecting data loss during site switchover 有权
    在站点切换期间检测数据丢失

    公开(公告)号:US09367413B2

    公开(公告)日:2016-06-14

    申请号:US14266733

    申请日:2014-04-30

    Applicant: NetApp, Inc.

    Abstract: Techniques for detecting data loss during site switchover are disclosed. An example method includes storing at NVRAM of a first node a plurality of operations of a second node, the first and second nodes being disaster recovery partners. The method also includes during a switchover from the second node to the first node, receiving an indication of a first number of operations yet to be completed. The method further includes comparing the first number to a second number of operations in the plurality of operations stored at the NVRAM of the first node. The method also includes in response to the comparing, determining whether at least one operation is missing from the plurality of operations stored in the NVRAM of the first node. The method further includes in response to determining that at least one operation is missing, failing at least one volume.

    Abstract translation: 公开了在现场切换期间检测数据丢失的技术。 一个示例性方法包括在NVRAM的第一节点处存储第二节点的多个操作,第一和第二节点是灾难恢复伙伴。 该方法还包括在从第二节点切换到第一节点期间,接收尚待完成的第一数量的操作的指示。 该方法还包括将第一数量与存储在第一节点的NVRAM中的多个操作中的第二数量的操作进行比较。 该方法还包括响应于比较,确定从存储在第一节点的NVRAM中的多个操作中是否丢失了至少一个操作。 该方法还包括响应于确定至少一个操作丢失,至少一个卷失败。

    Detecting the use of stale data values due to weak consistency
    146.
    发明授权
    Detecting the use of stale data values due to weak consistency 有权
    检测由于弱一致性而使用过时的数据值

    公开(公告)号:US09361176B2

    公开(公告)日:2016-06-07

    申请号:US14314630

    申请日:2014-06-25

    Abstract: An apparatus and method detect the use of stale data values due to weak consistency between parallel threads on a computer system. A consistency error detection mechanism uses object code injection to build a consistency error detection table during the operation of an application. When the application is paused, the consistency error detection mechanism uses the consistency error detection table to detect consistency errors where stale data is used by the application. The consistency error detection mechanism alerts the user/programmer to the consistency errors in the application program.

    Abstract translation: 一种装置和方法检测由于计算机系统上的并行线程之间的弱一致性而使用过时的数据值。 一致性错误检测机制使用对象代码注入在应用程序的操作期间构建一致性错误检测表。 当应用程序暂停时,一致性错误检测机制使用一致性错误检测表来检测应用程序使用陈旧数据的一致性错误。 一致性错误检测机制警告用户/程序员在应用程序中的一致性错误。

    System and method for hybrid kernel- and user-space incremental and full checkpointing
    147.
    发明授权
    System and method for hybrid kernel- and user-space incremental and full checkpointing 有权
    混合内核和用户空间增量和完全检查点的系统和方法

    公开(公告)号:US09354977B1

    公开(公告)日:2016-05-31

    申请号:US14292156

    申请日:2014-05-30

    Abstract: A system includes a multi-process application that runs. A multi-process application runs on primary hosts and is checkpointed by a checkpointer comprised of at least one of a kernel-mode checkpointer module and one or more user-space interceptors providing at least one of barrier synchronization, checkpointing thread, resource flushing, and an application virtualization space. Checkpoints may be written to storage and the application restored from said stored checkpoint at a later time. Checkpointing may be incremental using Page Table Entry (PTE) pages and Virtual Memory Areas (VMA) information. Checkpointing is transparent to the application and requires no modification to the application, operating system, networking stack or libraries. In an alternate embodiment the kernel-mode checkpointer is built into the kernel.

    Abstract translation: 系统包括运行的多进程应用程序。 多进程应用程序在主主机上运行,​​并且由包括内核模式检查指针模块和一个或多个用户空间拦截器中的至少一个的检查点检查点来提供至少一个障碍同步,检查点线程,资源冲洗和 一个应用程序虚拟化空间。 检查点可能会写入存储,并在稍后的时间从所述存储的检查点恢复应用程序。 检查点可以是使用页表项(PTE)页面和虚拟内存区域(VMA)信息的增量。 检查点对应用程序是透明的,不需要修改应用程序,操作系统,网络栈或库。 在一个替代实施例中,内核模式检查指针内置在内核中。

    SYSTEM FOR CROSS-HOST, MULTI-THREAD SESSION ALIGNMENT
    149.
    发明申请
    SYSTEM FOR CROSS-HOST, MULTI-THREAD SESSION ALIGNMENT 审中-公开
    交叉主机系统,多线程会话对齐

    公开(公告)号:US20160142422A1

    公开(公告)日:2016-05-19

    申请号:US14939467

    申请日:2015-11-12

    Abstract: A method of detecting aberrant behavior in a software application is described. The method includes instantiating replicated applications on computing devices using identical initial setting. Each replicated application is a replicated instance of the software application. Information associated with a first API call from the first replicated application, and information associated with a second API call from the second replicated application is received. The information includes a call identifier of the API call and a digest. The call identifier is unique during the lifetime of the replicated application issuing it and is identical across the replicated applications. If the first and second call identifiers are identical, the method determines whether the first and second digests match. The method also includes, in response to the first and second digests not matching, signaling that aberrant behavior has occurred. Apparatus and computer readable media are also described.

    Abstract translation: 描述了一种在软件应用中检测异常行为的方法。 该方法包括使用相同的初始设置实例化计算设备上的复制应用程序。 每个复制的应用程序是软件应用程序的复制实例。 与来自第一复制应用程序的第一API调用相关联的信息以及与来自第二复制应用程序的第二API调用相关联的信息被接收。 该信息包括API调用的呼叫标识符和摘要。 呼叫标识符在发布它的复制应用程序的生存期内是唯一的,并且在复制的应用程序中是相同的。 如果第一和第二呼叫标识符相同,则该方法确定第一和第二摘要是否匹配。 该方法还包括响应于不匹配的第一和第二摘要,表明发生了异常行为。 还描述了装置和计算机可读介质。

Patent Agency Ranking