Process rescue by an OS module after a device error

    Publication No.: US09652319B2

    Publication date: 2017-05-16

    Application No.: US14810439

    Filing date: 2015-07-27

    Applicant: MEDIATEK INC.

    Abstract: A method for performing process fault tolerant control of an electronic device, and an associated apparatus and an associated computer program product are provided, where the method includes: using at least one driver in a kernel layer of an operating system (OS) of the electronic device to perform detection to determine whether a specific process running on the electronic device will be influenced by an error of the electronic device; and when it is detected that the specific process running on the electronic device will be influenced by the error of the electronic device, using at least one control signal of the OS to perform process control on the specific process and using a package manager service (PMS) module of the OS to trigger a rescue procedure. For example, the method may further include: when triggering the rescue procedure, preventing immediately triggering termination of the specific process.
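As an illustration of the control flow in this abstract, the sketch below pauses an affected process instead of terminating it, runs a rescue step, and then resumes it. The choice of `SIGSTOP`/`SIGCONT` as the control signals and the names `handle_device_error`, `affected_pids`, `rescue`, and `send_signal` are assumptions made for this example; the abstract itself only mentions "at least one control signal" and a rescue procedure triggered by the PMS module.

```python
import os
import signal


def handle_device_error(pid, affected_pids, rescue, send_signal=os.kill):
    """Pause, rescue, and resume `pid` if the device error affects it.

    Hypothetical helper: `affected_pids` stands in for the kernel driver's
    detection result, and `rescue` for the rescue procedure.
    Returns True when a rescue was triggered.
    """
    if pid not in affected_pids:
        return False  # the process is not influenced by the error
    send_signal(pid, signal.SIGSTOP)  # hold the process; do not terminate it
    try:
        rescue(pid)  # placeholder for the rescue procedure
    finally:
        send_signal(pid, signal.SIGCONT)  # let the process continue
    return True
```

Passing `send_signal` as a parameter keeps the sketch testable without signalling real processes; by default it falls back to `os.kill`.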

    System and method for high-speed data recording
    2.
    Granted patent (status: in force)

    Publication No.: US09569312B2

    Publication date: 2017-02-14

    Application No.: US13625553

    Filing date: 2012-09-24

    Abstract: A system and method for high speed data recording includes a control computer and a disk pack unit. The disk pack is provided within a shell that provides handling and protection for the disk packs. The disk pack unit provides cooling of the disks and connection for power and disk signaling. A standard connection is provided between the control computer and the disk pack unit. The disk pack units are self sufficient and able to connect to any computer. Multiple disk packs are connected simultaneously to the system, so that one disk pack can be active while one or more disk packs are inactive. To control for power surges, the power to each disk pack is controlled programmatically for the group of disks in a disk pack.


    Reconfigurable recovery modes in high availability processors
    3.
    Granted patent (status: in force)

    Publication No.: US09043641B2

    Publication date: 2015-05-26

    Application No.: US13785103

    Filing date: 2013-03-05

    Abstract: A method for performing error recovery that includes creating, by a processor, a recovery checkpoint. The processor is dynamically switched into a non-recoverable processing mode of operation based on creating the software recovery checkpoint. The non-recoverable processing mode of operation is a mode in which a subset of hardware error recovery resources are powered-down or re-purposed for instruction processing. It is determined, during the non-recoverable processing mode of operation, that a new software recovery checkpoint is required. Based on the determining that a new software recovery checkpoint is required, the processor is dynamically switched into a recoverable processing mode of operation. The recoverable processing mode of operation is a mode in which hardware error recovery resources, including at least one of the hardware error recovery resources in the subset, are purposed for hardware error recovery operations.
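The mode switching described in this abstract can be pictured as a two-state machine. The sketch below is illustrative only: the class `Processor`, the enum `Mode`, the method names, and the resource names used in the usage note are hypothetical and do not come from the patent. For simplicity it restores all recovery resources on the switch back to recoverable mode, whereas the abstract only requires at least one resource from the subset to be restored.

```python
from enum import Enum


class Mode(Enum):
    RECOVERABLE = "recoverable"
    NON_RECOVERABLE = "non-recoverable"


class Processor:
    """Toy model with hypothetical names, not the patented implementation."""

    def __init__(self, recovery_resources, repurposable):
        # All hardware error recovery resources, and the subset that may be
        # powered down or re-purposed while in non-recoverable mode.
        self.recovery_resources = set(recovery_resources)
        self.repurposable = set(repurposable) & self.recovery_resources
        self.mode = Mode.RECOVERABLE
        self.active_for_recovery = set(self.recovery_resources)
        self.checkpoints = 0

    def create_checkpoint(self):
        # Creating a checkpoint triggers the switch to non-recoverable mode:
        # the repurposable subset stops serving error recovery.
        self.checkpoints += 1
        self.mode = Mode.NON_RECOVERABLE
        self.active_for_recovery = self.recovery_resources - self.repurposable

    def require_new_checkpoint(self):
        # A new checkpoint is needed, so recovery resources are restored
        # before it is taken.
        self.mode = Mode.RECOVERABLE
        self.active_for_recovery = set(self.recovery_resources)
```

For example, with resources `{"checker", "shadow_regs", "ecc"}` and a repurposable subset `{"checker", "shadow_regs"}`, only `"ecc"` keeps serving recovery after `create_checkpoint()`, and all three do again after `require_new_checkpoint()`.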


    Reconfigurable recovery modes in high availability processors
    4.
    Granted patent (status: in force)

    Publication No.: US08954797B2

    Publication date: 2015-02-10

    Application No.: US13447554

    Filing date: 2012-04-16

    Abstract: A computer program product for performing error recovery is configured to perform a method that includes creating, by a processor, a recovery checkpoint. The processor is dynamically switched into a non-recoverable processing mode of operation based on creating the software recovery checkpoint. The non-recoverable processing mode of operation is a mode in which a subset of hardware error recovery resources are powered-down or re-purposed for instruction processing. It is determined, during the non-recoverable processing mode of operation, that a new software recovery checkpoint is required. Based on the determining that a new software recovery checkpoint is required, the processor is dynamically switched into a recoverable processing mode of operation. The recoverable processing mode of operation is a mode in which hardware error recovery resources, including at least one of the hardware error recovery resources in the subset, are purposed for hardware error recovery operations.


    High availability data storage systems and methods
    5.
    Granted patent (status: in force)

    Publication No.: US08707085B2

    Publication date: 2014-04-22

    Application No.: US13173681

    Filing date: 2011-06-30

    CPC classification number: G06F11/1423 G06F11/1428 G06F11/2005 G06F11/2007

    Abstract: Provided are systems and methods for accessing a storage device from a node when a local connection failure occurs between the node and the storage device. A failure is determined to have occurred at a first node access path between a first node and a storage device that prevents an application at the first node from accessing the storage device from the first node access path. An access request is sent from the first node to a second node. The second node has a second node access path to the storage device. A determination is made that the second node can communicate with the storage device. The storage device is accessed by an application at the first node via the second node access path.
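The failover described in this abstract can be sketched as follows. This is a minimal model under stated assumptions: the classes `StorageDevice` and `Node`, the `path_ok` flag, and the `peers` argument are invented for illustration, and the peer check is reduced to a boolean rather than a real connectivity probe.

```python
class StorageDevice:
    """Stand-in for the shared storage device (hypothetical)."""

    def __init__(self):
        self.blocks = {}

    def read(self, key):
        return self.blocks.get(key)


class Node:
    """A node with its own access path to the shared device (hypothetical)."""

    def __init__(self, name, device, path_ok=True):
        self.name = name
        self.device = device
        self.path_ok = path_ok  # state of this node's own access path

    def can_reach_device(self):
        return self.path_ok

    def read(self, key, peers=()):
        # Normal case: use the local access path.
        if self.path_ok:
            return self.device.read(key)
        # Local path failed: send the request to a peer and use the first
        # peer whose own access path can reach the device.
        for peer in peers:
            if peer.can_reach_device():
                return peer.device.read(key)
        raise ConnectionError(f"{self.name}: no access path to storage device")
```

The application on the first node calls `read` the same way in both cases; only the path taken to the device changes.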


    MEMORY STORAGE DEVICE AND REPAIRING METHOD THEREOF
    7.
    Patent application (status: in force)

    Publication No.: US20140013029A1

    Publication date: 2014-01-09

    Application No.: US13632161

    Filing date: 2012-10-01

    CPC classification number: G06F11/1666 G06F11/1428

    Abstract: A memory storage device and a repairing method thereof are provided. The memory storage device has a rewritable non-volatile memory module having multiple physical units. The physical units include at least one backup physical unit which is configured to be accessed only by a specific command set and stored with at least one customized data. The method includes receiving a specific read command from a host system for reading the backup physical unit and transmitting the customized data therein to the host system when the memory storage device is capable of receiving and processing commands from the host system, where the specific read command belongs to the specific command set; and writing the customized data from the host system into a corresponding physical unit to restore the memory storage device to a factory setting when receiving the writing command from the host system for writing the customized data.


    RECONFIGURABLE RECOVERY MODES IN HIGH AVAILABILITY PROCESSORS
    8.
    Patent application (status: in force)

    Publication No.: US20130275801A1

    Publication date: 2013-10-17

    Application No.: US13447554

    Filing date: 2012-04-16

    Abstract: A computer program product for performing error recovery is configured to perform a method that includes creating, by a processor, a recovery checkpoint. The processor is dynamically switched into a non-recoverable processing mode of operation based on creating the software recovery checkpoint. The non-recoverable processing mode of operation is a mode in which a subset of hardware error recovery resources are powered-down or re-purposed for instruction processing. It is determined, during the non-recoverable processing mode of operation, that a new software recovery checkpoint is required. Based on the determining that a new software recovery checkpoint is required, the processor is dynamically switched into a recoverable processing mode of operation. The recoverable processing mode of operation is a mode in which hardware error recovery resources, including at least one of the hardware error recovery resources in the subset, are purposed for hardware error recovery operations.


    Proxy Responder for Handling Anomalies in a Hardware System
    9.
    Patent application (status: in force)

    Publication No.: US20130262918A1

    Publication date: 2013-10-03

    Application No.: US13435613

    Filing date: 2012-03-30

    CPC classification number: G06F11/0757 G06F11/079 G06F11/1428

    Abstract: An apparatus for handling anomalies in a hardware system including a master device and at least one slave device coupled with the master device through an interconnect device is provided. The apparatus includes at least one controller operative to receive status information relating to the slave device. The status information is indicative of whether an anomaly is present in the slave device and/or the interconnect device. The controller is operative to generate output response information as a function of the status information relating to the slave device for detecting and/or responding to hardware system anomalies in a manner which reduces a need for resetting the hardware system to return to normal operation.
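One way to read the proxy-responder idea is that a controller answers on behalf of a faulty slave so that the master is not left waiting. The sketch below is only a software analogy of that behavior: the function `respond`, the status string `"ok"`, and the shape of the reply dictionary are assumptions for illustration, not details from the abstract.

```python
def respond(request, slave_status, forward_to_slave):
    """Return the slave's reply, or a proxy reply if an anomaly is reported.

    Hypothetical helper: `slave_status` models the status information the
    controller receives, and `forward_to_slave` models the normal data path.
    """
    if slave_status == "ok":
        return forward_to_slave(request)
    # Anomaly reported: answer in place of the slave so the master gets a
    # well-formed error response instead of hanging until a system reset.
    return {
        "request": request,
        "status": "error",
        "reason": slave_status,
        "proxied": True,
    }
```

Because the master always receives some response, it can handle the error locally, which is the sense in which the need for a full reset is reduced.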


    Methods and Systems for Repairing Memory
    10.
    Patent application (status: in force)

    Publication No.: US20130151890A1

    Publication date: 2013-06-13

    Application No.: US13313898

    Filing date: 2011-12-07

    CPC classification number: G06F11/1428

    Abstract: In accordance with embodiments of the present disclosure, a method may comprise identifying one or more portions of the memory having defects. The method may also include storing one or more addresses in the memory defect list, each of the one or more addresses associated with a portion of the one or more identified portions. The method may further include indicating to components of an information handling system that the one or more identified portions are unusable such that the other components are prevented from allocating and using the one or more identified portions.
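The defect-list mechanism in this abstract can be illustrated with a small allocator that refuses to hand out pages recorded as defective. The sketch is a simplification under stated assumptions: the class `MemoryMap`, page-granular addresses, and the method names are hypothetical, and real systems would typically report unusable regions through firmware memory maps rather than a Python set.

```python
class MemoryMap:
    """Toy page allocator that honours a memory defect list (hypothetical)."""

    def __init__(self, num_pages):
        self.num_pages = num_pages
        self.defect_list = set()  # addresses of pages identified as defective
        self.allocated = set()

    def mark_defective(self, page):
        # Store the address of a defective portion in the defect list.
        if not 0 <= page < self.num_pages:
            raise ValueError(f"page {page} out of range")
        self.defect_list.add(page)

    def usable_pages(self):
        # What other components are told they may use.
        return [p for p in range(self.num_pages) if p not in self.defect_list]

    def allocate(self):
        # Hand out the lowest free page that is not on the defect list.
        for page in self.usable_pages():
            if page not in self.allocated:
                self.allocated.add(page)
                return page
        raise MemoryError("no usable pages left")
```

Keeping the defect list separate from the allocation state means a page can be marked defective at any time without disturbing existing bookkeeping.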

