Management of RAID error recovery procedures and configuration

    公开(公告)号:GB2504956A

    公开(公告)日:2014-02-19

    申请号:GB201214498

    申请日:2012-08-14

    Applicant: IBM

    Abstract: A resource system, e.g. a Redundant Arrays of Independent Disks (RAID) system, comprises a plurality of resource elements (14), e.g. the drives making up the array, and a resource controller (10) connected to the resource elements and operating the resource elements according to a predefined set of operational goals, e.g. operation goals of the RAID level being implemented including RAID availability, redundancy, performance or error recovery procedure duration. A method of operating the resource system comprises the steps of identifying error recovery procedures that could be executed by the resource elements, categorising each identified error recovery procedure in relation to the predefined set of operational goals, detecting that an error recovery procedure is to be performed on a specific resource clement, deploying one or more actions in relation to the resource elements according to the categorisation of the detected error recovery procedure (e.g. scheduling error recovery procedures or swapping data to a different resource element), and performing the detected error recovery procedure on the specific resource element.

    Method and apparatus for testing a storage system

    公开(公告)号:GB2513333A

    公开(公告)日:2014-10-29

    申请号:GB201307301

    申请日:2013-04-23

    Applicant: IBM

    Abstract: A method and a system for testing a storage system to which is applied a command or a sequence of commands. The storage system has a storage medium 2 and a controller 3, and each command results in an outcome (which may be for example success or failure). The method comprises: storing in a dataset 14 information related to the command 17 and/or the sequence of commands including for each command 8 an address 156 of the storage system the command 8 is applied to, and an outcome of the command. When a sequence of commands is applied, the information stored in the dataset 14 includes an outcome of the sequence of commands. This method further comprises selecting one or more commands from the dataset to be subsequently replayed when the outcome of the at least one command indicates an error. The outcome of the at least one command may be a fault which may be an error on the storage medium address or an error internal to the storage medium or a faulty behavior of the storage system. Alternatively the fault may be an error which occurred during transportation of data between the controller and the storage medium and in particular where the delay between command and response is longer or excessive as compared to a predetermined delay.

Patent Agency Ranking