Invention Grant
- Patent Title: Node failure source detection in distributed computing environments using machine learning
-
Application No.: US17977010Application Date: 2022-10-31
-
Publication No.: US12013757B2Publication Date: 2024-06-18
- Inventor: Or Raz
- Applicant: RED HAT, INC.
- Applicant Address: US NC Raleigh
- Assignee: Red Hat, Inc.
- Current Assignee: Red Hat, Inc.
- Current Assignee Address: US NC Raleigh
- Agency: Kilpatrick Townsend & Stockton LLP
- Main IPC: G06F11/14
- IPC: G06F11/14 ; G06N20/00

Abstract:
Sources of node failures in distributed computing environments can be determined using machine learning according to some aspects described herein. For example, prior to rebooting a node in a distributed computing environment, a computing system can execute a software agent to detect a failure with respect to the node. In response to detecting the failure, the computing system can input characteristics for the node into a trained machine learning model. The computing system can receive a source of the failure with respect to the node. The computing system can then automatically execute a recovery operation for the node based on the source of the failure.
Public/Granted literature
- US20240143446A1 NODE FAILURE SOURCE DETECTION IN DISTRIBUTED COMPUTING ENVIRONMENTS USING MACHINE LEARNING Public/Granted day:2024-05-02
Information query