Invention Publication
- Patent Title: NODE FAILURE SOURCE DETECTION IN DISTRIBUTED COMPUTING ENVIRONMENTS USING MACHINE LEARNING
- Application No.: US17977010
- Application Date: 2022-10-31
- Publication No.: US20240143446A1
- Publication Date: 2024-05-02
- Inventor: Or Raz
- Applicant: RED HAT, INC.
- Applicant Address: US NC Raleigh
- Assignee: RED HAT, INC.
- Current Assignee: RED HAT, INC.
- Current Assignee Address: US NC Raleigh
- Main IPC: G06F11/14
- IPC: G06F11/14 ; G06N20/00

Abstract:
Sources of node failures in distributed computing environments can be determined using machine learning according to some aspects described herein. For example, prior to rebooting a node in a distributed computing environment, a computing system can execute a software agent to detect a failure with respect to the node. In response to detecting the failure, the computing system can input characteristics for the node into a trained machine learning model. The computing system can receive a source of the failure with respect to the node. The computing system can then automatically execute a recovery operation for the node based on the source of the failure.
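The flow in the abstract (detect a failure, pass node characteristics to a trained model, obtain a failure source, run a matching recovery operation) can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration and does not come from the patent: the three node characteristics, the failure-source labels, the recovery actions, and the nearest-centroid classifier that stands in for the unspecified "trained machine learning model".

```python
from typing import Callable, Dict, List, Optional, Sequence, Tuple

# Hypothetical node characteristics (not specified in the abstract):
# (cpu_load, memory_used_fraction, packet_loss_fraction)
Features = Tuple[float, ...]


class NearestCentroidModel:
    """Toy stand-in for the trained model; the patent does not name a model type."""

    def __init__(self) -> None:
        self.centroids: Dict[str, Features] = {}

    def fit(self, samples: Sequence[Features], labels: Sequence[str]) -> None:
        # Group training rows by label, then average each feature column.
        grouped: Dict[str, List[Features]] = {}
        for row, label in zip(samples, labels):
            grouped.setdefault(label, []).append(row)
        for label, rows in grouped.items():
            self.centroids[label] = tuple(sum(col) / len(rows) for col in zip(*rows))

    def predict(self, features: Features) -> str:
        # Return the label whose centroid is closest (squared Euclidean distance).
        def distance(centroid: Features) -> float:
            return sum((a - b) ** 2 for a, b in zip(features, centroid))

        return min(self.centroids, key=lambda label: distance(self.centroids[label]))


# Hypothetical mapping from failure source to recovery operation.
RECOVERY_OPERATIONS: Dict[str, Callable[[str], str]] = {
    "cpu": lambda node: f"throttle workloads on {node}",
    "memory": lambda node: f"restart memory-heavy workloads on {node}",
    "network": lambda node: f"reset network interface on {node}",
}


def build_demo_model() -> NearestCentroidModel:
    """Train the toy model on a few made-up labeled examples."""
    samples = [
        (0.95, 0.40, 0.00), (0.90, 0.50, 0.01),  # cpu-related failures
        (0.30, 0.97, 0.00), (0.20, 0.92, 0.02),  # memory-related failures
        (0.20, 0.30, 0.60), (0.10, 0.40, 0.75),  # network-related failures
    ]
    labels = ["cpu", "cpu", "memory", "memory", "network", "network"]
    model = NearestCentroidModel()
    model.fit(samples, labels)
    return model


def handle_node(node: str, failure_detected: bool, features: Features,
                model: NearestCentroidModel) -> Optional[str]:
    """If the agent reported a failure, classify its source and pick a recovery step."""
    if not failure_detected:
        return None  # nothing to recover
    source = model.predict(features)
    return RECOVERY_OPERATIONS[source](node)
```

In this sketch, `handle_node` would be called with the agent's failure signal before any reboot is attempted, so that a targeted recovery operation can be tried first; how the actual system orders these steps beyond what the abstract states is not covered here.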
Public/Granted literature
- US12013757B2: Node failure source detection in distributed computing environments using machine learning (Public/Granted day: 2024-06-18)