Invention Grant
- Patent Title: Clustering and labeling streamed data
-
Application No.: US15493036Application Date: 2017-04-20
-
Publication No.: US10698926B2Publication Date: 2020-06-30
- Inventor: Shandan Zhou , Karthikeyan Subramanian , Murtaza Muidul Huda Chowdhury , Gowtham Natarajan
- Applicant: Microsoft Technology Licesning, LLC
- Applicant Address: US WA Redmond
- Assignee: Microsoft Technology Licensing, LLC
- Current Assignee: Microsoft Technology Licensing, LLC
- Current Assignee Address: US WA Redmond
- Agency: Ray Quinney & Nebeker, P.C.
- Agent Thomas M. Hardman; Brandon K. Roper
- Main IPC: G06F16/28
- IPC: G06F16/28 ; G06F11/07 ; G06F16/2455

Abstract:
Aspects extend to methods, systems, and computer program products for clustering streamed or batch data. Aspects of the invention include dynamic clustering and labeling of streamed data and/or batch data, including failures and error logs (user, platform, etc.), latency logs, warning logs, information logs, Virtual Machine (VM) creation data logs, template logs, etc., for use in analysis (e.g., error log analysis). A clustering system can learn from previously identified patterns and use that information to group newer information dynamically as it gets generated. The clustering system can leverage streamed data and/or batch data domain knowledge for preprocessing. In one aspect, a clustering system uses a similarity measure. Based on (e.g., users' configuration of) a similarity threshold, the cluster system (e.g., automatically) assigns/clusters streamed data and/or batch data into groups.
Public/Granted literature
- US20180307740A1 CLUSTERING AND LABELING STREAMED DATA Public/Granted day:2018-10-25
Information query