Field Extraction Rules from Clustered Data Samples

    公开(公告)号:US20170286525A1

    公开(公告)日:2017-10-05

    申请号:US15143563

    申请日:2016-04-30

    Applicant: Splunk Inc.

    CPC classification number: G06F16/287 G06F16/2477

    Abstract: The operation of an automatic data input and query system is controlled by well-defined control data. Certain control data may relate to data schemas and direct operations performed by the system to extract fields from machine data. Automatic methods may determine proper field extraction control information by analyzing a sample of data from a source, breaking the sample data into event segments, classifying the segments into groups based on a measure of similarity, determining an operable extraction rule for each group, and storing the resulting extraction model. Data patterns known by the system can be leveraged to perform the event breaking and field identification for the classifying. Embodiments may provide a user interface to view, interact with, and approve the computer-generated extraction model.

Patent Agency Ranking