DATABASE WORKLOAD ANALYSIS AND OPTIMIZATION VISUALIZATIONS

    公开(公告)号:US20170132296A1

    公开(公告)日:2017-05-11

    申请号:US15345375

    申请日:2016-11-07

    Applicant: Cloudera, Inc.

    Inventor: Yihua Ding

    CPC classification number: G06F17/30554

    Abstract: Techniques are described for analyzing usage of data stored in a data storage system without accessing the stored data. In some embodiments, workload data indicative of queries executed at the data storage system on stored data is received. This workload data can include query logs generated during execution of the queries. The workload data is processed to identify data elements such as tables, columns, and views associated with the stored data as well as information regarding usage of the identified data elements. Usage can include operations performed on the data elements during execution of the queries. Based on this processing relationships between the identified data elements can be inferred and visualizations generated that convey information regarding usage of the data stored at the data storage system. Visualizations can include, among others, usage heatmap diagrams, join diagrams, column family diagrams, filter diagrams, view lineage diagrams, data flow diagrams, denormalization diagrams, and workload distribution diagrams.

    Database workload analysis and optimization visualizations

    公开(公告)号:US10255335B2

    公开(公告)日:2019-04-09

    申请号:US15345375

    申请日:2016-11-07

    Applicant: Cloudera, Inc.

    Inventor: Yihua Ding

    Abstract: Techniques are described for analyzing usage of data stored in a data storage system without accessing the stored data. In some embodiments, workload data indicative of queries executed at the data storage system on stored data is received. This workload data can include query logs generated during execution of the queries. The workload data is processed to identify data elements such as tables, columns, and views associated with the stored data as well as information regarding usage of the identified data elements. Usage can include operations performed on the data elements during execution of the queries. Based on this processing relationships between the identified data elements can be inferred and visualizations generated that convey information regarding usage of the data stored at the data storage system. Visualizations can include, among others, usage heatmap diagrams, join diagrams, column family diagrams, filter diagrams, view lineage diagrams, data flow diagrams, denormalization diagrams, and workload distribution diagrams.

Patent Agency Ranking