Invention Grant
- Patent Title: Data profiling of large datasets
-
Application No.: US15294537Application Date: 2016-10-14
-
Publication No.: US10346421B1Publication Date: 2019-07-09
- Inventor: Jeffrey Heer , Lars Grammel , Sean Philip Kandel , Philip John Vander Broek
- Applicant: Trifacta Inc.
- Applicant Address: US CA San Francisco
- Assignee: Trifacta Inc.
- Current Assignee: Trifacta Inc.
- Current Assignee Address: US CA San Francisco
- Agency: Fenwick & West LLP
- Main IPC: G06F7/00
- IPC: G06F7/00 ; G06F17/30 ; G06F16/26 ; G06F16/25

Abstract:
A system provides data profile information describing attributes of a dataset. The system determines relative frequency of occurrences of attribute values with respect to a set of bins from a histogram of another attribute. The system presents a user interface that presents statistical information describing attributes of a dataset based on the relative frequency of occurrences of attribute values. The system generates a transformation script based on the user interactions for transforming records of the dataset. The transformation script is configured to preprocess data of the dataset for further analysis.
Public/Granted literature
- US2181346A Musical instrument Public/Granted day:1939-11-28
Information query