- Patent Title: Determining a degree of similarity of a subset of tabular data arrangements to subsets of graph data arrangements at ingestion into a data-driven collaborative dataset platform
- Application No.: US17365214
- Application Date: 2021-07-01
- Publication No.: US12292870B2
- Publication Date: 2025-05-06
- Inventor: David Lee Griffith
- Applicant: data.world, Inc.
- Applicant Address: US TX Austin
- Assignee: data.world, Inc.
- Current Assignee: data.world, Inc.
- Current Assignee Address: US TX Austin
- Agency: KOKKA & BACKUS, PC
- Main IPC: G06F16/2455
- IPC: G06F16/2455 ; G06F16/22 ; G06F16/2457 ; G06F16/25 ; G06F16/28 ; G06F16/901

Abstract:
Various techniques are described, including: evaluating ingested data that includes a dataset to identify one or more links to other datasets stored in a graph; using a similarity determination algorithm to identify a degree of similarity between datasets, and thereby the joinability of ingested datasets with graph-stored datasets; determining a ratio that decides whether to perform an overlap function or a coverage function; associating a subset of similarity matrices with a subset of graph data joined to the ingested dataset; and forming links in a column of data between the dataset and another dataset of the ingested data based on the degree of similarity.
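
The ratio-based choice between an overlap function and a coverage function can be illustrated with a minimal sketch. The abstract does not define either function, the ratio, or any threshold, so everything below is an assumption made for illustration: "overlap" is taken to be the Jaccard index of the two columns' distinct values, "coverage" is taken to be the fraction of the ingested column's distinct values found in the graph-stored column, the ratio is the smaller distinct-value count divided by the larger, and `ratio_threshold` is a hypothetical parameter. It is not the patented method.

```python
from typing import Hashable, Iterable, Tuple


def overlap(a: set, b: set) -> float:
    """Assumed 'overlap' function: Jaccard index of two sets of column values."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def coverage(a: set, b: set) -> float:
    """Assumed 'coverage' function: share of a's distinct values also present in b."""
    return len(a & b) / len(a) if a else 0.0


def similarity(
    ingested: Iterable[Hashable],
    stored: Iterable[Hashable],
    ratio_threshold: float = 0.5,  # hypothetical cut-off, not from the source
) -> Tuple[str, float]:
    """Return which function was applied and the resulting degree of similarity."""
    a, b = set(ingested), set(stored)
    if not a or not b:
        return ("overlap", 0.0)
    # Assumed ratio: how comparable the two columns are in distinct-value count.
    ratio = min(len(a), len(b)) / max(len(a), len(b))
    if ratio >= ratio_threshold:
        # Similar sizes: a symmetric measure is meaningful.
        return ("overlap", overlap(a, b))
    # Very different sizes: ask how much of the ingested column is covered.
    return ("coverage", coverage(a, b))


if __name__ == "__main__":
    states = ["TX", "CA", "NY", "WA", "OR", "NV", "AZ", "UT", "CO", "NM"]
    print(similarity(["TX", "CA", "NY"], ["TX", "CA", "NY", "WA"]))
    print(similarity(["TX", "CA"], states))
```

Under these assumptions, a high score for a column pair would mark the two columns as candidates for a join link; a symmetric measure alone would understate similarity when a small ingested column is fully contained in a much larger stored one, which is the motivation for switching functions on the ratio.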