Invention Grant
- Patent Title: Matching subsets of tabular data arrangements to subsets of graphical data arrangements at ingestion into data-driven collaborative datasets
- Application No.: US17004570
- Application Date: 2020-08-27
- Publication No.: US11669540B2
- Publication Date: 2023-06-06
- Inventor: David Lee Griffith
- Applicant: data.world, Inc.
- Applicant Address: US TX Austin
- Assignee: data.world, Inc.
- Current Assignee: data.world, Inc.
- Current Assignee Address: US TX Austin
- Agency: Kokka & Backus, PC
- Main IPC: G06F7/00
- IPC: G06F7/00 ; G06F16/25 ; G06F16/901 ; G06F16/22

Abstract:
Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to interface among repositories of disparate datasets and computing machine-based entities configured to access datasets, and, more specifically, to a computing and data storage platform to identify and match equivalent subsets of data between an ingested dataset, such as in a tabular data arrangement, and one or more graph-based data arrangements, according to at least some examples. For example, a method may include identifying a tabular data arrangement including a subset of data as a column, computing a compressed data representation for the column of data, correlating the compressed data representation to reference compressed data representations, detecting a link between the column of data associated with a correlated compressed data representation and a dataset stored in a graph data arrangement, and forming an expanded tabular data arrangement.
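
The steps named in the abstract (compress a column, correlate it against reference representations, detect a link, expand the table) can be illustrated with a minimal sketch. The abstract does not say which compressed representation is used, so this example assumes a MinHash signature purely for illustration. All function names, the signature length, the similarity threshold, and the dictionary standing in for graph-stored data are hypothetical and are not taken from the patent.

```python
import hashlib

NUM_HASHES = 64  # assumed signature length; not specified by the source


def _hash(value: str, seed: int) -> int:
    """Seeded 64-bit hash built from SHA-256 (an illustrative choice)."""
    digest = hashlib.sha256(f"{seed}:{value}".encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big")


def minhash_signature(column):
    """Compress a column's distinct values into a fixed-length signature."""
    values = {str(v) for v in column}
    if not values:
        raise ValueError("column has no values")
    return tuple(min(_hash(v, seed) for v in values) for seed in range(NUM_HASHES))


def estimated_similarity(sig_a, sig_b):
    """Fraction of matching slots, which estimates Jaccard similarity."""
    return sum(a == b for a, b in zip(sig_a, sig_b)) / len(sig_a)


def match_column(column, reference_signatures, threshold=0.5):
    """Return the name of the best-matching reference, or None below threshold."""
    sig = minhash_signature(column)
    best_name, best_score = None, 0.0
    for name, ref_sig in reference_signatures.items():
        score = estimated_similarity(sig, ref_sig)
        if score > best_score:
            best_name, best_score = name, score
    return best_name if best_score >= threshold else None


def expand_table(rows, key_column, linked_attributes, new_column):
    """Add a column whose values come from the linked (graph-side) data."""
    return [
        dict(row, **{new_column: linked_attributes.get(row[key_column])})
        for row in rows
    ]


# Hypothetical reference signatures computed from data already held in the graph.
reference = {
    "state_codes": minhash_signature(["TX", "CA", "NY", "WA"]),
    "colors": minhash_signature(["red", "green", "blue"]),
}

# An ingested table whose "state" column should link to "state_codes".
rows = [{"state": "TX"}, {"state": "CA"}, {"state": "NY"}, {"state": "WA"}]
link = match_column([r["state"] for r in rows], reference)

# Hypothetical attributes reachable through the detected link.
state_names = {"TX": "Texas", "CA": "California", "NY": "New York", "WA": "Washington"}
expanded = expand_table(rows, "state", state_names, "state_name")
```

Comparing fixed-length signatures rather than raw columns keeps the cost of each comparison independent of column size, which is one plausible reason to use a compressed representation at ingestion time. A production system would also need to handle value normalization and partial overlaps, which this sketch ignores.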