System, computer program product and method for generating embeddings of textual and quantitative data

Invention Grant

US10726058B2 System, computer program product and method for generating embeddings of textual and quantitative data 有权

Please log in to see more content

Patent Title: System, computer program product and method for generating embeddings of textual and quantitative data
Application No.: US16050790

Application Date: 2018-07-31
Publication No.: US10726058B2

Publication Date: 2020-07-28
Inventor: Robert Cary Sparrow
Applicant: Market Advantage, Inc.
Applicant Address: US MN Minneapolis
Assignee: MARKET ADVANTAGE, INC.
Current Assignee: MARKET ADVANTAGE, INC.
Current Assignee Address: US MN Minneapolis
Agency: Xsensus LLP
Main IPC: G06F16/33
IPC: G06F16/33 ; G06K9/62 ; G06N3/04 ; G06F16/35 ; G06F40/242

System, computer program product and method for generating embeddings of textual and quantitative data

Abstract:

A method, computer program product and computer system is disclosed that generates a set of distributed representation vectors from a dataset of textual and non-text data. In one method, a computer system receives a dataset, cleans the received dataset, parses the cleaned dataset to identify known classes of data, extracts data elements from the dataset based on the known classes of data, organizes the extracted data elements into one or more records, compiles a dictionary of unique data elements and associated codes from the one or more records, creates a set of training pairs using permutations of the codes that correspond to data elements within each record, and computes a distributed representation vector for each of the data elements in the dictionary using the set of training pairs.

Public/Granted literature

US20200042611A1 SYSTEM, COMPUTER PROGRAM PRODUCT AND METHOD FOR GENERATING EMBEDDINGS OF TEXTUAL AND QUANTITATIVE DATA Public/Granted day:2020-02-06

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06F	电数字数据处理（基于特定计算模型的计算机系统入G06N）
G06F16/00	信息检索；数据库结构；文件系统结构
G06F16/30	.•非结构文本数据（文档管理系统入G06F 16/93）
G06F16/33	..••查询