-
公开(公告)号:US11797487B2
公开(公告)日:2023-10-24
申请号:US17715204
申请日:2022-04-07
Applicant: AMPERITY, INC.
Inventor: Stephen Meyles , Yan Yan , Dan Suciu , Michael P. Fikes
IPC: G06F7/02 , G06F16/00 , G06F16/174 , G06F16/28 , G06F16/22 , G06F40/197 , G06F17/16
CPC classification number: G06F16/1748 , G06F16/2272 , G06F16/285 , G06F40/197 , G06F16/288 , G06F17/16
Abstract: The present disclosure relates to optimizing one or more database tables that may include one or more redundant records. Records are clustered and assigned stable identifiers. In this manner, the underlying records within a cluster are not removed or deleted. As updates to the database are made, new clustering analyses are performed using the underlying records and any updates made. Newly identified clusters are reassigned stable identifiers.
-
公开(公告)号:US11301426B1
公开(公告)日:2022-04-12
申请号:US16675789
申请日:2019-11-06
Applicant: Amperity, Inc.
Inventor: Stephen Meyles , Yan Yan , Dan Suciu , Michael P. Fikes
IPC: G06F7/02 , G06F16/00 , G06F16/174 , G06F16/22 , G06F16/28 , G06F40/197 , G06F17/16
Abstract: The present disclosure relates to optimizing one or more database tables that may include one or more redundant records. Records are clustered and assigned stable identifiers. In this manner, the underlying records within a cluster are not removed or deleted. As updates to the database are made, new clustering analyses are performed using the underlying records and any updates made. Newly identified clusters are reassigned stable identifiers.
-
公开(公告)号:US11308130B1
公开(公告)日:2022-04-19
申请号:US16678841
申请日:2019-11-08
Applicant: Amperity, Inc.
Inventor: Yan Yan , Stephen Meyles , Mona Akmal , Michael P. Fikes
Abstract: The present disclosure relates to evaluating whether two data records reflect the same entity using a classifier in the absence of ground truth. Without ground truth, it is difficult to determine the precision or recall of a classifier. The present disclosure generates a list comprising a series of unique feature signatures and a set of sample record pairs for each unique feature signature. In some embodiments, users may provide labels for the set of sample record pairs for each unique feature signature.
-
公开(公告)号:US10509809B1
公开(公告)日:2019-12-17
申请号:US15729960
申请日:2017-10-11
Applicant: Amperity, Inc.
Inventor: Yan Yan , Stephen Meyles , Mona Akmal , Michael P. Fikes
Abstract: The present disclosure relates to evaluating whether two data records reflect the same entity using a classifier in the absence of ground truth. Without ground truth, it is difficult to determine the precision or recall of a classifier. The present disclosure generates output data comprising a list of unique signatures generated from a set of records that are compared with each other. The output data may also comprise corresponding record pairs limited to a predetermined sample size for each unique feature signature.
-
-
-