Merging database tables by classifying comparison signatures

    Publication No.: US11442694B1

    Publication date: 2022-09-13

    Application No.: US16787576

    Filing date: 2020-02-11

    Applicant: Amperity, Inc.

    Abstract: The present disclosure relates to merging database tables. Systems and methods may involve performing a comparison between a first set of records and a second set of records and identifying a plurality of record pairs based on the comparison. Each record pair may comprise a record in the first set of records and a record in the second set of records. In addition, a feature signature may be generated for each record pair by comparing field values in each record pair. The feature signature may be classified to identify at least one related record pair. A merged database table may be generated such that it comprises the at least one related record pair and a set of unique records selected from the first set of records and the second set of records.
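
The flow described in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the field names in `FIELDS`, the binary match/mismatch signature, and the threshold rule in `is_related` are all assumptions chosen for illustration, since the abstract does not specify how signatures are encoded or classified.

```python
from itertools import product

# Hypothetical field names; the abstract does not name any fields.
FIELDS = ("name", "email", "zip")


def feature_signature(a, b):
    """Compare two records field by field: 1 = equal and present, 0 = otherwise."""
    return tuple(
        int(a.get(f) is not None and a.get(f) == b.get(f)) for f in FIELDS
    )


def is_related(signature, min_matches=2):
    """Toy classifier (an assumption): related if enough fields agree."""
    return sum(signature) >= min_matches


def merge_tables(first, second):
    """Return (related record pairs, records that matched nothing)."""
    related, matched_first, matched_second = [], set(), set()
    # Compare every record in the first set with every record in the second.
    for (i, a), (j, b) in product(enumerate(first), enumerate(second)):
        if is_related(feature_signature(a, b)):
            related.append((a, b))
            matched_first.add(i)
            matched_second.add(j)
    # Unique records are those that appear in no related pair.
    unique = [r for i, r in enumerate(first) if i not in matched_first]
    unique += [r for j, r in enumerate(second) if j not in matched_second]
    return related, unique
```

A real system would likely replace the exhaustive `product` with a blocking step and the threshold rule with a trained classifier over signatures; both choices here only keep the sketch short.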

    Effectively fusing database tables

    Publication No.: US10853033B1

    Publication date: 2020-12-01

    Application No.: US15729931

    Filing date: 2017-10-11

    Applicant: Amperity, Inc.

    Abstract: The present disclosure relates to fusing multiple database tables together. The fields of the database tables may be normalized using semantic fields. Under a first approach, database tables are deduplicated by consolidating redundant records. This may be done by performing pairwise comparisons to identify related pairs of records and then clustering the related pairs of records. Then, the deduplicated database tables are merged by performing another pairwise comparison. Under a second approach, the database tables may be concatenated. Thereafter, records are subject to pairwise comparisons and then clustered to create a merged database table.
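
The second approach in this abstract (concatenate, compare pairwise, then cluster) might look roughly like the Python sketch below. The `related` test on a lower-cased `email` field and the use of union-find for clustering are assumptions made for illustration; the abstract does not state which comparison or clustering method is used.

```python
from itertools import combinations


def related(a, b):
    """Toy pairwise test (an assumption): same email, ignoring case."""
    return a["email"].lower() == b["email"].lower()


def cluster(records):
    """Group records by linking every related pair with union-find."""
    parent = list(range(len(records)))

    def find(i):
        # Follow parent links to the root, shortening the path as we go.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i, j in combinations(range(len(records)), 2):
        if related(records[i], records[j]):
            parent[find(i)] = find(j)

    clusters = {}
    for i, rec in enumerate(records):
        clusters.setdefault(find(i), []).append(rec)
    return list(clusters.values())


def fuse_by_concatenation(*tables):
    """Concatenate all tables, then compare and cluster the combined records."""
    combined = [rec for table in tables for rec in table]
    return cluster(combined)
```

Each returned cluster stands for one consolidated record in the merged table. The first approach could reuse `cluster` on each table separately before a final cross-table comparison.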
