Invention Grant
US09576248B2 Record linkage sharing using labeled comparison vectors and a machine learning domain classification trainer (In force)

  • Patent Title: Record linkage sharing using labeled comparison vectors and a machine learning domain classification trainer
  • Application No.: US14203784
    Application Date: 2014-03-11
  • Publication No.: US09576248B2
    Publication Date: 2017-02-21
  • Inventor: Adam M. Hurwitz
  • Applicant: Adam M. Hurwitz
  • Agency: IDP Patent Services
  • Agent: Olav M. Underdal
  • Main IPC: G06F15/18
  • IPC: G06F15/18 G06N99/00 G06F17/30
Record linkage sharing using labeled comparison vectors and a machine learning domain classification trainer
Abstract:
Disclosed herein are a system and method for record linkage that use machine learning to link records, so that many users can contribute training data to a shared repository and use the accumulated training data without any user having to share actual data. The system includes a record linkage server, which in turn includes a record linkage repository, a domain classifier, and a domain classification trainer. The record linkage server is connected to a record linkage client, which includes a field comparator and a manual label prompter. Also disclosed is a method for record linkage that describes how two structured data sets can be matched. The method includes searching domains, loading data sets, loading a domain, matching fields, and iterating record linking over all record pairs. Each iteration includes selecting a record pair, calculating a comparison vector, calculating label probabilities, determining a label, optionally setting the label manually, updating prior probabilities, optionally confirming the selected label, and updating the training data.
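The per-pair loop named in the abstract (comparison vector, label probabilities, label, training-data update) can be illustrated with a minimal Python sketch. The abstract does not say which classifier or comparison function is used, so everything here is an assumption made for illustration: a naive Bayes model with Laplace smoothing stands in for the domain classifier, exact case-insensitive equality stands in for the field comparator, and the names `comparison_vector`, `DomainClassifier`, and the labels "match" and "non-match" are hypothetical. Only labeled comparison vectors are stored, which is how a repository of this kind could be shared without exposing the underlying records.

```python
from collections import Counter

LABELS = ("match", "non-match")  # assumed label set, not taken from the patent


def comparison_vector(rec_a, rec_b, fields):
    """Return 1 for each field where the two records agree, else 0.

    Agreement here is case-insensitive equality after trimming whitespace,
    a stand-in for whatever the field comparator actually computes.
    """
    return tuple(
        int(str(rec_a[f]).strip().lower() == str(rec_b[f]).strip().lower())
        for f in fields
    )


class DomainClassifier:
    """Naive Bayes over binary comparison vectors (illustrative assumption)."""

    def __init__(self, n_fields):
        self.label_counts = Counter()
        # agree_counts[label][i]: training vectors with this label whose
        # i-th component is 1
        self.agree_counts = {label: [0] * n_fields for label in LABELS}

    def update(self, vector, label):
        """Add one labeled comparison vector to the training data."""
        self.label_counts[label] += 1
        for i, v in enumerate(vector):
            self.agree_counts[label][i] += v

    def label_probabilities(self, vector):
        """Return the normalized posterior probability of each label."""
        total = sum(self.label_counts.values())
        scores = {}
        for label in LABELS:
            n = self.label_counts[label]
            p = (n + 1) / (total + len(LABELS))  # smoothed prior
            for i, v in enumerate(vector):
                p_agree = (self.agree_counts[label][i] + 1) / (n + 2)
                p *= p_agree if v else 1 - p_agree
            scores[label] = p
        norm = sum(scores.values())
        return {label: s / norm for label, s in scores.items()}

    def determine_label(self, vector):
        """Pick the label with the highest posterior probability."""
        probs = self.label_probabilities(vector)
        return max(probs, key=probs.get)


# Usage: train on shared labeled vectors, then classify a new record pair.
fields = ["name", "city"]
clf = DomainClassifier(len(fields))
for vec, label in [((1, 1), "match"), ((1, 1), "match"),
                   ((0, 0), "non-match"), ((0, 1), "non-match")]:
    clf.update(vec, label)

a = {"name": "Ann Lee", "city": "Boston"}
b = {"name": "ann lee ", "city": "boston"}
vec = comparison_vector(a, b, fields)
label = clf.determine_label(vec)
clf.update(vec, label)  # a confirmed label feeds back into the training data
```

In this sketch the prior probabilities are recomputed from the label counts on every call, so "updating prior probabilities" and "updating training data" collapse into the single `update` step; a real system might keep them separate.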