Invention Grant
- Patent Title: Methods and systems for matching records and normalizing names
- Patent Title (中): 匹配记录和规范化名称的方法和系统
-
Application No.: US12363057Application Date: 2009-01-30
-
Publication No.: US08190538B2Publication Date: 2012-05-29
- Inventor: Ling Qin Zhang , Mark Wasson , Valentina Templar
- Applicant: Ling Qin Zhang , Mark Wasson , Valentina Templar
- Applicant Address: US OH Miamisburg
- Assignee: LexisNexis Group
- Current Assignee: LexisNexis Group
- Current Assignee Address: US OH Miamisburg
- Agency: Finnegan, Henderson, Farabow, Garrett & Dunner, LLP
- Main IPC: G06F15/18
- IPC: G06F15/18

Abstract:
Methods and systems are provided for normalizing strings and for matching records. In one implementation, a string is tokenized into components. Sequences of tags are generated by assigning tags to the components. A sequence of states is determined based on the sequences of tags. A normalized string is generated by normalizing the sequence of the states. A key record including key fields is extracted from a first data source. A candidate record including candidate fields is extracted from a second data source. A numerical record including numerical fields is computed by comparing the key fields and the candidate fields using comparison functions. Matching functions determined by an additive logistic regression method are applied to the numerical fields. Whether the key record and the candidate record are a match is determined based on a sum of results of the matching functions.
Public/Granted literature
- US20100198756A1 METHODS AND SYSTEMS FOR MATCHING RECORDS AND NORMALIZING NAMES Public/Granted day:2010-08-05
Information query