Invention Grant
- Patent Title: Entity resolution based on character string frequency analysis
-
Application No.: US16506792Application Date: 2019-07-09
-
Publication No.: US11436241B2Publication Date: 2022-09-06
- Inventor: Girish Kunjur
- Applicant: FAIR ISAAC CORPORATION
- Applicant Address: US CA San Jose
- Assignee: FAIR ISAAC CORPORATION
- Current Assignee: FAIR ISAAC CORPORATION
- Current Assignee Address: US CA San Jose
- Agency: Mintz, Levin, Cohn, Ferris, Glovsky and Popeo, P.C.
- Agent F. Jason Far-hadian, Esq.
- Main IPC: G06F16/24
- IPC: G06F16/24 ; G06F40/284 ; G06F16/2458 ; G06F16/2457

Abstract:
Computer-implemented methods, systems and products for character string frequency analysis. The method includes a set of operations or steps, including parsing a plurality of character strings into one or more tokens, categorizing the one or more tokens into one or more token frequency categories, and generating a first similarity score between one or more pairs of character strings of the plurality of character strings. The method further includes calculating one or more degrees of commonality or rarity of the plurality of character strings based on the categorizing, generating one or more penalties for token pairs of the one or more pairs of character strings associated with the first similarity score based on the one or more degrees of commonality or rarity and the categorizing, and generating a second similarity score based the first similarity score and the one or more penalties.
Public/Granted literature
- US20210011909A1 ENTITY RESOLUTION BASED ON CHARACTER STRING FREQUENCY ANALYSIS Public/Granted day:2021-01-14
Information query