Method for reconciling similar data sets
Abstract:
A method synchronizes related data across networked host devices. At each of a first and a second host device, string vectors are created for each document stored within the host device. The respective set of string vectors are encoded using a two-dimensional hash, where a first dimension of the two-dimensional hash stores string vector differences between all elements that reside in a symmetric difference and a second dimension of the two-dimensional hash stores one string vector from the symmetric difference. The respective encoded set of string vectors is transmitted to the other host device, which then decodes the respective encoded set of string vectors received to arrive at the symmetric difference. The host device determines which string vectors it is missing and requests from the other host device the missing documents pertaining to the missing string vectors. The missing documents are received by the requesting host device.
Public/Granted literature
Information query
Patent Agency Ranking
0/0