Invention Grant
- Patent Title: System and method for comparing data sets
- Patent Title (中): 用于比较数据集的系统和方法
- Application No.: US10776057
- Application Date: 2004-02-11
- Publication No.: US07921110B1
- Publication Date: 2011-04-05
- Inventor: Daniel Ting, Stephen L. Manley, Rimas Svarcas
- Applicant: Daniel Ting, Stephen L. Manley, Rimas Svarcas
- Applicant Address: US CA Sunnyvale
- Assignee: NetApp, Inc.
- Current Assignee: NetApp, Inc.
- Current Assignee Address: US CA Sunnyvale
- Agency: Cesari and McKenna, LLP
- Main IPC: G06F7/00
- IPC: G06F7/00; G06F17/30

Abstract:
The present invention provides a system and method for comparing data sets to ensure that they are accurate reflections of each other, without the need to perform O(N²) operations, where N is the size of each data set. A hash table is generated for the first data set. Each entry of the second data set is then looked up in the hash table: if the entry is not found, it is unique to the second data set; otherwise, the entry is removed from the hash table. At the end of the pass through the second data set, the entries that remain in the hash table are unique to the first data set. Alternatively, two processes operate in parallel, each selecting entries from one of the data sets and determining whether the entry exists in the hash table. If the entry exists, it is removed; otherwise, the entry is added to the hash table.
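
The two approaches described in the abstract can be illustrated with a minimal Python sketch. This is not the patented implementation: the function names `compare_single_pass` and `compare_toggle` are hypothetical, a Python `set`/`dict` stands in for the hash table, the two parallel processes are simulated by interleaving the data sets in a single thread, and each data set is assumed to contain no duplicate entries (for example, unique file paths). Tagging each entry with its source data set in the second function is also an assumption, added so that the leftover entries can be attributed to one side.

```python
from itertools import zip_longest


def compare_single_pass(first, second):
    """Return (first_only, second_only) using a hash table built from `first`."""
    table = set(first)                 # hash table for the first data set
    second_only = []
    for entry in second:
        if entry in table:
            table.remove(entry)        # present in both sets: drop it
        else:
            second_only.append(entry)  # not in the table: unique to second
    # Whatever is left in the table was never matched: unique to first.
    return table, second_only


def compare_toggle(first, second):
    """Interleaved variant: remove an entry if present, otherwise add it."""
    table = {}                         # entry -> data set it came from (assumed tag)
    missing = object()                 # sentinel for the shorter data set
    for a, b in zip_longest(first, second, fillvalue=missing):
        for entry, source in ((a, "first"), (b, "second")):
            if entry is missing:
                continue
            if entry in table:
                del table[entry]       # seen from the other side: matched
            else:
                table[entry] = source  # not seen yet: remember it
    first_only = {e for e, s in table.items() if s == "first"}
    second_only = {e for e, s in table.items() if s == "second"}
    return first_only, second_only


first = ["a", "b", "c", "d"]
second = ["b", "d", "e"]
print(compare_single_pass(first, second))
print(compare_toggle(first, second))
```

Both functions touch each entry once with average constant-time hash lookups, so the work grows roughly linearly with the number of entries rather than quadratically.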