Invention Grant
- Patent Title: Document alignment systems for legacy document conversions
- Patent Title (中): 用于旧文档转换的文档对齐系统
-
Application No.: US11315458Application Date: 2005-12-22
-
Publication No.: US07882119B2Publication Date: 2011-02-01
- Inventor: Andre Bergholz , Boris Chidlovskii
- Applicant: Andre Bergholz , Boris Chidlovskii
- Applicant Address: US CT Norwalk
- Assignee: Xerox Corporation
- Current Assignee: Xerox Corporation
- Current Assignee Address: US CT Norwalk
- Agency: Fay Sharpe LLP
- Main IPC: G06F7/00
- IPC: G06F7/00 ; G06F17/30

Abstract:
A method for aligning documents which may be in different XML formats includes inputting source and target leaves of a source and documents in first and second tree structured formats and assigning a cost to each of a plurality of matches. Each match may include a source leaf and a target leaf or be an unmatched source or target leaf. Matches are identified for which a total cost is minimal, wherein each of the leaves is in at least one of the identified matches. From the identified matches, groups of two or more matches are identified which have a leaf in common. From the groups, probable matches are identified in which more that one target leaf is matched with at least one source leaf or more than one source leaf is matched with a target leaf. An alignment between leaves of the target document and leaves of the source document is output which includes the probable matches.
Public/Granted literature
- US20070150443A1 Document alignment systems for legacy document conversions Public/Granted day:2007-06-28
Information query