Invention Grant
- Patent Title: Apparatus and methods for aligning words in bilingual sentences
- Patent Title (中): 双语句子对齐词的装置和方法
- Application No.: US11137590
- Application Date: 2005-05-26
- Publication No.: US07672830B2
- Publication Date: 2010-03-02
- Inventor: Cyril Goutte, Michel Simard, Kenji Yamada, Eric Gaussier, Arne Mauser
- Applicant: Cyril Goutte, Michel Simard, Kenji Yamada, Eric Gaussier, Arne Mauser
- Applicant Address: US CT Norwalk
- Assignee: Xerox Corporation
- Current Assignee: Xerox Corporation
- Current Assignee Address: US CT Norwalk
- Agency: Fay Sharpe LLP
- Main IPC: G06F17/28
- IPC: G06F17/28; G06F17/27

Abstract:
Methods are disclosed for performing proper word alignment that satisfy constraints of coverage and transitive closure. Initially, a translation matrix which defines word association measures between source and target words of a corpus of bilingual translations of source and target sentences is computed. Subsequently, in a first method, the association measures in the translation matrix are factorized and orthogonalized to produce cepts for the source and target words, which resulting matrix factors may then be, optionally, multiplied to produce an alignment matrix. In a second method, the association measures in the translation matrix are thresholded, and then closed by transitivity, to produce an alignment matrix, which may then be, optionally, factorized to produce cepts. The resulting cepts or alignment matrices may then be used by any number of natural language applications for identifying words that are properly aligned.
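
The second method in the abstract (threshold, then close by transitivity) can be illustrated with a minimal sketch. This is not the patented implementation: the function names `close_alignment` and `extract_cepts`, the toy association matrix, and the threshold of 0.5 are all assumptions made for illustration. The closure is computed here by repeating a boolean product until nothing changes, and cepts are read off as groups of source words that share the same set of aligned target words.

```python
import numpy as np

def close_alignment(M, threshold):
    """Threshold an association matrix, then close it by transitivity.

    M[i, j] is an association measure between source word i and target word j.
    Returns a 0/1 alignment matrix. (Hypothetical helper, not from the patent.)
    """
    A = (np.asarray(M, dtype=float) >= threshold).astype(int)
    while True:
        # (A A^T A)[i, l] > 0 whenever source i reaches target l through a
        # chain i -> j -> k -> l of existing links, so this adds implied links.
        closed = ((A @ A.T @ A) > 0).astype(int)
        if np.array_equal(closed, A):
            return A
        A = closed

def extract_cepts(A):
    """Group words into cepts: after closure, rows are identical or disjoint."""
    cepts, seen = [], set()
    for row in A:
        key = tuple(int(v) for v in row)
        if not any(key) or key in seen:
            continue  # skip unaligned source words and patterns already handled
        seen.add(key)
        sources = [s for s in range(A.shape[0]) if tuple(int(v) for v in A[s]) == key]
        targets = [t for t, v in enumerate(key) if v]
        cepts.append((sources, targets))
    return cepts

# Toy association matrix: 3 source words (rows) by 4 target words (columns).
M = [[0.9, 0.1, 0.0, 0.0],
     [0.6, 0.7, 0.0, 0.0],
     [0.0, 0.0, 0.8, 0.1]]

A = close_alignment(M, threshold=0.5)
cepts = extract_cepts(A)
print(A)
print(cepts)
```

In this toy case, thresholding links source word 0 only to target word 0, while source word 1 links to target words 0 and 1; closure then adds the implied link from source word 0 to target word 1. Note that target word 3 stays unaligned, so this sketch enforces transitive closure but not the coverage constraint the abstract also mentions; handling that would need an additional step.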
Public/Granted literature
- US20060190241A1 Apparatus and methods for aligning words in bilingual sentences; Public/Granted day: 2006-08-24