Invention Grant
US08972244B2 Sampling and optimization in phrase-based machine translation using an enriched language model representation
In force
- Patent Title: Sampling and optimization in phrase-based machine translation using an enriched language model representation
- Application No.: US13750338
- Application Date: 2013-01-25
- Publication No.: US08972244B2
- Publication Date: 2015-03-03
- Inventor: Marc Dymetman, Wilker Ferreira Aziz, Sriram Venkatapathy
- Applicant: Xerox Corporation
- Applicant Address: US CT Norwalk
- Assignee: Xerox Corporation
- Current Assignee: Xerox Corporation
- Current Assignee Address: US CT Norwalk
- Agency: Fay Sharpe LLP
- Main IPC: G06F17/27
- IPC: G06F17/27; G06F17/20; G06F17/28; G10L15/00

Abstract:
Rejection sampling is performed to acquire at least one target language translation for a source language string s in accordance with a phrase-based statistical translation model p(x) = p(t, a|s), where t is a candidate translation, a is a candidate alignment comprising a biphrase sequence generating the candidate translation t, and x is a sequence representing the candidate alignment a. The rejection sampling uses a proposal distribution comprising a weighted finite state automaton (WFSA) q^(n) that is refined responsive to rejection of a sample x* obtained in a current iteration of the rejection sampling to generate a refined WFSA q^(n+1) for use in a next iteration of the rejection sampling. The refined WFSA q^(n+1) is selected to satisfy the criteria p(x) ≤ q^(n+1)(x) ≤ q^(n)(x) for all x ∈ X and q^(n+1)(x*) < q^(n)(x*).
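
The refinement loop described in the abstract can be illustrated with a minimal Python sketch. This is a toy under stated assumptions, not the patented method: the proposal q is a plain lookup table over a tiny hand-made candidate set rather than a WFSA, the scores in `p` are invented for illustration, and the names `refine` and `adaptive_rejection_sample` are hypothetical. It only shows the two criteria in action: each refined proposal stays between p and the previous proposal everywhere, and it drops at the rejected sample x*.

```python
import random

def refine(q, p, x_star):
    """Toy refinement: tighten the bound at the rejected point only.

    The result q' satisfies p(x) <= q'(x) <= q(x) for every x, and
    q'(x*) < q(x*) whenever q(x*) > p(x*). (Assumption: a table stands
    in for the WFSA of the abstract.)
    """
    refined = dict(q)
    refined[x_star] = p[x_star]
    return refined

def adaptive_rejection_sample(p, q, rng, max_iters=10_000):
    """Draw one sample distributed as p/Z, refining q after each rejection."""
    for _ in range(max_iters):
        candidates = list(q)
        weights = [q[x] for x in candidates]
        # Sample x from the normalized proposal q.
        x = rng.choices(candidates, weights=weights, k=1)[0]
        # Accept with probability p(x) / q(x).
        if rng.random() * q[x] < p[x]:
            return x, q
        # Rejected: x plays the role of x*, so build q^(n+1) from q^(n).
        q = refine(q, p, x)
    raise RuntimeError("no sample accepted within max_iters")

# Hypothetical unnormalized scores p(x) for three candidate alignments.
p = {"a b": 0.5, "b a": 0.2, "a a": 0.05}
# Initial proposal q^(0): a loose upper bound on p everywhere.
q0 = {x: 1.0 for x in p}

sample, q_final = adaptive_rejection_sample(p, q0, random.Random(0))
```

Because every rejection lowers the proposal at the rejected point, the acceptance rate can only improve over iterations; in the patent the same effect is obtained by refining the automaton rather than a table entry.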