Invention Grant
- Patent Title: Identifying nearest neighbors for machine translation
- Patent Title (中): 识别机器翻译的最近邻居
-
Application No.: US12060126Application Date: 2008-03-31
-
Publication No.: US08175864B1Publication Date: 2012-05-08
- Inventor: Moshe Dubiner
- Applicant: Moshe Dubiner
- Applicant Address: US CA Mountain View
- Assignee: Google Inc.
- Current Assignee: Google Inc.
- Current Assignee Address: US CA Mountain View
- Agency: Harness, Dickey & Pierce, P.L.C.
- Main IPC: G06F17/28
- IPC: G06F17/28 ; G06F17/27

Abstract:
This specification describes technologies relating to identifying nearest neighbors are provided. In one implementation, a method includes using a first and a second collections of n-grams and their associated probabilities to generate a plurality of randomized ranked collections of n-grams of each of the first natural language and the second natural language, each ranked collection of n-grams of the plurality of randomized ranked collection of n-grams having an ordering of n-grams according to a rarity of the n-grams in the respective first collection and the second collection of n-grams; using each of the plurality of ranked collections of n-grams to determine a plurality of signatures, each signature corresponding to a text of a collection of texts; and using the plurality of signatures to identify candidate text pairs within the collection of texts including a plurality of texts in the first and the second natural languages.
Information query