Invention Grant
US08175864B1 Identifying nearest neighbors for machine translation (in force)

Identifying nearest neighbors for machine translation
Abstract:
This specification describes technologies relating to identifying nearest neighbors. In one implementation, a method includes using a first and a second collection of n-grams, in a first and a second natural language respectively, and their associated probabilities to generate a plurality of randomized ranked collections of n-grams for each of the first natural language and the second natural language, each ranked collection of the plurality of randomized ranked collections having an ordering of n-grams according to the rarity of the n-grams in the respective first or second collection of n-grams; using each of the plurality of ranked collections of n-grams to determine a plurality of signatures, each signature corresponding to a text of a collection of texts; and using the plurality of signatures to identify candidate text pairs within the collection of texts, the collection including a plurality of texts in the first and the second natural languages.
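The three steps in the abstract (randomized rarity rankings, per-text signatures, candidate pairs) can be illustrated with a minimal Python sketch. It is not the patented method. Everything named here is an assumption made for illustration: the functions `randomized_rankings`, `signature`, and `candidate_pairs`, the use of -log(p) plus random jitter as the rarity score, the rule that a signature entry is the rarest n-gram a text contains under a given ranking, and the `min_matches` threshold. The abstract also does not say how signatures from the two languages are made comparable, so the sketch uses a single probability table and assumes the second-language n-grams (text `xx-1`) have already been mapped into the same vocabulary.

```python
import math
import random
from collections import defaultdict
from itertools import combinations


def randomized_rankings(probs, num_rankings, jitter=0.5, seed=0):
    """Build several rarity orderings (rarest first), each randomly perturbed."""
    rng = random.Random(seed)
    rankings = []
    for _ in range(num_rankings):
        # Assumed rarity score: -log(p), with jitter so the orderings differ.
        score = {g: -math.log(p) + rng.uniform(0.0, jitter) for g, p in probs.items()}
        ordered = sorted(score, key=lambda g: (-score[g], g))
        rankings.append({g: rank for rank, g in enumerate(ordered)})
    return rankings


def signature(ngrams, rankings):
    """One entry per ranking: the best-ranked (rarest) known n-gram in the text."""
    sig = []
    for ranking in rankings:
        known = [g for g in ngrams if g in ranking]
        sig.append(min(known, key=ranking.get) if known else None)
    return tuple(sig)


def candidate_pairs(signatures, min_matches=1):
    """Pair texts whose signatures agree in at least min_matches positions."""
    buckets = defaultdict(set)
    for text_id, sig in signatures.items():
        for position, value in enumerate(sig):
            if value is not None:
                buckets[(position, value)].add(text_id)
    counts = defaultdict(int)
    for ids in buckets.values():
        for a, b in combinations(sorted(ids), 2):
            counts[(a, b)] += 1
    return {pair for pair, n in counts.items() if n >= min_matches}


# Toy data: made-up unigram probabilities and three short texts.
probs = {"the": 0.5, "cat": 0.05, "sat": 0.04, "quantum": 0.001, "zebra": 0.002}
rankings = randomized_rankings(probs, num_rankings=4)
texts = {
    "en-1": ["the", "quantum", "cat"],
    "xx-1": ["quantum", "the"],  # hypothetical second-language text, already mapped
    "en-2": ["the", "zebra", "sat"],
}
signatures = {tid: signature(ngrams, rankings) for tid, ngrams in texts.items()}
pairs = candidate_pairs(signatures, min_matches=2)
print(pairs)  # {('en-1', 'xx-1')}
```

Because only texts that share a bucket are compared, the number of pairwise checks stays far below all-pairs comparison when rare n-grams are distinctive, which is the usual motivation for signature-based nearest-neighbor search.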