Invention Grant
US08600730B2 Language segmentation of multilingual texts 有权
多语言文本语言分割

Language segmentation of multilingual texts
Abstract:
A system and method for segmenting a multi-language text is provided. An exemplary method comprises determining an initial probability distribution for sentences in the multi-language text, the initial probability distribution indicating the likelihood of each sentence being in each of a set of languages. A probability of language transitions across sentences may be learned based on the initial probability distribution. Additionally, a highest probability language sequence of sentences in the multi-language text may be determined based on a combination of the probability of language transitions and the prior probability distribution provided by an initial model.
Public/Granted literature
Information query
Patent Agency Ranking
0/0