Invention Grant
- Patent Title: Adaptive construction of a statistical language model
- Patent Title (中): 统计语言模型的自适应构建
-
Application No.: US12684749Application Date: 2010-01-08
-
Publication No.: US08577670B2Publication Date: 2013-11-05
- Inventor: Kuansan Wang , Xiaolong Li , Jiangbo Miao , Frederic H. Behr, Jr.
- Applicant: Kuansan Wang , Xiaolong Li , Jiangbo Miao , Frederic H. Behr, Jr.
- Applicant Address: US WA Redmond
- Assignee: Microsoft Corporation
- Current Assignee: Microsoft Corporation
- Current Assignee Address: US WA Redmond
- Main IPC: G06F17/27
- IPC: G06F17/27

Abstract:
A statistical language model (SLM) may be iteratively refined by considering N-gram counts in new data, and blending the information contained in the new data with the existing SLM. A first group of documents is evaluated to determine the probabilities associated with the different N-grams observed in the documents. An SLM is constructed based on these probabilities. A second group of documents is then evaluated to determine the probabilities associated with each N-gram in that second group. The existing SLM is then evaluated to determine how well it explains the probabilities in the second group of documents, and a weighting parameter is calculated from that evaluation. Using the weighting parameter, a new SLM is then constructed as a weighted average of the existing SLM and the new probabilities.
Public/Granted literature
- US20110172988A1 ADAPTIVE CONSTRUCTION OF A STATISTICAL LANGUAGE MODEL Public/Granted day:2011-07-14
Information query