Invention Grant
US09251250B2 Method and apparatus for processing text with variations in vocabulary usage
Legal status: Active
- Patent Title: Method and apparatus for processing text with variations in vocabulary usage
- Application No.: US13433111
- Application Date: 2012-03-28
- Publication No.: US09251250B2
- Publication Date: 2016-02-02
- Inventor: John R. Hershey, Jonathan Le Roux, Creighton K Heakulani
- Applicant: John R. Hershey, Jonathan Le Roux, Creighton K Heakulani
- Applicant Address: US MA Cambridge
- Assignee: Mitsubishi Electric Research Laboratories, Inc.
- Current Assignee: Mitsubishi Electric Research Laboratories, Inc.
- Current Assignee Address: US MA Cambridge
- Agent: Dirk Brinkman; Gene Vinokur
- Main IPC: G06F17/27
- IPC: G06F17/27; G06F17/21; G10L15/00; G10L15/18; G06F17/30

Abstract:
Text is processed to construct a model of the text. The text has a shared vocabulary. The text is partitioned into sets and subsets of texts. The usage of the shared vocabulary in two or more sets is different, and the topics of two or more subsets are different. A probabilistic model is defined for the text. The probabilistic model considers each word in the text to be a token having a position and a word value, and the usage of the shared vocabulary, topics, subtopics, and word values for each token in the text are represented using distributions of random variables in the probabilistic model, wherein the random variables are discrete. Parameters are estimated for the model corresponding to the vocabulary usages, the word values, the topics, and the subtopics associated with the words.
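As a rough illustration of the kind of model the abstract describes, the following Python sketch generates tokens from discrete random variables: a topic is drawn per token, a subtopic is drawn given the topic, and a word value is drawn from a usage distribution that differs between sets of texts. This is a minimal sketch under assumptions, not the patented method: the function names, the toy vocabulary, the set labels ("informal", "formal"), and all probability values are hypothetical, and the parameter estimation step mentioned in the abstract is not shown.

```python
import random


def sample_categorical(rng, probs):
    """Draw an index from a discrete distribution given as a list of weights."""
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]


def generate_tokens(rng, doc_topic_probs, topic_subtopic_probs,
                    usage_probs, vocab, n_tokens):
    """Generate tokens, each with a position, topic, subtopic, and word value.

    doc_topic_probs:      topic distribution of one subset (document)
    topic_subtopic_probs: one subtopic distribution per topic
    usage_probs:          one word distribution per subtopic, specific to
                          the set (vocabulary usage) the document belongs to
    """
    tokens = []
    for position in range(n_tokens):
        topic = sample_categorical(rng, doc_topic_probs)
        subtopic = sample_categorical(rng, topic_subtopic_probs[topic])
        word_id = sample_categorical(rng, usage_probs[subtopic])
        tokens.append({
            "position": position,
            "topic": topic,
            "subtopic": subtopic,
            "word": vocab[word_id],
        })
    return tokens


# Hypothetical toy parameters: a shared vocabulary, two topics, two subtopics.
VOCAB = ["car", "automobile", "doctor", "physician"]

# Topic 0 always yields subtopic 0 (vehicles); topic 1 yields subtopic 1 (medicine).
TOPIC_SUBTOPIC = [[1.0, 0.0], [0.0, 1.0]]

# Each set uses the shared vocabulary differently for the same subtopic.
USAGE = {
    "informal": [[0.9, 0.1, 0.0, 0.0], [0.0, 0.0, 0.9, 0.1]],
    "formal": [[0.1, 0.9, 0.0, 0.0], [0.0, 0.0, 0.1, 0.9]],
}

if __name__ == "__main__":
    rng = random.Random(0)
    # A document about vehicles only, written in the "formal" set.
    for token in generate_tokens(rng, [1.0, 0.0], TOPIC_SUBTOPIC,
                                 USAGE["formal"], VOCAB, n_tokens=5):
        print(token)
```

In this toy setup, two documents on the same topic share the same subtopic but tend to surface different words depending on their set, which is the kind of variation in vocabulary usage the abstract refers to.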
Public/Granted literature
- US20130262083A1 Method and Apparatus for Processing Text with Variations in Vocabulary Usage (Public/Granted day: 2013-10-03)