Patent search ap:("MICROSOFT CORPORATION") AND inv:"ACERO Page Alejandro"

11.

Invention application
AUTOMATIC READING TUTORING WITH PARALLEL POLARIZED LANGUAGE MODELING (Status: under examination, published)

Publication (announcement) No.: WO2008089469A1

Publication (announcement) date: 2008-07-24

Application No.: PCT/US2008/051582

Application date: 2008-01-21

Applicant: MICROSOFT CORPORATION

Inventor: LI, Xiaolong; JU, Yun-cheng; DENG, Li; ACERO, Alejandro

IPC: G06F17/28

CPC classification number: G06F17/271, G09B17/003, G10L15/197, G10L2015/221

Abstract: A novel system for automatic reading tutoring provides effective error detection and reduced false alarms combined with low processing time burdens and response times short enough to maintain a natural, engaging flow of interaction. According to one illustrative embodiment, an automatic reading tutoring method includes displaying a text output and receiving an acoustic input. The acoustic input is modeled with a domain-specific target language model specific to the text output, and with a general-domain garbage language model, both of which may be efficiently constructed as context-free grammars. The domain-specific target language model may be built dynamically or "on-the-fly" based on the currently displayed text (e.g., the story to be read by the user), while the general-domain garbage language model is shared among all different text outputs. User-perceptible tutoring feedback is provided based on the target language model and the garbage language model.

12.

Invention application
DETECTING AN ANSWERING MACHINE USING SPEECH RECOGNITION (Status: under examination, published)

Publication (announcement) No.: WO2008008117A1

Publication (announcement) date: 2008-01-17

Application No.: PCT/US2007/011567

Application date: 2007-05-15

Applicant: MICROSOFT CORPORATION

Inventor: ACERO, Alejandro; FISHER, Craig M.; YU, Dong; WANG, Ye-Yi; JU, Yu-Cheng

IPC: G10L15/22, G10L15/00, G10L15/06, H04M1/67

CPC classification number: G10L25/78, G10L15/22, G10L15/26, H04M3/5158, H04M2203/2027

Abstract: An answering machine detection module is used to determine whether a call recipient is an actual person or an answering machine. The answering machine detection module includes a speech recognizer and a call analysis module. The speech recognizer receives an audible response of the call recipient to a call. The speech recognizer processes the audible response and provides an output indicative of recognized speech. The call analysis module processes the output of the speech recognizer to generate an output indicative of whether the call recipient is a person or an answering machine.

13.

Invention publication
COMBINED SPEECH AND ALTERNATE INPUT MODALITY TO A MOBILE DEVICE (Status: in force)

Publication (announcement) No.: EP1941344A1

Publication (announcement) date: 2008-07-09

Application No.: EP06817051.3

Application date: 2006-10-16

Applicant: Microsoft Corporation

Inventor: MAHAJAN, Milind V.; ACERO, Alejandro; HSU, Bo-June

IPC: G06F3/16, G10L15/22

CPC classification number: G10L15/22

Abstract: Both speech and alternate modality inputs are used in inputting information spoken into a mobile device. The alternate modality inputs can be used to perform sequential commitment of words in a speech recognition result.

14.

Granted invention
METHOD AND APPARATUS FOR PITCH TRACKING (Status: in force)

Publication (announcement) No.: EP1145224B1

Publication (announcement) date: 2006-06-07

Application No.: EP99959072.2

Application date: 1999-11-22

Applicant: MICROSOFT CORPORATION

Inventor: ACERO, Alejandro; DROPPO, James, G., III

IPC: G10L11/04, G10L11/06

CPC classification number: G10L25/93, G10L25/06, G10L25/90

Abstract: In a method for tracking pitch in a speech signal (200), first and second window vectors, x_t and x_{t-p}, are created from samples (414, 416, 418, 408, 410, 412) taken across first and second windows (402, 400) of the speech signal. The first window (402) is separated from the second window (400) by a test pitch period (406). The energy of the speech signal in the first window is combined with the correlation between the first window vector and the second window vector to produce a predictable energy factor. The predictable energy factor is then used to determine a pitch score for the test pitch period. Based in part on the pitch score, a portion of the pitch track is identified.
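
The scoring idea in this abstract can be illustrated with a minimal Python sketch. It is not the patented method: the abstract does not say how energy and correlation are combined, so the sketch assumes a simple product of the current window's energy and the normalized cross-correlation between the two windows, and uses that product directly as the pitch score. The names `pitch_score`, `best_period`, and `half_window` are hypothetical.

```python
import math


def pitch_score(signal, t, period, half_window):
    """Score one candidate pitch period at sample index t.

    One window is centered at t, the other one candidate period
    earlier (t - period). Assumption: the "predictable energy" is the
    current window's energy times the normalized cross-correlation
    of the two windows, and it is used as the score as-is.
    """
    start_prev = t - period - half_window
    end_cur = t + half_window + 1
    if start_prev < 0 or end_cur > len(signal):
        raise ValueError("windows fall outside the signal")

    cur = signal[t - half_window:end_cur]
    prev = signal[start_prev:t - period + half_window + 1]

    energy_cur = sum(x * x for x in cur)
    energy_prev = sum(x * x for x in prev)
    if energy_cur == 0.0 or energy_prev == 0.0:
        return 0.0  # silence: nothing is predictable

    dot = sum(a * b for a, b in zip(cur, prev))
    correlation = dot / math.sqrt(energy_cur * energy_prev)
    return energy_cur * correlation


def best_period(signal, t, candidates, half_window):
    """Return the candidate period with the highest score at index t."""
    return max(candidates, key=lambda p: pitch_score(signal, t, p, half_window))


# Usage: a pure sine with a 50-sample period, searched over 20..80 samples.
tone = [math.sin(2 * math.pi * i / 50) for i in range(400)]
estimated = best_period(tone, 200, range(20, 81), 40)
```

A full tracker would score every frame and then choose a smooth path through the per-frame scores; the abstract only says that part of the pitch track is identified "based in part on" the score, so that step is left out here.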

15.

Invention publication
INDEXING AND SEARCHING SPEECH WITH TEXT META-DATA (Status: under examination, published)

Publication (announcement) No.: EP1952270A1

Publication (announcement) date: 2008-08-06

Application No.: EP06827328.3

Application date: 2006-10-31

Applicant: Microsoft Corporation

Inventor: ACERO, Alejandro; CHELBA, Ciprian I.; SANCHEZ, Jorge Silva F.

IPC: G06F17/20, G06F17/28, G06F17/30

CPC classification number: G06F17/30778, G06F17/30746, G06F17/30749, G10L15/197

Abstract: An index for searching spoken documents having speech data and text meta-data is created by obtaining probabilities of occurrence of words and positional information of the words of the speech data and combining them with at least positional information of the words in the text meta-data. A single index can be created because the speech data and the text meta-data are treated the same and considered only different categories.

16.

Invention publication
MULTI-SENSORY SPEECH ENHANCEMENT USING A SPEECH-STATE MODEL (Status: in force)

Publication (announcement) No.: EP1891624A2

Publication (announcement) date: 2008-02-27

Application No.: EP06772956.6

Application date: 2006-06-13

Applicant: MICROSOFT CORPORATION

Inventor: ZHANG, Zhengyou; LIU, Zicheng; ACERO, Alejandro; SUBRAMANYA, Amarnag; DROPPO, James, G.

IPC: G10L15/20

CPC classification number: G10L21/0208, G10L2021/02165

Abstract: A method and apparatus determine a likelihood of a speech state based on an alternative sensor signal (316) and an air conduction microphone signal (318). The likelihood of the speech state is used, together with the alternative sensor signal and the air conduction microphone signal, to estimate (322) a clean speech value for a clean speech signal (324).

17.

Granted invention
MULTI-SENSORY SPEECH ENHANCEMENT USING A SPEECH-STATE MODEL (Status: in force)

Publication (announcement) No.: EP1891624B1

Publication (announcement) date: 2011-05-04

Application No.: EP06772956.6

Application date: 2006-06-13

Applicant: MICROSOFT CORPORATION

Inventor: ZHANG, Zhengyou; LIU, Zicheng; ACERO, Alejandro; SUBRAMANYA, Amarnag; DROPPO, James, G.

IPC: G10L15/14, G10L21/00, G10L21/02, G10L15/00

CPC classification number: G10L21/0208, G10L2021/02165

Abstract: A method and apparatus determine a likelihood of a speech state based on an alternative sensor signal (316) and an air conduction microphone signal (318). The likelihood of the speech state is used, together with the alternative sensor signal and the air conduction microphone signal, to estimate (322) a clean speech value for a clean speech signal (324).

18.

Invention publication
DETECTING AN ANSWERING MACHINE USING SPEECH RECOGNITION (Status: under examination, published)

Publication (announcement) No.: EP2038877A1

Publication (announcement) date: 2009-03-25

Application No.: EP07777047.7

Application date: 2007-05-15

Applicant: Microsoft Corporation

Inventor: ACERO, Alejandro; FISHER, Craig M.; YU, Dong; WANG, Ye-Yi; JU, Yu-Cheng

IPC: G10L15/22, G10L15/00, G10L15/06, H04M1/67

CPC classification number: G10L25/78, G10L15/22, G10L15/26, H04M3/5158, H04M2203/2027

Abstract: An answering machine detection module is used to determine whether a call recipient is an actual person or an answering machine. The answering machine detection module includes a speech recognizer and a call analysis module. The speech recognizer receives an audible response of the call recipient to a call. The speech recognizer processes the audible response and provides an output indicative of recognized speech. The call analysis module processes the output of the speech recognizer to generate an output indicative of whether the call recipient is a person or an answering machine.

19.

Granted invention
COMBINED SPEECH AND ALTERNATE INPUT MODALITY TO A MOBILE DEVICE (Status: in force)

Publication (announcement) No.: EP1941344B1

Publication (announcement) date: 2013-04-17

Application No.: EP06817051.3

Application date: 2006-10-16

Applicant: Microsoft Corporation

Inventor: MAHAJAN, Milind V.; ACERO, Alejandro; HSU, Bo-June

IPC: G06F3/16, G10L15/22

CPC classification number: G10L15/22

20.

Granted invention
MULTI-SENSORY SPEECH ENHANCEMENT USING A CLEAN SPEECH PRIOR (Status: in force)

Publication (announcement) No.: EP1891627B1

Publication (announcement) date: 2010-08-04

Application No.: EP06772389.0

Application date: 2006-06-06

Applicant: MICROSOFT CORPORATION

Inventor: ACERO, Alejandro; LIU, Zicheng; ZHANG, Zhengyou

IPC: G10L21/02, G10L15/20

CPC classification number: H04R3/005, G10L21/0208, H04R2460/13

Abstract: A method and apparatus determine a channel response for an alternative sensor using an alternative sensor signal and an air conduction microphone signal (500). The channel response and a prior probability distribution for clean speech values are then used to estimate a clean speech value (502, 504, 506 and 508).
