Invention Grant
US08635067B2 Model restructuring for client and server based automatic speech recognition
失效
基于客户端和服务器的自动语音识别模型重组
- Patent Title: Model restructuring for client and server based automatic speech recognition
- Patent Title (中): 基于客户端和服务器的自动语音识别模型重组
-
Application No.: US12964433Application Date: 2010-12-09
-
Publication No.: US08635067B2Publication Date: 2014-01-21
- Inventor: Pierre Dognin , Vaibhava Goel , John R. Hershey , Peder A. Olsen
- Applicant: Pierre Dognin , Vaibhava Goel , John R. Hershey , Peder A. Olsen
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agency: Otterstedt, Ellenbogen & Kammer, LLP
- Agent Anne V. Dougherty
- Main IPC: G10L15/14
- IPC: G10L15/14

Abstract:
Access is obtained to a large reference acoustic model for automatic speech recognition. The large reference acoustic model has L states modeled by L mixture models, and the large reference acoustic model has N components. A desired number of components Nc, less than N, to be used in a restructured acoustic model derived from the reference acoustic model, is identified. The desired number of components Nc is selected based on a computing environment in which the restructured acoustic model is to be deployed. The restructured acoustic model also has L states. For each given one of the L mixture models in the reference acoustic model, a merge sequence is built which records, for a given cost function, sequential mergers of pairs of the components associated with the given one of the mixture models. A portion of the Nc components is assigned to each of the L states in the restructured acoustic model. The restructured acoustic model is built by, for each given one of the L states in the restructured acoustic model, applying the merge sequence to a corresponding one of the L mixture models in the reference acoustic model until the portion of the Nc components assigned to the given one of the L states is achieved.
Public/Granted literature
- US20120150536A1 MODEL RESTRUCTURING FOR CLIENT AND SERVER BASED AUTOMATIC SPEECH RECOGNITION Public/Granted day:2012-06-14
Information query