-
公开(公告)号:JP2002073084A
公开(公告)日:2002-03-12
申请号:JP2001178028
申请日:2001-06-13
Applicant: IBM
Inventor: BADT DANIEL E , GUASTI PETER J , HANSON GARY R , NASSIFF AMADO , RODRIGUEZ EDWIN A , RUBACK HARVEY R , SMITH CARL A , VANBUSKIRK RONALD E , WANG HUIFANG , STEVEN G WOODWARD
Abstract: PROBLEM TO BE SOLVED: To provide a method and a system for improving the re-readout of voice data in a computer voice system. SOLUTION: When the voice data can be compared with a prescribed voice cache entry reference/emission standard and the voice data satisfies one of the prescribed entry references, the additional purge of at least one item can be performed to a voice cache. The item corresponds to the voice data. The entry reference can include voice data used frequently, voice data used recently, and important voice data. Similarly, the emission standard can include voice data having the minimum frequency of use relating to each item in the voice caches, voice data unused for the longest time relating to each item in the voice caches, and voice data having the minimum level of importance relating to each item in the voice caches.
-
公开(公告)号:DE60100090T2
公开(公告)日:2003-09-25
申请号:DE60100090
申请日:2001-06-12
Applicant: IBM
Inventor: BADT DANIEL E , GUASTI PETER J , HANSON GARY R , NASSIFF AMADO , RODRIGUEZ EDWIN A , RUBACK HARVEY R , SMITH CARL A , VANBUSKIRK RONALD E , WANG HUIFANG
Abstract: A method and a system for improving recall of speech data in a computer speech system can include a plurality of speech cache management steps including providing a speech cache, receiving a speech system input and identifying a speech event in the received speech system input, the speech event comprising speech data. Subsequently, the speech data can be compared to pre-determined speech cache entry criteria; and, if the speech data meets one of the pre-determined entry criteria, at least one entry can be added to the speech cache, the at least one entry corresponding to the speech data. Additionally, the speech data can be compared to pre-determined speech cache exit criteria; and, if the speech data meets one of the pre-determined exit criteria, at least one entry can be purged from the speech cache, the at least one entry corresponding to the speech data. The entry criteria can include frequently used speech data, recently used speech data, and important speech data. Similarly, the exit criteria can include least frequently used speech data associated with each entry in the speech cache, least recently used speech data associated with each entry in the speech cache and least important speed data associated with each entry in the speech cache.
-
公开(公告)号:DE60023398T2
公开(公告)日:2006-07-06
申请号:DE60023398
申请日:2000-05-24
Applicant: IBM
Inventor: BALLARD BARBARA ELAINE , LEWIS JAMES R , NASSIFF AMADO , ORTEGA KERRY A , VANBUSKIRK RONALD E , WANG HUIFANG
Abstract: A method and system for improving the speech command recognition accuracy of a computer speech recognition system uses event-based constraints to recognize a spoken command. The constraints are system states and events, which include system activities, active applications, prior commands and an event queue. The method and system is performed by monitoring events and states of the computer system and receiving a processed command corresponding to the spoken command. The processed command is statistically analyzed in light of the system events and states as well as according to an acoustic model. The system then identifies a recognized command corresponding to the spoken command.
-
公开(公告)号:AT231643T
公开(公告)日:2003-02-15
申请号:AT01000205
申请日:2001-06-12
Applicant: IBM
Inventor: BADT DANIEL E , GUASTI PETER J , HANSON GARY R , NASSIFF AMADO , RODRIGUEZ EDWIN A , RUBACK HARVEY R , SMITH CARL A , VANBUSKIRK RONALD E , WANG HUIFANG
Abstract: A method and a system for improving recall of speech data in a computer speech system can include a plurality of speech cache management steps including providing a speech cache, receiving a speech system input and identifying a speech event in the received speech system input, the speech event comprising speech data. Subsequently, the speech data can be compared to pre-determined speech cache entry criteria; and, if the speech data meets one of the pre-determined entry criteria, at least one entry can be added to the speech cache, the at least one entry corresponding to the speech data. Additionally, the speech data can be compared to pre-determined speech cache exit criteria; and, if the speech data meets one of the pre-determined exit criteria, at least one entry can be purged from the speech cache, the at least one entry corresponding to the speech data. The entry criteria can include frequently used speech data, recently used speech data, and important speech data. Similarly, the exit criteria can include least frequently used speech data associated with each entry in the speech cache, least recently used speech data associated with each entry in the speech cache and least important speed data associated with each entry in the speech cache.
-
5.
公开(公告)号:CA2303718A1
公开(公告)日:2000-11-29
申请号:CA2303718
申请日:2000-04-05
Applicant: IBM
Inventor: BALLARD BARBARA ELAINE , WANG HUIFANG , VANBUSKIRK RONALD E , ORTEGA KERRY A , NASSIFF AMADO , LEWIS JAMES R
Abstract: A method and system for improving the speech command recognition accuracy of a computer speech recognition system uses event-based constraints to recognize a spoken command. The constraints are system states and events, which include system activities, active applications, prior commands and an event queue. The method and system is performed by monitoring events and states of the computer system and receiving a processed command corresponding to the spoken command. The processed command is statistically analyzed in light of the system events and states as well as according to an acoustic model. The system then identifies a recognized command corresponding to the spoken command.
-
公开(公告)号:GB2350709B
公开(公告)日:2003-08-13
申请号:GB0002461
申请日:2000-02-03
Applicant: IBM
Inventor: FADO FRANK , GUASTI PETER , NASSIFF AMADO , BUSKIRK RONALD VAN , RUBACK HARVEY
Abstract: A method for enabling user selectable input devices for dictation or transcription in a speech application, comprising the steps of: establishing a registry of dictation and transcription device descriptions, each of the descriptions including a device specific image, a device specific set of device-connecting instructions and a device specific list of audio configuration parameters; building dynamic tables containing information retrieved from the registry; establishing and storing a plurality of enrollments, each of the enrollments representing a speech file of user specific training data corresponding to at least one of a specific audio input device and a specific audio environment; and, generating GUI display screen using the information in at least one of the dynamic tables to enable user selection any input device in the registry for which one of the enrollments is available, for use as a dictation or transcription input to the speech application.
-
公开(公告)号:GB2349001B
公开(公告)日:2003-08-06
申请号:GB0002568
申请日:2000-02-03
Applicant: IBM
Inventor: FADO FRANK , GUASTI PETER , NASSIFF AMADO , BUSKIRK RONALD VAN , RUBACK HARVEY
Abstract: A method for maintaining input device identity in a speech application, comprising the steps of: storing a plurality of enrollments, each of the enrollments representing a speech file of training data associated with at least one of a specific audio input device and a specific audio environment for a specific user; generating a graphical user interface (GUI) display screen for prompting and enabling user selection of at least one of an audio input device and an audio environment; and, retrieving one of the enrollments responsive to the user selection, for use in a dictation or transcription session.
-
公开(公告)号:GB2350709A
公开(公告)日:2000-12-06
申请号:GB0002461
申请日:2000-02-03
Applicant: IBM
Inventor: FADO FRANK , GUASTI PETER , NASSIFF AMADO , BUSKIRK RONALD VAN , RUBACK HARVEY
Abstract: A method to enable a user to select input devices for a speech application comprises establishing a registry of device descriptions and building a dynamic table, establishing a plurality of enrollments and generating a GUI. Each device description includes a device specific image, a set of connecting instructions and a list of configuration parameters. Each enrollment represents a user specific file created in a speech training session and corresponds to at least one audio input device and an audio environment. The GUI uses the dynamic tables to enable a user, preferably by a series of nested menus, to select a device from the registry and an associated enrollment for use in the speech application. The registry can provide a list of compatible devices such that when an enrollment does not exist for a particular device, the user can select an enrollment for a compatible device without a training session.
-
公开(公告)号:ES2248018T3
公开(公告)日:2006-03-16
申请号:ES00304415
申请日:2000-05-24
Applicant: IBM
Inventor: BALLARD BARBARA ELAINE , LEWIS JAMES R , NASSIFF AMADO , ORTEGA KERRY A , VANBUSKIRK RONALD E , WANG HUIFANG
Abstract: Un método para uso en un programa informático para reconocimiento de voz que funciona en diversos estados y ejecuta un programa para poner en práctica diversos sucesos, para reconocer una instrucción oral, que comprende los pasos de: monitorizar (54) al menos uno de dichos sucesos y estados; recibir una instrucción procesada correspondiente a dicha instrucción oral; analizar (44) dicha instrucción procesada de acuerdo con al menos un modelo acústico a fin de identificar una probable coincidencia acústica; analizar (48) dicha instrucción procesada a fin de identificar una probable coincidencia de contexto usando un modelo estadístico para analizar al menos uno de dichos sucesos y estados de acuerdo con un conjunto finito de instrucciones ponderadas de acuerdo con la probabilidad estadística de sus sucesos correspondientes que ocurren en el estado dado; y proporcionar una instrucción reconocida basada en dichas probables coincidencias acústicas y de contexto.
-
公开(公告)号:DE60023398D1
公开(公告)日:2005-12-01
申请号:DE60023398
申请日:2000-05-24
Applicant: IBM
Inventor: BALLARD BARBARA ELAINE , LEWIS JAMES R , NASSIFF AMADO , ORTEGA KERRY A , VANBUSKIRK RONALD E , WANG HUIFANG
Abstract: A method and system for improving the speech command recognition accuracy of a computer speech recognition system uses event-based constraints to recognize a spoken command. The constraints are system states and events, which include system activities, active applications, prior commands and an event queue. The method and system is performed by monitoring events and states of the computer system and receiving a processed command corresponding to the spoken command. The processed command is statistically analyzed in light of the system events and states as well as according to an acoustic model. The system then identifies a recognized command corresponding to the spoken command.
-
-
-
-
-
-
-
-
-