2.
    发明专利
    未知

    公开(公告)号:DE60100090T2

    公开(公告)日:2003-09-25

    申请号:DE60100090

    申请日:2001-06-12

    Applicant: IBM

    Abstract: A method and a system for improving recall of speech data in a computer speech system can include a plurality of speech cache management steps including providing a speech cache, receiving a speech system input and identifying a speech event in the received speech system input, the speech event comprising speech data. Subsequently, the speech data can be compared to pre-determined speech cache entry criteria; and, if the speech data meets one of the pre-determined entry criteria, at least one entry can be added to the speech cache, the at least one entry corresponding to the speech data. Additionally, the speech data can be compared to pre-determined speech cache exit criteria; and, if the speech data meets one of the pre-determined exit criteria, at least one entry can be purged from the speech cache, the at least one entry corresponding to the speech data. The entry criteria can include frequently used speech data, recently used speech data, and important speech data. Similarly, the exit criteria can include least frequently used speech data associated with each entry in the speech cache, least recently used speech data associated with each entry in the speech cache and least important speed data associated with each entry in the speech cache.

    3.
    发明专利
    未知

    公开(公告)号:DE60023398T2

    公开(公告)日:2006-07-06

    申请号:DE60023398

    申请日:2000-05-24

    Applicant: IBM

    Abstract: A method and system for improving the speech command recognition accuracy of a computer speech recognition system uses event-based constraints to recognize a spoken command. The constraints are system states and events, which include system activities, active applications, prior commands and an event queue. The method and system is performed by monitoring events and states of the computer system and receiving a processed command corresponding to the spoken command. The processed command is statistically analyzed in light of the system events and states as well as according to an acoustic model. The system then identifies a recognized command corresponding to the spoken command.

    4.
    发明专利
    未知

    公开(公告)号:AT231643T

    公开(公告)日:2003-02-15

    申请号:AT01000205

    申请日:2001-06-12

    Applicant: IBM

    Abstract: A method and a system for improving recall of speech data in a computer speech system can include a plurality of speech cache management steps including providing a speech cache, receiving a speech system input and identifying a speech event in the received speech system input, the speech event comprising speech data. Subsequently, the speech data can be compared to pre-determined speech cache entry criteria; and, if the speech data meets one of the pre-determined entry criteria, at least one entry can be added to the speech cache, the at least one entry corresponding to the speech data. Additionally, the speech data can be compared to pre-determined speech cache exit criteria; and, if the speech data meets one of the pre-determined exit criteria, at least one entry can be purged from the speech cache, the at least one entry corresponding to the speech data. The entry criteria can include frequently used speech data, recently used speech data, and important speech data. Similarly, the exit criteria can include least frequently used speech data associated with each entry in the speech cache, least recently used speech data associated with each entry in the speech cache and least important speed data associated with each entry in the speech cache.

    User selectable input devices for speech applications

    公开(公告)号:GB2350709B

    公开(公告)日:2003-08-13

    申请号:GB0002461

    申请日:2000-02-03

    Applicant: IBM

    Abstract: A method for enabling user selectable input devices for dictation or transcription in a speech application, comprising the steps of: establishing a registry of dictation and transcription device descriptions, each of the descriptions including a device specific image, a device specific set of device-connecting instructions and a device specific list of audio configuration parameters; building dynamic tables containing information retrieved from the registry; establishing and storing a plurality of enrollments, each of the enrollments representing a speech file of user specific training data corresponding to at least one of a specific audio input device and a specific audio environment; and, generating GUI display screen using the information in at least one of the dynamic tables to enable user selection any input device in the registry for which one of the enrollments is available, for use as a dictation or transcription input to the speech application.

    Maintaining input device identity

    公开(公告)号:GB2349001B

    公开(公告)日:2003-08-06

    申请号:GB0002568

    申请日:2000-02-03

    Applicant: IBM

    Abstract: A method for maintaining input device identity in a speech application, comprising the steps of: storing a plurality of enrollments, each of the enrollments representing a speech file of training data associated with at least one of a specific audio input device and a specific audio environment for a specific user; generating a graphical user interface (GUI) display screen for prompting and enabling user selection of at least one of an audio input device and an audio environment; and, retrieving one of the enrollments responsive to the user selection, for use in a dictation or transcription session.

    User selectable input devices for speech applications

    公开(公告)号:GB2350709A

    公开(公告)日:2000-12-06

    申请号:GB0002461

    申请日:2000-02-03

    Applicant: IBM

    Abstract: A method to enable a user to select input devices for a speech application comprises establishing a registry of device descriptions and building a dynamic table, establishing a plurality of enrollments and generating a GUI. Each device description includes a device specific image, a set of connecting instructions and a list of configuration parameters. Each enrollment represents a user specific file created in a speech training session and corresponds to at least one audio input device and an audio environment. The GUI uses the dynamic tables to enable a user, preferably by a series of nested menus, to select a device from the registry and an associated enrollment for use in the speech application. The registry can provide a list of compatible devices such that when an enrollment does not exist for a particular device, the user can select an enrollment for a compatible device without a training session.

    METODO Y APARATO PARA MEJORAR LA PRECISION DEL RECONOCIMIENTO DE LA ORDEN DE VOZ UTILIZANDO LIMITACIONES BASADAS EN EVENTOS.

    公开(公告)号:ES2248018T3

    公开(公告)日:2006-03-16

    申请号:ES00304415

    申请日:2000-05-24

    Applicant: IBM

    Abstract: Un método para uso en un programa informático para reconocimiento de voz que funciona en diversos estados y ejecuta un programa para poner en práctica diversos sucesos, para reconocer una instrucción oral, que comprende los pasos de: monitorizar (54) al menos uno de dichos sucesos y estados; recibir una instrucción procesada correspondiente a dicha instrucción oral; analizar (44) dicha instrucción procesada de acuerdo con al menos un modelo acústico a fin de identificar una probable coincidencia acústica; analizar (48) dicha instrucción procesada a fin de identificar una probable coincidencia de contexto usando un modelo estadístico para analizar al menos uno de dichos sucesos y estados de acuerdo con un conjunto finito de instrucciones ponderadas de acuerdo con la probabilidad estadística de sus sucesos correspondientes que ocurren en el estado dado; y proporcionar una instrucción reconocida basada en dichas probables coincidencias acústicas y de contexto.

    10.
    发明专利
    未知

    公开(公告)号:DE60023398D1

    公开(公告)日:2005-12-01

    申请号:DE60023398

    申请日:2000-05-24

    Applicant: IBM

    Abstract: A method and system for improving the speech command recognition accuracy of a computer speech recognition system uses event-based constraints to recognize a spoken command. The constraints are system states and events, which include system activities, active applications, prior commands and an event queue. The method and system is performed by monitoring events and states of the computer system and receiving a processed command corresponding to the spoken command. The processed command is statistically analyzed in light of the system events and states as well as according to an acoustic model. The system then identifies a recognized command corresponding to the spoken command.

Patent Agency Ranking