Speech processing
    Invention patent

    Publication No.: GB2349001A

    Publication date: 2000-10-18

    Application No.: GB0002568

    Filing date: 2000-02-03

    Applicant: IBM

    Abstract: A method in a speech application has the steps of: storing a plurality of enrollments, each of the enrollments representing a speech file of training data associated with at least one of a specific audio input device and a specific audio environment for a specific user; generating a graphical user interface (GUI) display screen for prompting and enabling user selection of at least one of an audio input device and an audio environment; and, retrieving one of the enrollments responsive to the user selection, for use in a dictation or transcription session.
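
The store-and-retrieve steps in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the class names, the file names, and the choice to key every enrollment on both a device and an environment are assumptions made for illustration (the abstract only requires at least one of the two).

```python
from dataclasses import dataclass
from typing import Dict, Optional, Tuple


@dataclass(frozen=True)
class Enrollment:
    """One stored enrollment: a speech file of training data for one user."""
    user: str
    device: str        # audio input device, e.g. "headset"
    environment: str   # audio environment, e.g. "office"
    speech_file: str   # hypothetical file name for the training data


class EnrollmentStore:
    """Hypothetical in-memory store keyed by (user, device, environment)."""

    def __init__(self) -> None:
        self._by_key: Dict[Tuple[str, str, str], Enrollment] = {}

    def store(self, enrollment: Enrollment) -> None:
        key = (enrollment.user, enrollment.device, enrollment.environment)
        self._by_key[key] = enrollment

    def retrieve(self, user: str, device: str, environment: str) -> Optional[Enrollment]:
        # Returns None when no enrollment matches the selection.
        return self._by_key.get((user, device, environment))


store = EnrollmentStore()
store.store(Enrollment("alice", "headset", "office", "alice_headset_office.dat"))
store.store(Enrollment("alice", "handheld recorder", "car", "alice_recorder_car.dat"))

# The GUI step is stood in for by a plain function call with the user's choice.
selected = store.retrieve("alice", "handheld recorder", "car")
```

In a real application the selection would come from the GUI screen described in the abstract; here it is passed directly so the lookup logic stays visible.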

    User selectable input devices for speech applications

    Publication No.: GB2350709A8

    Publication date: 2001-03-06

    Application No.: GB0002461

    Filing date: 2000-02-03

    Applicant: IBM

    Abstract: A method for enabling user selectable input devices for dictation or transcription in a speech application, comprising the steps of: establishing a registry of dictation and transcription device descriptions, each of the descriptions including a device specific image, a device specific set of device-connecting instructions and a device specific list of audio configuration parameters; building dynamic tables containing information retrieved from the registry; establishing and storing a plurality of enrollments, each of the enrollments representing a speech file of user specific training data corresponding to at least one of a specific audio input device and a specific audio environment; and, generating a GUI display screen using the information in at least one of the dynamic tables to enable user selection of any input device in the registry for which one of the enrollments is available, for use as a dictation or transcription input to the speech application.
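
As a rough illustration of the registry, dynamic table, and selection steps, the following Python sketch builds a lookup table from a small device registry and lists only the devices for which an enrollment exists. All device names, file names, and parameter names are hypothetical, and the data layout is an assumption rather than anything specified in the abstract.

```python
from dataclasses import dataclass
from typing import Dict, List, Set


@dataclass
class DeviceDescription:
    """One registry entry, with the three parts named in the abstract."""
    name: str
    image: str                        # device-specific image (hypothetical file name)
    connecting_instructions: str      # device-specific connection instructions
    audio_parameters: Dict[str, int]  # hypothetical audio configuration parameters


# Hypothetical registry of dictation and transcription devices.
REGISTRY: List[DeviceDescription] = [
    DeviceDescription("headset microphone", "headset.png",
                      "Plug into the microphone jack.", {"input_gain": 60}),
    DeviceDescription("desktop microphone", "desktop.png",
                      "Plug into the microphone jack.", {"input_gain": 75}),
    DeviceDescription("handheld recorder", "recorder.png",
                      "Connect to the line-in jack.", {"input_gain": 40}),
]


def build_device_table(registry: List[DeviceDescription]) -> Dict[str, DeviceDescription]:
    """Build a table from the registry, keyed by device name."""
    return {description.name: description for description in registry}


def selectable_devices(table: Dict[str, DeviceDescription],
                       enrolled_devices: Set[str]) -> List[str]:
    """Return registry devices that have an enrollment, in registry order."""
    return [name for name in table if name in enrolled_devices]


table = build_device_table(REGISTRY)
# Devices for which this (hypothetical) user has completed training.
enrolled = {"handheld recorder", "headset microphone"}
choices = selectable_devices(table, enrolled)
```

A GUI would render `choices` as menu entries and could show each entry's image and connecting instructions from `table`.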

    User selectable input devices for speech applications

    Publication No.: GB2350709B

    Publication date: 2003-08-13

    Application No.: GB0002461

    Filing date: 2000-02-03

    Applicant: IBM

    Abstract: A method for enabling user selectable input devices for dictation or transcription in a speech application, comprising the steps of: establishing a registry of dictation and transcription device descriptions, each of the descriptions including a device specific image, a device specific set of device-connecting instructions and a device specific list of audio configuration parameters; building dynamic tables containing information retrieved from the registry; establishing and storing a plurality of enrollments, each of the enrollments representing a speech file of user specific training data corresponding to at least one of a specific audio input device and a specific audio environment; and, generating a GUI display screen using the information in at least one of the dynamic tables to enable user selection of any input device in the registry for which one of the enrollments is available, for use as a dictation or transcription input to the speech application.

    Maintaining input device identity

    Publication No.: GB2349001B

    Publication date: 2003-08-06

    Application No.: GB0002568

    Filing date: 2000-02-03

    Applicant: IBM

    Abstract: A method for maintaining input device identity in a speech application, comprising the steps of: storing a plurality of enrollments, each of the enrollments representing a speech file of training data associated with at least one of a specific audio input device and a specific audio environment for a specific user; generating a graphical user interface (GUI) display screen for prompting and enabling user selection of at least one of an audio input device and an audio environment; and, retrieving one of the enrollments responsive to the user selection, for use in a dictation or transcription session.

    User selectable input devices for speech applications

    Publication No.: GB2350709A

    Publication date: 2000-12-06

    Application No.: GB0002461

    Filing date: 2000-02-03

    Applicant: IBM

    Abstract: A method to enable a user to select input devices for a speech application comprises establishing a registry of device descriptions and building a dynamic table, establishing a plurality of enrollments and generating a GUI. Each device description includes a device specific image, a set of connecting instructions and a list of configuration parameters. Each enrollment represents a user specific file created in a speech training session and corresponds to at least one audio input device and an audio environment. The GUI uses the dynamic tables to enable a user, preferably by a series of nested menus, to select a device from the registry and an associated enrollment for use in the speech application. The registry can provide a list of compatible devices such that when an enrollment does not exist for a particular device, the user can select an enrollment for a compatible device without a training session.
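
The compatible-device fallback described in the last sentence of this abstract can be sketched as follows. This is only one plausible reading: the compatibility list, the device names, and the rule of taking the first compatible device that has an enrollment are assumptions for illustration.

```python
from typing import Dict, List, Optional

# Hypothetical compatibility list supplied by the registry:
# device name -> devices whose enrollments may be reused for it.
COMPATIBLE: Dict[str, List[str]] = {
    "usb headset": ["analog headset", "desktop microphone"],
    "handheld recorder": [],
}

# Hypothetical enrollments for one user: device name -> speech file.
ENROLLMENTS: Dict[str, str] = {
    "desktop microphone": "alice_desktop.dat",
}


def find_enrollment(device: str,
                    enrollments: Dict[str, str],
                    compatible: Dict[str, List[str]]) -> Optional[str]:
    """Return the device's own enrollment, else one from a compatible device."""
    if device in enrollments:
        return enrollments[device]
    # No enrollment for this device: try compatible devices in listed order.
    for other in compatible.get(device, []):
        if other in enrollments:
            return enrollments[other]
    # Nothing usable: a training session would be needed.
    return None


reused = find_enrollment("usb headset", ENROLLMENTS, COMPATIBLE)
missing = find_enrollment("handheld recorder", ENROLLMENTS, COMPATIBLE)
```

Here `reused` picks up the desktop microphone enrollment without a new training session, while `missing` is `None` because the recorder has neither an enrollment nor a compatible device with one.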
