MAINTENANCE OF INPUT DEVICE IDENTIFICATION INFORMATION

    Publication No.: JP2000250578A

    Publication date: 2000-09-14

    Application No.: JP2000046073

    Filing date: 2000-02-23

    Applicant: IBM

    Abstract: PROBLEM TO BE SOLVED: To ensure that user training data is correctly associated with the corresponding voice input device by generating a graphical user interface (GUI) display screen that enables user selection of at least one of a voice input device and an audio environment. SOLUTION: Enrollments are stored, each representing a voice file of training data associated with at least one of a specific voice input device and a specific audio environment for a specific user. A GUI display screen is then generated that prompts for and enables user selection of at least one of the voice input device and the audio environment. Specifically, when the user has two or more sound cards that support speech recognition, a GUI screen for selecting one of the sound cards is presented to the user. This screen lets the user select which sound card is used for input (recording) and which is used for output (playback).

    USER SELECTION ENABLE INPUT DEVICE FOR VOICE APPLICATION

    Publication No.: JP2000250734A

    Publication date: 2000-09-14

    Application No.: JP2000034466

    Filing date: 2000-02-14

    Applicant: IBM

    Abstract: PROBLEM TO BE SOLVED: To guide a user through the procedure for changing a voice input device by generating a GUI display screen using information in a dynamic table that contains information extracted from a registry of dictation recording and dictation playback device descriptions. SOLUTION: A GUI screen for selecting one of the sound cards that support speech recognition is displayed to the user, and the user selects the sound cards to be used for recording and for playback (S18). The type of the input device is then determined by reference to user-specific data such as the user selection, command-line parameters, registry entries and registration identification information, and the audio playback level is tested and adjusted to a comfortable setting (S20 and S22). One or more switch settings expected to prevent audio feedback are selected, audio feedback is prevented, and the input device is selected and connected.

    Speech processing

    Publication No.: GB2349001A

    Publication date: 2000-10-18

    Application No.: GB0002568

    Filing date: 2000-02-03

    Applicant: IBM

    Abstract: A method in a speech application has the steps of: storing a plurality of enrollments, each of the enrollments representing a speech file of training data associated with at least one of a specific audio input device and a specific audio environment for a specific user; generating a graphical user interface (GUI) display screen for prompting and enabling user selection of at least one of an audio input device and an audio environment; and retrieving one of the enrollments responsive to the user selection, for use in a dictation or transcription session.
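
The store-then-retrieve steps of this abstract can be illustrated with a minimal sketch. It assumes an in-memory store keyed by user, device and environment; the names `Enrollment`, `EnrollmentStore`, `store` and `retrieve` are hypothetical and do not come from the patent, and the GUI prompt is reduced to plain function arguments.

```python
from dataclasses import dataclass
from typing import Dict, Optional, Tuple


@dataclass(frozen=True)
class Enrollment:
    """Hypothetical record: one speech file of training data."""
    user: str
    device: str        # audio input device the training was done with
    environment: str   # audio environment the training was done in
    speech_file: str   # location of the training data (illustrative)


class EnrollmentStore:
    """Assumed in-memory store, keyed by (user, device, environment)."""

    def __init__(self) -> None:
        self._by_key: Dict[Tuple[str, str, str], Enrollment] = {}

    def store(self, enrollment: Enrollment) -> None:
        key = (enrollment.user, enrollment.device, enrollment.environment)
        self._by_key[key] = enrollment

    def retrieve(self, user: str, device: str,
                 environment: str) -> Optional[Enrollment]:
        # The device and environment stand in for the user's GUI selection.
        return self._by_key.get((user, device, environment))


enrollment_store = EnrollmentStore()
enrollment_store.store(Enrollment("alice", "headset", "office", "alice_headset_office.dat"))
enrollment_store.store(Enrollment("alice", "handheld", "car", "alice_handheld_car.dat"))
chosen = enrollment_store.retrieve("alice", "handheld", "car")
```

Keying on all three values means a change of microphone or of room selects different training data, which is the association the abstract describes.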

    User selectable input devices for speech applications

    Publication No.: GB2350709A8

    Publication date: 2001-03-06

    Application No.: GB0002461

    Filing date: 2000-02-03

    Applicant: IBM

    Abstract: A method for enabling user selectable input devices for dictation or transcription in a speech application, comprising the steps of: establishing a registry of dictation and transcription device descriptions, each of the descriptions including a device specific image, a device specific set of device-connecting instructions and a device specific list of audio configuration parameters; building dynamic tables containing information retrieved from the registry; establishing and storing a plurality of enrollments, each of the enrollments representing a speech file of user specific training data corresponding to at least one of a specific audio input device and a specific audio environment; and generating a GUI display screen using the information in at least one of the dynamic tables to enable user selection of any input device in the registry for which one of the enrollments is available, for use as a dictation or transcription input to the speech application.
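
As a rough illustration of the registry, dynamic table and selection steps in this abstract, the sketch below models a registry as a dictionary of device descriptions, derives a table of rows from it, and keeps only the rows for devices that have an enrollment. All names (`DeviceDescription`, `build_dynamic_table`, `selectable_rows`) and the sample device data are assumptions made for the example, not details taken from the patent.

```python
from dataclasses import dataclass
from typing import Dict, List, Set


@dataclass
class DeviceDescription:
    """Hypothetical registry entry for one dictation/transcription device."""
    name: str
    image: str                          # device-specific image (file name)
    connecting_instructions: List[str]  # device-connecting instructions
    audio_parameters: Dict[str, int]    # audio configuration parameters


def build_dynamic_table(registry: Dict[str, DeviceDescription]) -> List[Dict[str, object]]:
    """Copy the fields a GUI would need out of the registry into table rows."""
    return [
        {
            "device": desc.name,
            "image": desc.image,
            "steps": list(desc.connecting_instructions),
            "parameters": dict(desc.audio_parameters),
        }
        for desc in registry.values()
    ]


def selectable_rows(table: List[Dict[str, object]],
                    enrolled_devices: Set[str]) -> List[Dict[str, object]]:
    """Keep only devices for which an enrollment is available."""
    return [row for row in table if row["device"] in enrolled_devices]


registry = {
    "headset": DeviceDescription("headset", "headset.png",
                                 ["Plug into MIC jack"], {"input_gain": 40}),
    "handheld": DeviceDescription("handheld", "handheld.png",
                                  ["Connect cable to LINE IN"], {"input_gain": 25}),
}
table = build_dynamic_table(registry)
menu = selectable_rows(table, enrolled_devices={"handheld"})
```

Here `menu` holds the rows a GUI screen could present; devices without an enrollment are filtered out before display.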

    User selectable input devices for speech applications

    Publication No.: GB2350709B

    Publication date: 2003-08-13

    Application No.: GB0002461

    Filing date: 2000-02-03

    Applicant: IBM

    Abstract: A method for enabling user selectable input devices for dictation or transcription in a speech application, comprising the steps of: establishing a registry of dictation and transcription device descriptions, each of the descriptions including a device specific image, a device specific set of device-connecting instructions and a device specific list of audio configuration parameters; building dynamic tables containing information retrieved from the registry; establishing and storing a plurality of enrollments, each of the enrollments representing a speech file of user specific training data corresponding to at least one of a specific audio input device and a specific audio environment; and generating a GUI display screen using the information in at least one of the dynamic tables to enable user selection of any input device in the registry for which one of the enrollments is available, for use as a dictation or transcription input to the speech application.

    Maintaining input device identity

    Publication No.: GB2349001B

    Publication date: 2003-08-06

    Application No.: GB0002568

    Filing date: 2000-02-03

    Applicant: IBM

    Abstract: A method for maintaining input device identity in a speech application, comprising the steps of: storing a plurality of enrollments, each of the enrollments representing a speech file of training data associated with at least one of a specific audio input device and a specific audio environment for a specific user; generating a graphical user interface (GUI) display screen for prompting and enabling user selection of at least one of an audio input device and an audio environment; and, retrieving one of the enrollments responsive to the user selection, for use in a dictation or transcription session.

    User selectable input devices for speech applications

    Publication No.: GB2350709A

    Publication date: 2000-12-06

    Application No.: GB0002461

    Filing date: 2000-02-03

    Applicant: IBM

    Abstract: A method to enable a user to select input devices for a speech application comprises establishing a registry of device descriptions and building a dynamic table, establishing a plurality of enrollments and generating a GUI. Each device description includes a device specific image, a set of connecting instructions and a list of configuration parameters. Each enrollment represents a user specific file created in a speech training session and corresponds to at least one audio input device and an audio environment. The GUI uses the dynamic tables to enable a user, preferably by a series of nested menus, to select a device from the registry and an associated enrollment for use in the speech application. The registry can provide a list of compatible devices such that when an enrollment does not exist for a particular device, the user can select an enrollment for a compatible device without a training session.
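
The compatible-device fallback mentioned at the end of this abstract could look roughly like the following. This is a sketch under stated assumptions: enrollments are reduced to a mapping from device name to file name, the compatibility list is a plain dictionary, and `find_enrollment` is a hypothetical helper rather than anything named in the patent.

```python
from typing import Dict, List, Optional


def find_enrollment(device: str,
                    enrollments: Dict[str, str],
                    compatible: Dict[str, List[str]]) -> Optional[str]:
    """Return an enrollment for the device, or for a compatible device.

    enrollments maps a device name to an enrollment file (illustrative).
    compatible maps a device name to the devices the registry lists as
    compatible with it, in order of preference (assumed ordering).
    """
    if device in enrollments:
        return enrollments[device]
    for other in compatible.get(device, []):
        if other in enrollments:
            # Reuse the compatible device's enrollment; no new training session.
            return enrollments[other]
    return None  # a training session would be needed


enrollments = {"headset_a": "alice_headset_a.dat"}
compatible = {"headset_b": ["headset_c", "headset_a"]}

direct = find_enrollment("headset_a", enrollments, compatible)
fallback = find_enrollment("headset_b", enrollments, compatible)
missing = find_enrollment("handheld", enrollments, compatible)
```

A `None` result marks the case where neither the device nor any compatible device has an enrollment.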
