METHOD AND APPARATUS FOR MULTI-LEVEL DISTRIBUTED SPEECH RECOGNITION

    Publication No.: AU2002367354A1

    Publication date: 2003-07-24

    Application No.: AU2002367354

    Application date: 2002-12-20

    Applicant: MOTOROLA INC

    Abstract: A system and method for multi-level distributed speech recognition includes a terminal (122) having a terminal speech recognizer (136) coupled to a microphone (130). The terminal speech recognizer (136) receives an audio command (37) and generates at least one terminal recognized audio command having a terminal confidence value. A network element (124) having at least one network speech recognizer (150) also receives the audio command (149) and generates at least one network recognized audio command having a network confidence value. A comparator (152) receives the recognized audio commands and compares their speech recognition confidence values. The comparator (152) provides an output (162) of at least one recognized audio command to a dialog manager (160), which then executes an operation based on that command, such as presenting it to a user for verification or accessing a content server.
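
The comparator step described in this abstract can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not say how confidence values are scaled or compared, so the sketch assumes both recognizers report comparable scores in [0, 1] and simply keeps the highest-scoring hypotheses. The names `Hypothesis` and `compare` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Hypothesis:
    text: str          # recognized audio command
    confidence: float  # confidence value, assumed comparable across recognizers
    source: str        # "terminal" or "network"


def compare(terminal, network, n_best=1):
    """Merge both hypothesis lists and keep the n_best highest-confidence entries.

    Ranking by raw confidence is an assumption; the abstract only says the
    comparator compares the confidence values.
    """
    merged = sorted(terminal + network, key=lambda h: h.confidence, reverse=True)
    return merged[:n_best]


# Hypothetical recognizer outputs for one spoken command.
terminal_hyps = [Hypothesis("call home", 0.62, "terminal")]
network_hyps = [
    Hypothesis("call Rome", 0.48, "network"),
    Hypothesis("call home", 0.81, "network"),
]

# The selected hypothesis would then be handed to the dialog manager.
best = compare(terminal_hyps, network_hyps)
print(best[0].text, best[0].source)
```

Passing a larger `n_best` returns several candidates, which would suit the case where the dialog manager presents alternatives to the user for verification.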

    Method and apparatus for selective speech recognition

    Publication No.: AU2003295628A8

    Publication date: 2004-07-09

    Application No.: AU2003295628

    Application date: 2003-11-18

    Applicant: MOTOROLA INC

    Abstract: A method and apparatus for selective speech recognition includes receiving a media file (112) having a media type indicator (114). The method and apparatus further includes a browser (104) that receives the media file and a speech recognition engine selector (106) that receives the media type indicator from the browser (104). The speech recognition engine selector (106) then selects either a first speech recognition engine (108) or a second speech recognition engine (110) in response to the media type indicator. The method and apparatus further includes an audio receiver (102) that receives an audio input (116), which is provided to whichever of the first speech recognition engine (108) or the second speech recognition engine (110) is enabled. Power consumption is thereby reduced, because a speech recognition engine (108 or 110) remains disabled until it is actively selected by the speech recognition engine selector (106).
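
A minimal sketch of the selection logic in this abstract, assuming the media type indicator is a plain string and that exactly one engine is enabled at a time. The class names, the `"type-a"`/`"type-b"` indicator values, and the placeholder `recognize` method are all hypothetical; the abstract does not specify them.

```python
class Engine:
    """Stand-in for a speech recognition engine that can be switched off."""

    def __init__(self, name):
        self.name = name
        self.enabled = False  # engines start disabled to save power

    def recognize(self, audio: bytes) -> str:
        if not self.enabled:
            raise RuntimeError(f"{self.name} is disabled")
        # Placeholder result; a real engine would decode the audio here.
        return f"<{self.name} result for {len(audio)} bytes>"


class EngineSelector:
    """Enables one of two engines based on a media type indicator."""

    def __init__(self, first, second, first_media_types):
        self.first = first
        self.second = second
        self.first_media_types = set(first_media_types)

    def select(self, media_type_indicator):
        use_first = media_type_indicator in self.first_media_types
        chosen, other = (self.first, self.second) if use_first else (self.second, self.first)
        chosen.enabled = True
        other.enabled = False  # keep the unused engine powered down
        return chosen


first = Engine("engine-1")
second = Engine("engine-2")
selector = EngineSelector(first, second, first_media_types={"type-a"})

# The browser would pass along the indicator found in the media file.
active = selector.select("type-b")
result = active.recognize(b"\x00" * 320)
```

Routing every indicator that is not listed to the second engine is a design shortcut for the sketch; a real selector might instead reject unknown media types.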

    (Title unknown)

    Publication No.: FI20040864A0

    Publication date: 2004-06-21

    Application No.: FI20040864

    Application date: 2004-06-21

    Applicant: MOTOROLA INC

    Abstract: A multimodal communication system and method creates and accesses a multimodal profile (114) that contains at least multimodal preference information (202), such as a desired input modality and a desired output modality for a given multimodal communication session. The multimodal profile (114) may also include at least one identifier (204) associated with the multimodal preference information (202). A multimodal communication apparatus (102) includes a multimodal profile generator (110) that accesses and/or generates a multimodal profile (114). A multimodal communication apparatus configuration controller (112), which is operatively responsive to the accessed multimodal preference information (124) from a given user profile, configures the multimodal communication apparatus (102) and/or a network element for the multimodal communication session based on the accessed multimodal preference information (124) in the multimodal profile.

    MULTIMODAL COMMUNICATION METHOD AND APPARATUS WITH MULTIMODAL PROFILE

    Publication No.: AU2002357349A1

    Publication date: 2003-07-24

    Application No.: AU2002357349

    Application date: 2002-12-20

    Applicant: MOTOROLA INC

    Abstract: A multimodal communication system and method creates and accesses a multimodal profile (114) that contains at least multimodal preference information (202), such as a desired input modality and a desired output modality for a given multimodal communication session. The multimodal profile (114) may also include at least one identifier (204) associated with the multimodal preference information (202). A multimodal communication apparatus (102) includes a multimodal profile generator (110) that accesses and/or generates a multimodal profile (114). A multimodal communication apparatus configuration controller (112), which is operatively responsive to the accessed multimodal preference information (124) from a given user profile, configures the multimodal communication apparatus (102) and/or a network element for the multimodal communication session based on the accessed multimodal preference information (124) in the multimodal profile.
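
One way to picture the profile and the configuration step is the sketch below. It assumes a profile is a small record keyed by its identifier and that configuring the apparatus means copying the preferred modalities onto it; the field names, the identifier values, and the `configure` helper are hypothetical and not taken from the patent.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MultimodalProfile:
    identifier: str       # label associated with the preferences (hypothetical values below)
    input_modality: str   # desired input modality, e.g. "voice" or "text"
    output_modality: str  # desired output modality


@dataclass
class Apparatus:
    input_modality: str = "text"
    output_modality: str = "text"


def configure(apparatus, profiles, identifier):
    """Apply the preferences stored under `identifier` to the apparatus."""
    profile = profiles[identifier]  # raises KeyError for an unknown identifier
    apparatus.input_modality = profile.input_modality
    apparatus.output_modality = profile.output_modality
    return apparatus


# Profiles a profile generator might have created for one user.
profiles = {
    p.identifier: p
    for p in (
        MultimodalProfile("in-car", "voice", "voice"),
        MultimodalProfile("meeting", "text", "text"),
    )
}

device = configure(Apparatus(), profiles, "in-car")
```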

    MULTI-MODAL COMMUNICATION USING A SESSION SPECIFIC PROXY SERVER

    Publication No.: AU2002351406A1

    Publication date: 2003-07-24

    Application No.: AU2002351406

    Application date: 2002-12-20

    Applicant: MOTOROLA INC

    Abstract: A method and apparatus for multi-modal communication includes a controller (236) operably coupled to at least one multi-modal session proxy server (226). On a per multi-modal session basis, the controller (236) provides the multi-modal session proxy server (226) with a multi-modal proxy identifier (138). The multi-modal proxy identifier (138) is then provided to at least one browser with a per-session multi-modal proxy evaluator (220) having a browser proxy identifier (140), and the browser proxy identifier (140) is evaluated in view of the multi-modal proxy identifier (138). The multi-modal session proxy server (226) then receives an information request (231) from the browser with the per-session multi-modal proxy evaluator (220), and the requested information is fetched from a content server (240). When the requested information is retrieved, a multi-modal synchronization coordinator (122) notifies the other browser with a per-session multi-modal proxy evaluator (232) via a multi-modal synchronization interface (234).

    FLOOR CONTROL IN MULTIMEDIA PUSH-TO-TALK

    Publication No.: EP1690428A4

    Publication date: 2007-10-17

    Application No.: EP04811848

    Application date: 2004-11-23

    Applicant: MOTOROLA INC

    Abstract: An apparatus, architecture and method for floor control in a Push-to-Talk system. A mobile station (203) may transmit one or more floor request messages and may request multiple floors. Each floor may correspond to a media type having multiple media streams. A PoC server (201) assigns a priority to media types and/or media streams such that, for example, a mobile station (203) may have a floor to transmit a video clip having audio and video streams to a talk group (207), while a member of the talk group has a floor to transmit audio voice commentary on the media to the talk group (207). The embodiments of the present invention enable multimedia communication use cases without the need to duplicate the state machine at each node, thereby conserving resources.
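
The per-media-type floors in this abstract can be illustrated with a small sketch. It assumes one floor per media type, at most one holder per floor, and that the server-assigned priority only decides the order in which the floors of a multi-floor request are processed; the abstract does not say how priorities are actually used. The `FloorController` class and the media type names are hypothetical.

```python
class FloorController:
    """Toy stand-in for the floor-granting role of a PoC server."""

    def __init__(self, priorities):
        # media type -> priority, lower number handled first (an assumption)
        self.priorities = dict(priorities)
        self.holders = {}  # media type -> station currently holding that floor

    def request(self, station, media_types):
        """Grant every requested floor that is currently free.

        Returns the granted media types in priority order.
        """
        granted = []
        for media in sorted(media_types, key=lambda m: self.priorities[m]):
            if media not in self.holders:
                self.holders[media] = station
                granted.append(media)
        return granted

    def release(self, station, media):
        """Free a floor, but only if `station` is the one holding it."""
        if self.holders.get(media) == station:
            del self.holders[media]


server = FloorController({"video-clip": 1, "voice-commentary": 2})

# One station sends a video clip while another comments on it by voice.
clip_grant = server.request("station-A", ["video-clip"])
voice_grant = server.request("station-B", ["voice-commentary"])

# A third station is refused while the voice floor is taken.
denied = server.request("station-C", ["voice-commentary"])
```

Because each floor is tracked separately in one place, two stations can hold different floors at the same time, which mirrors the video-plus-commentary use case in the abstract.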
