-
公开(公告)号:AU2002367354A1
公开(公告)日:2003-07-24
申请号:AU2002367354
申请日:2002-12-20
Applicant: MOTOROLA INC
Inventor: BALASURIYA SENAKA
Abstract: A system and method for multi-level distributed speech recognition includes a terminal ( 122 ) having a terminal speech recognizer ( 136 ) coupled to a microphone ( 130 ). The terminal speech recognizer ( 136 ) receives an audio command ( 37 ), generating at least one terminal recognized audio command having a terminal confidence value. A network element ( 124 ) having at least one network speech recognizer ( 150 ) also receives the audio command ( 149 ), generating a at least one network recognized audio command having a network confidence value. A comparator ( 152 ) receives the recognized audio commands, comparing compares the speech recognition confidence values. The comparator ( 152 ) provides an output ( 162 ) to a dialog manager ( 160 ) of at least one recognized audio command, wherein the dialog manager then executes an operation based on the at least one recognized audio command, such as presenting the at least one recognized audio command to a user for verification or accessing a content server.
-
公开(公告)号:AU2003295628A8
公开(公告)日:2004-07-09
申请号:AU2003295628
申请日:2003-11-18
Applicant: MOTOROLA INC
Inventor: BALASURIYA SENAKA
Abstract: A method and apparatus for selective speech recognition includes receiving a media file (112) having a media type indicator (114). The method and apparatus further includes a browser (104) that receives the media file and a speech recognition engine selector (106) that receives the media type indicator from the browser (104). The selected speech recognition engine selector (106) then selects either a first speech recognition engine (108) or a second speech recognition engine (110), in response to the media type indicator. The method and apparatus further includes an audio receiver (102) that receives an audio input (116) which is provided to the enabled first speech recognition engine (108) or the second speech recognition engine (110), thereupon allowing for the reduction in power consumption by disabling a speech recognition engine (108 or 110) until actively selected by the speech recognition engine selector (106).
-
公开(公告)号:FI20040864A0
公开(公告)日:2004-06-21
申请号:FI20040864
申请日:2004-06-21
Applicant: MOTOROLA INC
Inventor: BALASURIYA SENAKA
Abstract: A multimodal communication system and method creates and accesses a multimodal profile ( 114 ) that contains at least multimodal preference information ( 202 ), such as desired input modality and a desired output modality for a given multimodal communication session. The multimodal profile ( 114 ) may also include at least one identifier ( 204 ) associated with the multimodal preference information ( 202 ). A multimodal communication apparatus ( 102 ) includes a multimodal profile generator ( 110 ) that accesses and/or generates a multimodal profile ( 114 ). A multimodal communication apparatus configuration controller ( 112 ) which is operatively responsive to the accessed multimodal preference information ( 124 ) from a given user profile, configures the multimodal communication apparatus ( 102 ) and/or a network element for the multimodal communication session based on the accessed multimodal preference information ( 124 ) in the multimodal profile.
-
公开(公告)号:AU2003209037A8
公开(公告)日:2003-09-09
申请号:AU2003209037
申请日:2003-02-06
Applicant: MOTOROLA INC
Inventor: JAHNKE JEROME , CUKA DAVID , JOHNSON GREG , GALAGEDARA DILANI , FERRANS JAMES , PIERCE RAINU , BALASURIYA SENAKA
Abstract: A multimodal network element (14) comprises a plurality of proxies (38a ... 38n) that each send a request for concurrent multimodal input information corresponding to multiple input modalities associated with a plurality user agent programs (30, 34) operating during a same session and a multimodal fusion engine (44). The multimodal fusion engine (44) is operatively responsive to received concurrent multimodal input information sent from the plurality of user agent programs (30, 34) sent in response to the request for concurrent different multimodal information and is operative to fuse the different multimodal input information sent from the plurality of user agent programs (30, 34) to provide concurrent multimodal communication from differing user agent programs during a same session.
-
公开(公告)号:AU2003209037A1
公开(公告)日:2003-09-09
申请号:AU2003209037
申请日:2003-02-06
Applicant: MOTOROLA INC
Inventor: PIERCE RAINU , CUKA DAVID , GALAGEDARA DILANI , JOHNSON GREG , BALASURIYA SENAKA , FERRANS JAMES , JAHNKE JEROME
Abstract: A multimodal network element (14) comprises a plurality of proxies (38a ... 38n) that each send a request for concurrent multimodal input information corresponding to multiple input modalities associated with a plurality user agent programs (30, 34) operating during a same session and a multimodal fusion engine (44). The multimodal fusion engine (44) is operatively responsive to received concurrent multimodal input information sent from the plurality of user agent programs (30, 34) sent in response to the request for concurrent different multimodal information and is operative to fuse the different multimodal input information sent from the plurality of user agent programs (30, 34) to provide concurrent multimodal communication from differing user agent programs during a same session.
-
公开(公告)号:AU2003209036A1
公开(公告)日:2003-09-09
申请号:AU2003209036
申请日:2003-02-06
Applicant: MOTOROLA INC
Inventor: JAHNKE JEROME , PIERCE RAINU , CUKA DAVID , GALAGEDARA DILANI , JOHNSON GREG , BALASURIYA SENAKA , FERRANS JAMES
IPC: G06F3/16 , G06F9/44 , G06F13/00 , H04B20060101 , H04B1/38 , H04L20060101 , H04L29/08
Abstract: A method and apparatus maintains, during non-session conditions and on a per user basis, concurrent multimodal session status information of user agent programs configured for different concurrent modality communication during the same session, and re-establish a concurrent multimodal session in response to accessing the concurrent multimodal session status information.
-
公开(公告)号:AU2002357349A1
公开(公告)日:2003-07-24
申请号:AU2002357349
申请日:2002-12-20
Applicant: MOTOROLA INC
Inventor: BALASURIYA SENAKA
Abstract: A multimodal communication system and method creates and accesses a multimodal profile ( 114 ) that contains at least multimodal preference information ( 202 ), such as desired input modality and a desired output modality for a given multimodal communication session. The multimodal profile ( 114 ) may also include at least one identifier ( 204 ) associated with the multimodal preference information ( 202 ). A multimodal communication apparatus ( 102 ) includes a multimodal profile generator ( 110 ) that accesses and/or generates a multimodal profile ( 114 ). A multimodal communication apparatus configuration controller ( 112 ) which is operatively responsive to the accessed multimodal preference information ( 124 ) from a given user profile, configures the multimodal communication apparatus ( 102 ) and/or a network element for the multimodal communication session based on the accessed multimodal preference information ( 124 ) in the multimodal profile.
-
公开(公告)号:AU2002351406A1
公开(公告)日:2003-07-24
申请号:AU2002351406
申请日:2002-12-20
Applicant: MOTOROLA INC
Inventor: BALASURIYA SENAKA
Abstract: A method and apparatus for multi-modal communication includes a controller (236) operably coupled to at least one multi-modal session proxy server (226). On a per multi-modal session basis, the controller (236) provides the multi-modal session proxy server (226) with a multi-modal proxy identifier (138). The multi-modal proxy identifier (138) is then provided to at least one browser with a per session multi-modal proxy evaluator (220) having a browser proxy identifier (140) wherein the browser proxy identifier (140) is evaluated in view of the multi-modal proxy identifier (138). The multi-modal session proxy server (226) then receives an information request (231) from the browser with per session multi-modal proxy evaluator (220) wherein the requested information is fetched from a content server (240). When the requested information is retrieved, a multi-modal synchronization coordinator (122) notifies the other browser with per session multi-modal proxy evaluator (232), via a multi-modal synchronization interface (234).
-
公开(公告)号:EP1690428A4
公开(公告)日:2007-10-17
申请号:EP04811848
申请日:2004-11-23
Applicant: MOTOROLA INC
Inventor: BALASURIYA SENAKA , JAGADESAN BALAKUMAR
CPC classification number: H04W4/10 , H04L65/1016 , H04L65/4061 , H04W72/005 , H04W72/10 , H04W76/005
Abstract: An apparatus, architecture and method for floor control in a Push-to-Talk system. A mobile station ( 203 ) may transmit a floor request message or messages and request multiple floors. Each floor may correspond to a media type having multiple media streams. A PoC server ( 201 ) assigns a priority to media types and/or media streams such that for example, a mobile station ( 203 ) may have a floor to transmit a video clip having audio and video streams to a talk group ( 207 ), and a member of the talk group may have a floor to transmit audio voice commentary on the media to the talk group ( 207 ). The embodiments of the present invention enable multimedia communication use cases without the need for duplication of the state machine at each node, thereby conserving resources.
-
20.
公开(公告)号:EP1481334A4
公开(公告)日:2005-11-23
申请号:EP03707762
申请日:2003-02-06
Applicant: MOTOROLA INC
Inventor: JOHNSON GREG , BALASURIYA SENAKA , FERRANS JAMES , JAHNKE JEROME , PIERCE RAINU , CUKA DAVID , GALAGEDARA DILANI
CPC classification number: G06F17/30899 , G06F3/16 , H04M1/72561 , H04M3/4938
Abstract: A multimodal network element 14 facilitates concurrent multimodal communication sessions through differing user agent programs 30, 34 on one or more devices 12, 16. For example, a user agent program communicating in a voice mode, such as a voice browser 34 in a voice gateway 16 that includes a speech engine and call/session termination, is synchronized with another user agent program operating in a different modality, such as a graphical browser 30 on a mobile device 12. The plurality of user agent programs 30, 34 are operatively coupled with a content server 18 during a session to enable concurrent multimodal interaction.
-
-
-
-
-
-
-
-
-