INTELLIGENT PERSONAL ASSISTANTS
    1.
    发明申请
    INTELLIGENT PERSONAL ASSISTANTS 审中-公开
    智能个人助理

    公开(公告)号:WO2003073417A2

    公开(公告)日:2003-09-04

    申请号:PCT/US2003/006218

    申请日:2003-02-26

    Inventor: GONG, Li

    CPC classification number: G06N3/004 G06F9/453

    Abstract: An intelligent social agent is an animated computer interface agent with social intelligence that has been developed for a given application or type of applications and a particular user population. The social intelligence of the agent comes from the ability of the agent to be appealing, affective, adaptive, and an appropriate when interacting with the user. An intelligent personal assistant is an implementation of an intelligent social agent that assists a user in operating a computing device and using application programs on a computing device.

    Abstract translation: 智能社交代理是一种具有社交智能的动画计算机接口代理,已经针对给定的应用程序或类型的应用程序和特定的用户群体进行了开发(图4b)。 代理人的社会智慧来自于代理人与用户交互时的吸引力,情感性,适应性和适当性的能力。 智能个人助理是智能社交代理的实现,它帮助用户操作计算设备并在计算设备上使用应用程序。

    SPEECH ANIMATION
    2.
    发明申请
    SPEECH ANIMATION 审中-公开
    演讲动画

    公开(公告)号:WO2005020213A1

    公开(公告)日:2005-03-03

    申请号:PCT/US2004/026520

    申请日:2004-08-13

    CPC classification number: G06T13/205 G10L2021/105

    Abstract: Methods and systems, including computer program products, for speech animation. The system (100) includes a speech animation engine (110) and a client application (120) in communication with the speech animation engine (110). The client application sends a request for speech animation to the speech animation engine (110). The request identifies data (140) to be used to generate the speech animation, where speech animation is speech synchronized with facial expressions. The client application (120) receives a response from the speech animation engine (110). The response identifies the generated speech animation. The client application (120) uses the generated speech animation to animate a talking agent (150) displayed on a user interface (130) of the client application (120). The speech animation engine (110) receives the request for speech animation from the client application (120), retrieves the data (140) identified in the request without user intervention, generates the speech animation using the retrieved data and sends the response identifying the generated speech animation to the client application (120).

    Abstract translation: 方法和系统,包括计算机程序产品,用于语音动画。 系统(100)包括与语音动画引擎(110)通信的语音动画引擎(110)和客户端应用(120)。 客户应用向语音动画引擎(110)发送语音动画请求。 该请求标识要用于生成语音动画的数据(140),其中语音动画是与面部表情同步的语音。 客户端应用程序(120)从语音动画引擎(110)接收响应。 响应标识生成的语音动画。 客户端应用程序(120)使用生成的语音动画来动画化显示在客户端应用程序(120)的用户界面(130)上的通话代理(150)。 语音动画引擎(110)从客户应用程序(120)接收对语音动画的请求,在没有用户干预的情况下检索请求中识别的数据(140),并使用检索到的数据生成语音动画,并发送标识生成的 语音动画到客户端应用程序(120)。

    MULTI-MODAL WAREHOUSE APPLICATIONS
    3.
    发明申请

    公开(公告)号:WO2004084024A3

    公开(公告)日:2004-09-30

    申请号:PCT/US2004/007724

    申请日:2004-03-12

    Abstract: An inventory management system includes an electronic device operable to receive job data related to a task, such as for example, picking, stocking, or counting, performed by a worker (3510) in a warehouse (3502), in a selected one of a plurality of available input modalities. The system also includes an inventory database (3517) operable to store inventory data that includes count information and location information for each of a plurality of items, accessible in a plurality of formats, such as, for example Voice Extensible Markup Language (VXML) or Hyper Text Markup Language (HTML), each compatible with one of the available input modalities. The system also includes a format determination system (3546) operable to input inventory data in a received one of the formats and determine corresponding inventory data in remaining ones of the formats. The system also includes a server (3544) operable to receive the job data in the received format, communicate with the format determination system (3546) to determine the remaining formats, and output updated inventory data to the electronic device, such that the inventory data is maintained during performance of inventory management tasks.

    MULTI-MODAL SYNCHRONIZATION
    4.
    发明申请
    MULTI-MODAL SYNCHRONIZATION 审中-公开
    多模式同步

    公开(公告)号:WO2003067413A1

    公开(公告)日:2003-08-14

    申请号:PCT/US2003/003828

    申请日:2003-02-07

    CPC classification number: H04M3/4938 H04M2201/22

    Abstract: A system (110) for synchronizing multiple modalities is described. A user may use multiple modalities, such as voice and browser, to interact with data on a network, such as the World Wide Web. All of the modalities may be synchronized so that all are updated when the user enters information in just one. A method of communicating between devices (160, 185) includes receiving a request for first-modality data that includes first content, and sending a message in response to receiving the request, the message including information allowing the request of second-modality data that includes second content overlapping the first content. Another method includes requesting first data for a first modality, the first data including first content, and automatically requesting second data for a second modality, wherein the second data includes second content that overlaps the first content.

    Abstract translation: 描述用于同步多个模态的系统(110)。 用户可以使用诸如语音和浏览器的多种模式来与诸如万维网的网络上的数据进行交互。 所有模式可以被同步,以便当用户仅在一个中输入信息时,所有模式都被更新。 一种在设备(160,185)之间通信的方法包括:接收对包括第一内容的第一模态数据的请求,以及响应于接收到所述请求而发送消息,所述消息包括允许包括第二模态数据的请求的信息 第二内容与第一内容重叠。 另一种方法包括向第一模式请求第一数据,第一数据包括第一内容,以及自动请求用于第二模式的第二数据,其中第二数据包括与第一内容重叠的第二内容。

    INTELLIGENT PERSONAL ASSISTANTS
    6.
    发明申请

    公开(公告)号:WO2003073417A3

    公开(公告)日:2003-09-04

    申请号:PCT/US2003/006218

    申请日:2003-02-26

    Inventor: GONG, Li

    Abstract: An intelligent social agent is an animated computer interface agent with social intelligence that has been developed for a given application or type of applications and a particular user population (Figure 4b). The social intelligence of the agent comes from the ability of the agent to be appealing, affective, adaptive, and an appropriate when interacting with the user. An intelligent personal assistant is an implementation of an intelligent social agent that assists a user in operating a computing device and using application programs on a computing device.

    USER INTERFACE AND DYNAMIC GRAMMAR IN A MULTI-MODAL SYNCHRONIZATION ARCHITECTURE
    7.
    发明申请
    USER INTERFACE AND DYNAMIC GRAMMAR IN A MULTI-MODAL SYNCHRONIZATION ARCHITECTURE 审中-公开
    多模式同步架构中的用户界面和动态灰度

    公开(公告)号:WO2003067443A1

    公开(公告)日:2003-08-14

    申请号:PCT/US2003/003752

    申请日:2003-02-07

    CPC classification number: H04M3/4938 H04M3/4931 H04M3/4936 H04M2201/40

    Abstract: A first-modality gateway (165) and a second-modality gateway (185) are synchronized, with both gateways interfacing between a user and a server system. Various approaches are described for structuring a grammar of a voice recognition by limiting the amount of the grammar that is searched, thus minimizing the incidence of misrecognition. Communicating with a user may include presenting the user a first set of options and a second set of options, wherein the second set of options is limited based on the user's selection from the first set of options. A graphical user interface (2410) may include a form with a plurality of fields, each field associated with a predetermined category. Each category may have its own, independent, discrete grammar associated therewith, and the independent grammars (2420, 2430, 2440) may be individually activated, simultaneously with their respective categories.

    Abstract translation: 第一模式网关(165)和第二模态网关(185)被同步,两个网关在用户和服务器系统之间进行接口。 描述了通过限制搜索的语法的量来构造语音识别的语法的各种方法,从而最小化误识别的发生率。 与用户通信可以包括向用户呈现第一组选项和第二组选项,其中基于来自第一组选项的用户的选择来限制第二组选项。 图形用户界面(2410)可以包括具有多个字段的表单,每个字段与预定类别相关联。 每个类别可以具有与其相关联的其自己的,独立的离散语法,并且独立语法(2420,2430,2440)可以与它们各自的类别同时被单独激活。

    MULTI-MODAL SYNCHRONIZATION
    8.
    发明公开
    MULTI-MODAL SYNCHRONIZATION 有权
    多模式同步

    公开(公告)号:EP1483654A1

    公开(公告)日:2004-12-08

    申请号:EP03737710.8

    申请日:2003-02-07

    CPC classification number: H04M3/4938 H04M2201/22

    Abstract: A system (110) for synchronizing multiple modalities is described. A user may use multiple modalities, such as voice and browser, to interact with data on a network, such as the World Wide Web. All of the modalities may be synchronized so that all are updated when the user enters information in just one. A method of communicating between devices (160, 185) includes receiving a request for first-modality data that includes first content, and sending a message in response to receiving the request, the message including information allowing the request of second-modality data that includes second content overlapping the first content. Another method includes requesting first data for a first modality, the first data including first content, and automatically requesting second data for a second modality, wherein the second data includes second content that overlaps the first content.

    USER INTERFACE AND DYNAMIC GRAMMAR IN A MULTI-MODAL SYNCHRONIZATION ARCHITECTURE
    9.
    发明公开
    USER INTERFACE AND DYNAMIC GRAMMAR IN A MULTI-MODAL SYNCHRONIZATION ARCHITECTURE 有权
    用户界面和动态语法在多模式同步体系结构

    公开(公告)号:EP1481328A1

    公开(公告)日:2004-12-01

    申请号:EP03710916.2

    申请日:2003-02-07

    CPC classification number: H04M3/4938 H04M3/4931 H04M3/4936 H04M2201/40

    Abstract: A first-modality gateway (165) and a second-modality gateway (185) are synchronized, with both gateways interfacing between a user and a server system. Various approaches are described for structuring a grammar of a voice recognition by limiting the amount of the grammar that is searched, thus minimizing the incidence of misrecognition. Communicating with a user may include presenting the user a first set of options and a second set of options, wherein the second set of options is limited based on the user's selection from the first set of options. A graphical user interface (2410) may include a form with a plurality of fields, each field associated with a predetermined category. Each category may have its own, independent, discrete grammar associated therewith, and the independent grammars (2420, 2430, 2440) may be individually activated, simultaneously with their respective categories.

    INTELLIGENT PERSONAL ASSISTANTS
    10.
    发明公开
    INTELLIGENT PERSONAL ASSISTANTS 审中-公开
    智能个人助理

    公开(公告)号:EP1490864A2

    公开(公告)日:2004-12-29

    申请号:EP03743263.0

    申请日:2003-02-26

    Inventor: GONG, Li

    CPC classification number: G06N3/004 G06F9/453

    Abstract: An intelligent social agent is an animated computer interface agent with social intelligence that has been developed for a given application or type of applications and a particular user population (Figure 4b). The social intelligence of the agent comes from the ability of the agent to be appealing, affective, adaptive, and an appropriate when interacting with the user. An intelligent personal assistant is an implementation of an intelligent social agent that assists a user in operating a computing device and using application programs on a computing device.

Patent Agency Ranking