VISUAL FEEDBACK FOR SPEECH RECOGNITION SYSTEM
    1.
    Invention application (status: under examination, published)

    Publication No.: WO2014116548A1

    Publication date: 2014-07-31

    Application No.: PCT/US2014/012229

    Filing date: 2014-01-21

    CPC classification number: G10L21/10 G06F3/0304 G06F3/167 G10L2015/225

    Abstract: Embodiments are disclosed that relate to providing visual feedback in a speech recognition system. For example, one disclosed embodiment provides a method including displaying a graphical feedback indicator having a variable appearance dependent upon a state of the speech recognition system. The method further comprises receiving a speech input, modifying an appearance of the graphical feedback indicator in a first manner if the speech input is heard and understood by the system, and modifying the appearance of the graphical feedback indicator in a different manner than the first manner if the speech input is heard and not understood.
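
The three outcomes in the abstract above can be sketched as a small selection function. This is a minimal illustration, not the patented implementation: the `Appearance` names and the `indicator_appearance` function are hypothetical, and the abstract does not say what the indicator actually looks like in each state.

```python
from enum import Enum


class Appearance(Enum):
    """Hypothetical indicator appearances; the abstract names none."""
    LISTENING = "listening"
    UNDERSTOOD = "understood"
    NOT_UNDERSTOOD = "not_understood"


def indicator_appearance(heard: bool, understood: bool) -> Appearance:
    """Pick the indicator appearance for one speech input."""
    if not heard:
        # Nothing was heard, so the indicator keeps its default look.
        return Appearance.LISTENING
    if understood:
        # Heard and understood: change the appearance in the first manner.
        return Appearance.UNDERSTOOD
    # Heard but not understood: change it in a different manner.
    return Appearance.NOT_UNDERSTOOD
```

Keeping the decision in one pure function makes the "different manner" requirement easy to check: the two heard cases can never return the same value.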

    INDEFINITE SPEECH INPUTS
    2.
    Invention application (status: under examination, published)

    Publication No.: WO2014116543A1

    Publication date: 2014-07-31

    Application No.: PCT/US2014/012224

    Filing date: 2014-01-21

    CPC classification number: G10L21/10 G10L15/1815 G10L15/22

    Abstract: Embodiments are disclosed that relate to the use of speech inputs including indefinite quantitative terms as computing device inputs. For example, one disclosed embodiment provides a method of operating a computing device, the method including receiving a speech input comprising an indefinite quantitative term, determining a definite quantity corresponding to the indefinite quantitative term, and applying the definite quantity to an action performed via the computing device in response to the speech input.
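
The receive, determine, apply sequence in the abstract above might look like the sketch below. The lookup table, its values, and the function names are assumptions made for illustration; the abstract does not say how a definite quantity is chosen for a given term.

```python
# Hypothetical mapping from indefinite terms to definite quantities.
INDEFINITE_QUANTITIES = {"a couple": 2, "a few": 3, "several": 5}


def resolve_quantity(utterance: str, default: int = 1) -> int:
    """Return a definite quantity for the first indefinite term found."""
    text = utterance.lower()
    # Try longer phrases first so a short term cannot shadow a longer one.
    for term in sorted(INDEFINITE_QUANTITIES, key=len, reverse=True):
        if term in text:
            return INDEFINITE_QUANTITIES[term]
    # No indefinite term present: fall back to a single step.
    return default


def skip_ahead(current_track: int, utterance: str) -> int:
    """Apply the resolved quantity to a (hypothetical) skip action."""
    return current_track + resolve_quantity(utterance)
```

With this table, `skip_ahead(10, "skip ahead a few songs")` moves to track 13.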

    ERGONOMIC PHYSICAL INTERACTION ZONE CURSOR MAPPING
    4.
    Invention application (status: under examination, published)

    Publication No.: WO2015017294A1

    Publication date: 2015-02-05

    Application No.: PCT/US2014/048340

    Filing date: 2014-07-28

    Abstract: Users move their hands in a three dimensional ("3D") physical interaction zone ("PHIZ") to control a cursor in a user interface ("UI") shown on a computer-coupled 2D display such as a television or monitor. The PHIZ is shaped, sized, and positioned relative to the user to ergonomically match the user's natural range of motions so that cursor control is intuitive and comfortable over the entire region on the UI that supports cursor interaction. A motion capture system tracks the user's hand so that the user's 3D motions within the PHIZ can be mapped to the 2D UI. Accordingly, when the user moves his or her hands in the PHIZ, the cursor correspondingly moves on the display. Movement in the z direction (i.e., back and forth) in the PHIZ allows for additional interactions to be performed such as pressing, zooming, 3D manipulations, or other forms of input to the UI.
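
As a rough illustration of the 3D-to-2D mapping described above, the sketch below treats the PHIZ as an axis-aligned box positioned relative to the user. That box shape is an assumption (the abstract only says the zone is shaped ergonomically), and the `Phiz` type, its field names, and `map_hand_to_cursor` are hypothetical.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class Phiz:
    """Assumed box-shaped interaction zone, in metres relative to the user."""
    x_min: float
    x_max: float
    y_min: float
    y_max: float
    z_min: float
    z_max: float


def _unit(value: float, low: float, high: float) -> float:
    """Normalize value into [0, 1], clamping positions outside the zone."""
    t = (value - low) / (high - low)
    return min(1.0, max(0.0, t))


def map_hand_to_cursor(
    hand: Tuple[float, float, float], phiz: Phiz, screen_w: int, screen_h: int
) -> Tuple[int, int, float]:
    """Map a 3D hand position to (pixel_x, pixel_y, press_depth)."""
    x, y, z = hand
    u = _unit(x, phiz.x_min, phiz.x_max)
    v = _unit(y, phiz.y_min, phiz.y_max)
    # z travel is reported separately so it can drive press or zoom input.
    press = _unit(z, phiz.z_min, phiz.z_max)
    # Physical y grows upward while screen y grows downward, so flip v.
    pixel_x = round(u * (screen_w - 1))
    pixel_y = round((1.0 - v) * (screen_h - 1))
    return pixel_x, pixel_y, press
```

Clamping keeps the cursor on screen when the hand leaves the zone; a real system would likely smooth the tracked position as well.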

    DETECTING NATURAL USER-INPUT ENGAGEMENT
    5.
    Invention application (status: under examination, published)

    Publication No.: WO2014124065A1

    Publication date: 2014-08-14

    Application No.: PCT/US2014/014972

    Filing date: 2014-02-06

    CPC classification number: G06F3/005 G06F3/011 G06F3/017

    Abstract: An NUI system to provide user input to a computer system. The NUI system includes a logic machine and an instruction-storage machine. The instruction-storage machine holds instructions that, when executed by the logic machine, cause the logic machine to detect an engagement gesture from a human subject or to compute an engagement metric reflecting the degree of the subject's engagement. The instructions also cause the logic machine to direct gesture-based user input from the subject to the computer system as soon as the engagement gesture is detected or the engagement metric exceeds a threshold.
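
The gating rule in the abstract above (route input once an engagement gesture is detected or the engagement metric exceeds a threshold) can be sketched as follows. The threshold value, the per-frame tuple layout, and the function names are assumptions; the abstract does not describe how the metric is computed.

```python
from typing import Iterable, Optional, Tuple

ENGAGEMENT_THRESHOLD = 0.7  # hypothetical value, not taken from the source


def is_engaged(
    gesture_detected: bool,
    metric: float,
    threshold: float = ENGAGEMENT_THRESHOLD,
) -> bool:
    """True when either engagement condition from the abstract holds."""
    return gesture_detected or metric > threshold


def first_engaged_frame(
    frames: Iterable[Tuple[bool, float]],
    threshold: float = ENGAGEMENT_THRESHOLD,
) -> Optional[int]:
    """Index of the first frame at which input should start being routed."""
    for index, (gesture_detected, metric) in enumerate(frames):
        if is_engaged(gesture_detected, metric, threshold):
            return index
    # The subject never engaged, so no gesture input is forwarded.
    return None
```

The comparison is strict, so a metric exactly equal to the threshold does not count as engagement.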

    CONTENT SYSTEM WITH SECONDARY TOUCH CONTROLLER
    6.
    Invention application (status: under examination, published)

    Publication No.: WO2013095946A1

    Publication date: 2013-06-27

    Application No.: PCT/US2012/068321

    Filing date: 2012-12-06

    Abstract: A controller for a content presentation and interaction system which includes a primary content presentation device. The controller includes a tactile control input and a touch screen control input. The tactile control input is responsive to the inputs of a first user and communicatively coupled to the content presentation device. The controller includes a plurality of tactile input mechanisms and provides a first set of the plurality of control inputs manipulating content. The controller includes a touch screen control input responsive to the inputs of the first user and communicatively coupled to the content presentation device. The second controller is proximate the first controller and provides a second set of the plurality of control inputs. The second set of control inputs includes alternative inputs for at least some of the controls and additional inputs not available using the tactile input mechanisms.

    USING VISUAL CUES TO DISAMBIGUATE SPEECH INPUTS
    7.
    Invention application (status: under examination, published)

    Publication No.: WO2014116614A1

    Publication date: 2014-07-31

    Application No.: PCT/US2014/012409

    Filing date: 2014-01-22

    Inventor: KLEIN, Christian

    Abstract: Embodiments related to recognizing speech inputs are disclosed. One disclosed embodiment provides a method for recognizing a speech input including receiving depth information of a physical space from a depth camera, determining an identity of a user in the physical space based on the depth information, receiving audio information from one or more microphones, and determining a speech input from the audio input. If the speech input comprises an ambiguous term, the ambiguous term in the speech input is compared to one or more of depth image data received from the depth image sensor and digital content consumption information for the user to identify an unambiguous term corresponding to the ambiguous term. After identifying the unambiguous term, an action is taken on the computing device based on the speech input and the unambiguous term.
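
One way to picture the disambiguation step above is to rank candidate items by the identified user's content consumption history. The sketch below covers only that half of the comparison (it does not use depth image data), and the scoring rule, data shapes, and names are all assumptions rather than anything stated in the abstract.

```python
from typing import Mapping, Optional, Sequence


def resolve_term(
    ambiguous_term: str,
    candidates: Sequence[str],
    consumption: Mapping[str, int],
) -> Optional[str]:
    """Pick the candidate the identified user has consumed most often.

    consumption maps a title to a hypothetical play count for the user
    whose identity was determined from the depth information.
    """
    term = ambiguous_term.lower()
    # Keep only candidates that the ambiguous term could refer to.
    matches = [title for title in candidates if term in title.lower()]
    if not matches:
        return None
    # Titles the user never consumed count as zero.
    return max(matches, key=lambda title: consumption.get(title, 0))
```

For example, a spoken "play racing" with two racing titles in the library resolves to whichever one this user has played more.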

    ADAPTIVE AREA CURSOR
    8.
    Invention application (status: under examination, published)

    Publication No.: WO2013074333A1

    Publication date: 2013-05-23

    Application No.: PCT/US2012/063738

    Filing date: 2012-11-06

    CPC classification number: G06F3/04812

    Abstract: Described is technology by which a user's cursor movement is assisted to help select elements of a user interface that may be otherwise difficult to target. An area cursor is provided that may intersect more than one element. If so, a computation result (e.g., percentage) is computed for each intersected element that is based upon intersection with the cursor and a total size of the element; the largest percentage intersection is selected. The computation (e.g., intersected area divided by total element area) favors smaller elements as they have a smaller area in the denominator. Also described is changing the cursor size to help hit elements and/or based upon one or more criteria. Still further described is determining the total size of an element based upon weighting, in addition to or instead of the element's actual size. Weighting may be based upon one or more criteria.
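
The selection rule above (intersected area divided by total element area, largest ratio wins) can be sketched for axis-aligned rectangles. The `Rect` type and function names are hypothetical, and the weighting of element sizes mentioned in the abstract is left out.

```python
from typing import Mapping, NamedTuple, Optional


class Rect(NamedTuple):
    left: float
    top: float
    right: float
    bottom: float


def area(rect: Rect) -> float:
    return max(0.0, rect.right - rect.left) * max(0.0, rect.bottom - rect.top)


def intersection_area(a: Rect, b: Rect) -> float:
    width = min(a.right, b.right) - max(a.left, b.left)
    height = min(a.bottom, b.bottom) - max(a.top, b.top)
    return width * height if width > 0 and height > 0 else 0.0


def pick_target(cursor: Rect, elements: Mapping[str, Rect]) -> Optional[str]:
    """Return the element with the largest covered fraction, if any."""
    best_name, best_ratio = None, 0.0
    for name, rect in elements.items():
        total = area(rect)
        if total <= 0:
            continue  # Skip degenerate elements to avoid dividing by zero.
        # Fraction of the element covered by the area cursor.
        ratio = intersection_area(cursor, rect) / total
        if ratio > best_ratio:
            best_name, best_ratio = name, ratio
    return best_name
```

Because the denominator is the element's own area, a small button that is one-quarter covered beats a large panel that overlaps the cursor by more pixels but is only fractionally covered.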

    GESTURE DISAMBIGUATION USING ORIENTATION INFORMATION
    10.
    Invention application (status: under examination, published)

    Publication No.: WO2015066659A1

    Publication date: 2015-05-07

    Application No.: PCT/US2014/063765

    Filing date: 2014-11-04

    CPC classification number: G06F3/017 G06F3/011

    Abstract: Embodiments are disclosed that relate to controlling a computing device based upon gesture input. In one embodiment, orientation information of the human subject is received, wherein the orientation information includes information regarding an orientation of a first body part and an orientation of a second body part. A gesture performed by the first body part is identified based on the orientation information, and an orientation of the second body part is identified based on the orientation information. A mapping of the gesture to an action performed by the computing device is determined based on the orientation of the second body part.
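
A minimal sketch of the idea above, in which the same gesture maps to different actions depending on the orientation of a second body part. The choice of a hand gesture and torso yaw, the 30-degree limit, and both action tables are assumptions for illustration only.

```python
from typing import Optional

# Hypothetical gesture-to-action tables for two torso orientations.
FACING_DISPLAY_ACTIONS = {"swipe_left": "next_page", "push": "select"}
TURNED_AWAY_ACTIONS = {"swipe_left": "dismiss_notification"}


def map_gesture(
    gesture: str, torso_yaw_deg: float, facing_limit_deg: float = 30.0
) -> Optional[str]:
    """Map a first-body-part gesture using the second body part's orientation.

    torso_yaw_deg is the assumed angle between the torso and the display,
    with 0 meaning the subject squarely faces the display.
    """
    facing_display = abs(torso_yaw_deg) <= facing_limit_deg
    table = FACING_DISPLAY_ACTIONS if facing_display else TURNED_AWAY_ACTIONS
    # Gestures missing from the active table are ignored.
    return table.get(gesture)
```

Here a left swipe turns the page while the subject faces the display, and the same swipe dismisses a notification when the torso is turned away.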
