-
Publication number: WO2014116548A1
Publication date: 2014-07-31
Application number: PCT/US2014/012229
Application date: 2014-01-21
Applicant: MICROSOFT CORPORATION
Inventor: KLEIN, Christian , NIMAN, Meg
CPC classification number: G10L21/10 , G06F3/0304 , G06F3/167 , G10L2015/225
Abstract: Embodiments are disclosed that relate to providing visual feedback in a speech recognition system. For example, one disclosed embodiment provides a method including displaying a graphical feedback indicator having a variable appearance dependent upon a state of the speech recognition system. The method further comprises receiving a speech input, modifying an appearance of the graphical feedback indicator in a first manner if the speech input is heard and understood by the system, and modifying the appearance of the graphical feedback indicator in a different manner than the first manner if the speech input is heard and not understood.
-
Publication number: WO2014116543A1
Publication date: 2014-07-31
Application number: PCT/US2014/012224
Application date: 2014-01-21
Applicant: MICROSOFT CORPORATION
Inventor: KLEIN, Christian , WYGONIK, Gregg
CPC classification number: G10L21/10 , G10L15/1815 , G10L15/22
Abstract: Embodiments are disclosed that relate to the use of speech inputs including indefinite quantitative terms as computing device inputs. For example, one disclosed embodiment provides a method of operating a computing device, the method including receiving a speech input comprising an indefinite quantitative term, determining a definite quantity corresponding to the indefinite quantitative term, and applying the definite quantity to an action performed via the computing device in response to the speech input.
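A minimal sketch of the idea in this abstract, resolving an indefinite quantitative term to a definite quantity and applying it to an action. The term table, the percentages, and the names `INDEFINITE_TERMS`, `resolve_quantity`, and `adjust_volume` are hypothetical and are not taken from the patent; the volume command is only an illustrative action.

```python
import re

# Hypothetical lookup table: indefinite term -> percent of the full range.
INDEFINITE_TERMS = {"a little": 10, "a bit": 10, "some": 25, "a lot": 50}


def resolve_quantity(utterance: str, value_range: float):
    """Return a definite amount for the first table entry found in the utterance, else None."""
    text = utterance.lower()
    for term, percent in INDEFINITE_TERMS.items():
        if re.search(r"\b" + re.escape(term) + r"\b", text):
            return value_range * percent / 100
    return None


def adjust_volume(utterance: str, current: float, max_volume: float = 100) -> float:
    """Apply the resolved amount to a volume change, clamped to [0, max_volume]."""
    amount = resolve_quantity(utterance, max_volume)
    if amount is None:
        return current  # no indefinite term recognised; leave the volume unchanged
    lowering = re.search(r"\b(down|lower|quieter)\b", utterance.lower())
    direction = -1 if lowering else 1
    return min(max_volume, max(0, current + direction * amount))
```

For example, `adjust_volume("turn the volume up a little", 40)` raises the volume by 10 percent of the range under these assumed values.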
-
Publication number: WO2011100254A2
Publication date: 2011-08-18
Application number: PCT/US2011/024078
Application date: 2011-02-08
Applicant: MICROSOFT CORPORATION
Inventor: MATTINGLY, Andrew , HILL, Jeremy , DAYAL, Arjun , KRAMP, Brian , VASSIGH, Ali , KLEIN, Christian , POULOS, Adam , KIPMAN, Alex , MARGOLIS, Jeffrey
CPC classification number: G06F3/017 , G06F3/0304 , G06F3/04812 , G06F3/04842
Abstract: A system is disclosed for providing on-screen graphical handles to control interaction between a user and on-screen objects. A handle defines what actions a user may perform on the object, such as for example scrolling through a textual or graphical navigation menu. Affordances are provided to guide the user through the process of interacting with a handle.
-
Publication number: WO2015017294A1
Publication date: 2015-02-05
Application number: PCT/US2014/048340
Application date: 2014-07-28
Applicant: MICROSOFT CORPORATION
Inventor: BAILEY, Richard , BASTIEN, David , SCHWESINGER, Mark , YANG, Emily , SMITH, Adam , MURILLO, Oscar , FRANKLIN, Tim , ANDERSEN, Jordan , KLEIN, Christian
IPC: G06F3/01 , G06K9/00 , G06F3/0482 , G06F3/03
CPC classification number: G06F3/0346 , G06F3/011 , G06F3/017 , G06F3/0304 , G06F3/0482 , G06K9/00335 , G06K9/00355
Abstract: Users move their hands in a three dimensional ("3D") physical interaction zone ("PHIZ") to control a cursor in a user interface ("UI") shown on a computer-coupled 2D display such as a television or monitor. The PHIZ is shaped, sized, and positioned relative to the user to ergonomically match the user's natural range of motions so that cursor control is intuitive and comfortable over the entire region on the UI that supports cursor interaction. A motion capture system tracks the user's hand so that the user's 3D motions within the PHIZ can be mapped to the 2D UI. Accordingly, when the user moves his or her hands in the PHIZ, the cursor correspondingly moves on the display. Movement in the z direction (i.e., back and forth) in the PHIZ allows for additional interactions to be performed such as pressing, zooming, 3D manipulations, or other forms of input to the UI.
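The mapping from 3D hand motion in the PHIZ to a 2D cursor can be sketched as follows. This is a simplified illustration that assumes a box-shaped PHIZ and a linear mapping; the abstract only says the zone is shaped and sized to match the user's range of motion, so the `Phiz` class, its fields, and `map_to_screen` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Phiz:
    """Hypothetical box-shaped interaction zone, in metres relative to the user."""
    x_min: float
    x_max: float
    y_min: float
    y_max: float
    z_min: float
    z_max: float


def _normalise(value: float, lo: float, hi: float) -> float:
    """Map value from [lo, hi] to [0, 1], clamping positions outside the zone."""
    return min(1.0, max(0.0, (value - lo) / (hi - lo)))


def map_to_screen(phiz: Phiz, hand, screen_w: int, screen_h: int):
    """Return (pixel_x, pixel_y, depth) for a hand position (x, y, z).

    depth is the normalised z position in [0, 1]; a caller could compare it
    against a threshold to detect a press or drive a zoom level.
    """
    u = _normalise(hand[0], phiz.x_min, phiz.x_max)
    v = _normalise(hand[1], phiz.y_min, phiz.y_max)
    depth = _normalise(hand[2], phiz.z_min, phiz.z_max)
    # Screen y grows downward, while the physical y axis grows upward.
    return round(u * (screen_w - 1)), round((1.0 - v) * (screen_h - 1)), depth
```

Clamping keeps the cursor on screen when the hand leaves the zone, which is one possible design choice rather than something the abstract specifies.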
-
Publication number: WO2014124065A1
Publication date: 2014-08-14
Application number: PCT/US2014/014972
Application date: 2014-02-06
Applicant: MICROSOFT CORPORATION
Inventor: SCHWESINGER, Mark , ESCARDO RAFFO, Eduardo , MURILLO, Oscar , BASTIEN, David , AHN, Matthew H. , GIUSTI, Mauro , ENDRES, Kevin , KLEIN, Christian , SCHWARZ, Julia , MARAIS, Charles Claudius
IPC: G06F3/01
Abstract: An NUI system to provide user input to a computer system. The NUI system includes a logic machine and an instruction-storage machine. The instruction-storage machine holds instructions that, when executed by the logic machine, cause the logic machine to detect an engagement gesture from a human subject or to compute an engagement metric reflecting the degree of the subject's engagement. The instructions also cause the logic machine to direct gesture-based user input from the subject to the computer system as soon as the engagement gesture is detected or the engagement metric exceeds a threshold.
-
Publication number: WO2013095946A1
Publication date: 2013-06-27
Application number: PCT/US2012/068321
Application date: 2012-12-06
Applicant: MICROSOFT CORPORATION
Inventor: CLAVIN, John , LOBB, Kenneth A. , KLEIN, Christian , GEISNER, Kevin , NOVAK, Christopher M.
CPC classification number: A63F13/24 , A63F13/20 , A63F13/213 , A63F13/2145 , A63F13/235 , A63F13/30 , A63F13/335 , A63F13/42 , A63F13/90 , A63F2300/301 , G06F1/1626 , G06F1/1632 , G06F3/033 , G06F3/038 , G06F2203/0381
Abstract: A controller for a content presentation and interaction system which includes a primary content presentation device. The controller includes a tactile control input and a touch screen control input. The tactile control input is responsive to the inputs of a first user and communicatively coupled to the content presentation device. The controller has a plurality of tactile input mechanisms and provides a first set of the plurality of control inputs manipulating content. The controller includes a touch screen control input responsive to the inputs of the first user and communicatively coupled to the content presentation device. The second controller is proximate the first controller and provides a second set of the plurality of control inputs. The second set of control inputs includes alternative inputs for at least some of the controls and additional inputs not available using the tactile input mechanisms.
-
Publication number: WO2014116614A1
Publication date: 2014-07-31
Application number: PCT/US2014/012409
Application date: 2014-01-22
Applicant: MICROSOFT CORPORATION
Inventor: KLEIN, Christian
CPC classification number: G10L15/22 , G06F3/017 , G06F3/0304 , G06F3/167 , G06F2203/0381 , G10L15/24 , G10L2015/223
Abstract: Embodiments related to recognizing speech inputs are disclosed. One disclosed embodiment provides a method for recognizing a speech input including receiving depth information of a physical space from a depth camera, determining an identity of a user in the physical space based on the depth information, receiving audio information from one or more microphones, and determining a speech input from the audio input. If the speech input comprises an ambiguous term, the ambiguous term in the speech input is compared to one or more of depth image data received from the depth image sensor and digital content consumption information for the user to identify an unambiguous term corresponding to the ambiguous term. After identifying the unambiguous term, an action is taken on the computing device based on the speech input and the unambiguous term.
-
Publication number: WO2013074333A1
Publication date: 2013-05-23
Application number: PCT/US2012/063738
Application date: 2012-11-06
Applicant: MICROSOFT CORPORATION
Inventor: KLEIN, Christian , ROSSER, Peter D.
CPC classification number: G06F3/04812
Abstract: Described is technology by which a user's cursor movement is assisted to help select elements of a user interface that may be otherwise difficult to target. An area cursor is provided that may intersect more than one element. If so, a computation result (e.g., percentage) is computed for each intersected element that is based upon intersection with the cursor and a total size of the element; the largest percentage intersection is selected. The computation (e.g., intersected area divided by total element area) favors smaller elements as they have a smaller area in the denominator. Also described is changing the cursor size to help hit elements and/or based upon one or more criteria. Still further described is determining the total size of an element based upon weighting, in addition to or instead of the element's actual size. Weighting may be based upon one or more criteria.
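The selection rule in this abstract (intersected area divided by total element area, largest result wins) can be sketched as below. This is a minimal illustration assuming axis-aligned rectangles for both the area cursor and the elements. The names `Rect`, `overlap_area`, and `pick_target` are hypothetical, and the `weights` parameter is one assumed way to model the weighted element size the abstract mentions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rect:
    """Axis-aligned rectangle; (x, y) is the top-left corner."""
    x: float
    y: float
    w: float
    h: float

    @property
    def area(self) -> float:
        return self.w * self.h


def overlap_area(a: Rect, b: Rect) -> float:
    """Area of the intersection of two rectangles, or 0.0 if they are disjoint."""
    dx = min(a.x + a.w, b.x + b.w) - max(a.x, b.x)
    dy = min(a.y + a.h, b.y + b.h) - max(a.y, b.y)
    return dx * dy if dx > 0 and dy > 0 else 0.0


def pick_target(cursor: Rect, elements: dict, weights: dict = None):
    """Return the name of the element with the largest intersected fraction.

    weights (assumption): optional per-element multipliers applied to the
    element's area, so a weight above 1 makes an element harder to select.
    Returns None when the cursor intersects no element.
    """
    weights = weights or {}
    best_name, best_score = None, 0.0
    for name, rect in elements.items():
        effective_area = rect.area * weights.get(name, 1.0)
        if effective_area <= 0:
            continue  # skip degenerate elements to avoid dividing by zero
        score = overlap_area(cursor, rect) / effective_area
        if score > best_score:
            best_name, best_score = name, score
    return best_name
```

With a 10 x 10 cursor that overlaps both a 100 x 100 panel and a 4 x 4 button, the button wins even though the panel has the larger raw overlap, because the button's smaller area sits in the denominator.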
-
Publication number: WO2011090829A2
Publication date: 2011-07-28
Application number: PCT/US2011/020396
Application date: 2011-01-06
Applicant: MICROSOFT CORPORATION
Inventor: DERNIS, Mitchell , LEYVAND, Tommer , KLEIN, Christian , LI, Jinyu
CPC classification number: G06K9/00892 , A63F13/213 , A63F13/215 , A63F13/79 , G06F17/30787 , G10L17/10 , G10L2021/02166
Abstract: A system and method are disclosed for tracking image and audio data over time to automatically identify a person based on a correlation of their voice with their body in a multi-user game or multimedia setting.
-
Publication number: WO2015066659A1
Publication date: 2015-05-07
Application number: PCT/US2014/063765
Application date: 2014-11-04
Applicant: MICROSOFT CORPORATION
Inventor: SCHWESINGER, Mark , YANG, Emily , KAPUR, Jay , PAOLANTONIO, Sergio , KLEIN, Christian
IPC: G06F3/01
Abstract: Embodiments are disclosed that relate to controlling a computing device based upon gesture input. In one embodiment, orientation information of the human subject is received, wherein the orientation information includes information regarding an orientation of a first body part and an orientation of a second body part. A gesture performed by the first body part is identified based on the orientation information, and an orientation of the second body part is identified based on the orientation information. A mapping of the gesture to an action performed by the computing device is determined based on the orientation of the second body part.