Eye gaze for spoken language understanding in multi-modal conversational interactions

Invention Grant

US10317992B2 Eye gaze for spoken language understanding in multi-modal conversational interactions (status: in force)

Patent Title: Eye gaze for spoken language understanding in multi-modal conversational interactions
Application No.: US14496538

Application Date: 2014-09-25
Publication No.: US10317992B2

Publication Date: 2019-06-11
Inventor: Anna Prokofieva, Fethiye Asli Celikyilmaz, Dilek Z Hakkani-Tur, Larry Heck, Malcolm Slaney
Applicant: Microsoft Technology Licensing, LLC
Applicant Address: US WA Redmond
Assignee: Microsoft Technology Licensing, LLC
Current Assignee: Microsoft Technology Licensing, LLC
Current Assignee Address: US WA Redmond
Agency: Schwegman Lundberg & Woessner, P.A.
Main IPC: G06F3/01
IPC: G06F3/01; G10L17/22; G10L15/08; G06F3/16; G06K9/00; G10L15/00; G02B27/00

Abstract:

Improving accuracy in understanding and/or resolving references to visual elements in a visual context associated with a computerized conversational system is described. Techniques described herein leverage gaze input with gestures and/or speech input to improve spoken language understanding in computerized conversational systems. Leveraging gaze input and speech input improves spoken language understanding in conversational systems by improving the accuracy by which the system can resolve references—or interpret a user's intent—with respect to visual elements in a visual context. In at least one example, the techniques herein describe tracking gaze to generate gaze input, recognizing speech input, and extracting gaze features and lexical features from the user input. Based at least in part on the gaze features and lexical features, user utterances directed to visual elements in a visual context can be resolved.
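
As a rough illustration of the idea in the abstract, the following Python sketch combines one gaze feature with one lexical feature to pick the on-screen element an utterance most likely refers to. It is not the patented method: the `VisualElement` type, the fixation-time feature, the word-overlap feature, and the linear weighting via `gaze_weight` are all assumptions made for this example.

```python
from dataclasses import dataclass


@dataclass
class VisualElement:
    """Hypothetical on-screen item; all fields are assumptions for this sketch."""
    element_id: str
    label: str            # text displayed for this element
    gaze_seconds: float   # assumed gaze feature: fixation time during the utterance


def lexical_overlap(utterance: str, label: str) -> float:
    """Assumed lexical feature: fraction of label words found in the utterance."""
    utterance_words = set(utterance.lower().split())
    label_words = set(label.lower().split())
    if not label_words:
        return 0.0
    return len(utterance_words & label_words) / len(label_words)


def resolve_reference(utterance, elements, gaze_weight=0.5):
    """Return the element with the highest combined gaze + lexical score.

    The linear combination is an illustrative choice, not taken from the patent.
    Expects a non-empty list of elements.
    """
    total_gaze = sum(e.gaze_seconds for e in elements)

    def score(e):
        # Normalize fixation time so the gaze term lies in [0, 1].
        gaze_share = e.gaze_seconds / total_gaze if total_gaze > 0 else 0.0
        lexical = lexical_overlap(utterance, e.label)
        return gaze_weight * gaze_share + (1 - gaze_weight) * lexical

    return max(elements, key=score)


# Usage: two titles match the words equally well, so gaze breaks the tie.
elements = [
    VisualElement("lebowski", "The Big Lebowski", gaze_seconds=0.2),
    VisualElement("sleep", "The Big Sleep", gaze_seconds=1.4),
    VisualElement("fargo", "Fargo", gaze_seconds=0.1),
]
best = resolve_reference("play the big one", elements)
print(best.label)  # The Big Sleep
```

In this example the utterance alone is ambiguous between two titles, and the gaze feature resolves it. That is the kind of benefit the abstract attributes to combining the two inputs.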

Public/Granted literature

US20160091967A1 Eye Gaze for Spoken Language Understanding in Multi-Modal Conversational Interactions Publication/grant date: 2016-03-31

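IPC Classification:

G	Physics
G06	Computing; calculating or counting
G06F	Electric digital data processing (computer systems based on specific computational models: G06N)
G06F3/00	Input arrangements for transferring data to be processed into a form capable of being handled by the computer; output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
G06F3/01	. Input arrangements or combined input and output arrangements for interaction between user and computer (G06F3/16 takes precedence)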