Invention Grant
- Patent Title: Eye gaze for spoken language understanding in multi-modal conversational interactions
- Application No.: US14496538
- Application Date: 2014-09-25
- Publication No.: US10317992B2
- Publication Date: 2019-06-11
- Inventor: Anna Prokofieva, Fethiye Asli Celikyilmaz, Dilek Z Hakkani-Tur, Larry Heck, Malcolm Slaney
- Applicant: Microsoft Technology Licensing, LLC
- Applicant Address: US WA Redmond
- Assignee: Microsoft Technology Licensing, LLC
- Current Assignee: Microsoft Technology Licensing, LLC
- Current Assignee Address: US WA Redmond
- Agency: Schwegman Lundberg & Woessner, P.A.
- Main IPC: G06F3/01
- IPC: G06F3/01; G10L17/22; G10L15/08; G06F3/16; G06K9/00; G10L15/00; G02B27/00

Abstract:
Improving accuracy in understanding and/or resolving references to visual elements in a visual context associated with a computerized conversational system is described. Techniques described herein leverage gaze input with gestures and/or speech input to improve spoken language understanding in computerized conversational systems. Leveraging gaze input and speech input improves spoken language understanding in conversational systems by improving the accuracy by which the system can resolve references—or interpret a user's intent—with respect to visual elements in a visual context. In at least one example, the techniques herein describe tracking gaze to generate gaze input, recognizing speech input, and extracting gaze features and lexical features from the user input. Based at least in part on the gaze features and lexical features, user utterances directed to visual elements in a visual context can be resolved.
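As a rough illustration of the idea in the abstract, the sketch below combines a gaze feature and a lexical feature to pick the visual element an utterance most likely refers to. It is not the patented method: the abstract does not specify the features or the model, so the `VisualElement` type, the fixation format, the word-overlap lexical score, the dwell-time gaze score, and the linear `gaze_weight` combination are all assumptions made for this example.

```python
from dataclasses import dataclass


@dataclass
class VisualElement:
    """Hypothetical on-screen item, e.g. a link or a menu entry."""
    element_id: str
    text: str


def lexical_score(utterance: str, element: VisualElement) -> float:
    """Fraction of the element's words that also occur in the utterance."""
    spoken = set(utterance.lower().split())
    words = set(element.text.lower().split())
    return len(spoken & words) / len(words) if words else 0.0


def gaze_scores(fixations, elements):
    """Share of total fixation time spent on each element.

    `fixations` is a list of (element_id, duration_ms) pairs; this format
    is an assumption, not something the abstract defines.
    """
    totals = {e.element_id: 0.0 for e in elements}
    for element_id, duration_ms in fixations:
        if element_id in totals:
            totals[element_id] += duration_ms
    overall = sum(totals.values())
    return {k: (v / overall if overall else 0.0) for k, v in totals.items()}


def resolve_reference(utterance, fixations, elements, gaze_weight=0.5):
    """Return the element with the highest weighted gaze + lexical score."""
    gaze = gaze_scores(fixations, elements)

    def combined(element):
        return (gaze_weight * gaze[element.element_id]
                + (1 - gaze_weight) * lexical_score(utterance, element))

    return max(elements, key=combined)


# Example: the words alone favor "The Matrix", but the user mostly
# looked at "The Matrix Reloaded" while speaking.
elements = [
    VisualElement("a", "The Matrix"),
    VisualElement("b", "The Matrix Reloaded"),
]
fixations = [("b", 900), ("a", 100)]
best = resolve_reference("play the matrix", fixations, elements)
print(best.element_id)  # prints "b"
```

With `gaze_weight=0.0` the same call returns element `a`, which shows how gaze input can change the resolution of an otherwise ambiguous spoken reference.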
Public/Granted literature
- US20160091967A1 Eye Gaze for Spoken Language Understanding in Multi-Modal Conversational Interactions Public/Granted day: 2016-03-31