Invention Grant
- Patent Title: System and method for continuous multimodal speech and gesture interaction
-
Application No.: US15651315Application Date: 2017-07-17
-
Publication No.: US10540140B2Publication Date: 2020-01-21
- Inventor: Michael Johnston , Derya Ozkan
- Applicant: Nuance Communications, Inc.
- Applicant Address: US MA Burlington
- Assignee: NUANCE COMMUNICATIONS, INC.
- Current Assignee: NUANCE COMMUNICATIONS, INC.
- Current Assignee Address: US MA Burlington
- Main IPC: G10L15/22
- IPC: G10L15/22 ; G06F3/16 ; G06F3/01

Abstract:
Disclosed herein are systems, methods, and non-transitory computer-readable storage media for processing multimodal input. A system configured to practice the method continuously monitors an audio stream associated with a gesture input stream, and detects a speech event in the audio stream. Then the system identifies a temporal window associated with a time of the speech event, and analyzes data from the gesture input stream within the temporal window to identify a gesture event. The system processes the speech event and the gesture event to produce a multimodal command. The gesture in the gesture input stream can be directed to a display, but is remote from the display. The system can analyze the data from the gesture input stream by calculating an average of gesture coordinates within the temporal window.
Public/Granted literature
- US20180004482A1 SYSTEM AND METHOD FOR CONTINUOUS MULTIMODAL SPEECH AND GESTURE INTERACTION Public/Granted day:2018-01-04
Information query