Sensor fusion model to enhance machine conversational awareness

Invention Grant

US10685648B2 Sensor fusion model to enhance machine conversational awareness (In force)

Patent Title: Sensor fusion model to enhance machine conversational awareness
Application No.: US15806438

Application Date: 2017-11-08
Publication No.: US10685648B2

Publication Date: 2020-06-16
Inventor: John J. Andersen, Dogukan Erenel, Richard O. Lyle, Connie Yee
Applicant: International Business Machines Corporation
Applicant Address: US NY Armonk
Assignee: International Business Machines Corporation
Current Assignee: International Business Machines Corporation
Current Assignee Address: US NY Armonk
Agent: Stephen J. Walder, Jr.; Feb R. Cabrasawan
Main IPC: G10L15/22
IPC: G10L15/22; G10L15/06; G06F3/01; G10L15/16; G10L15/24; G10L15/18; G10L15/26; G10L25/48; G06F3/16; G06F40/279; G10L25/21

Abstract:

Mechanisms are provided, in a smart speaker system having at least one smart speaker device comprising an audio capture device, and smart speaker system logic, for processing audio sample data captured by the audio capture device. The audio capture device captures an audio sample from a monitored environment and one or more sensor devices capture sensor data representing non-verbal attention indicators associated with a speaker of a speech portion of the audio sample. The smart speaker system logic evaluates the non-verbal attention indicators of the sensor data to determine whether or not the speech portion of the audio sample is directed to the smart speaker device. In response to determining that the speech portion of the audio sample is directed to the smart speaker device, a cognitive system associated with the smart speaker system generates a response to the speech portion.
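
The decision flow in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the indicator names (`gaze_on_device`, `facing_device`), the weighted-average fusion rule, the 0.5 threshold, and every function and class name below are assumptions chosen only for illustration. The abstract itself says only that non-verbal attention indicators are evaluated and that a response is generated when the speech is judged to be directed at the device.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Optional


@dataclass
class AudioSample:
    """Hypothetical container for the speech portion of a captured audio sample."""
    speech_text: str


def attention_score(indicators: Dict[str, float], weights: Dict[str, float]) -> float:
    """Fuse indicator values (each assumed to lie in [0, 1]) into one score.

    Assumed fusion rule: a weighted average. Indicators missing from the
    sensor data are treated as 0.0 (no evidence of attention).
    """
    total_weight = sum(weights.values())
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive value")
    weighted = sum(w * indicators.get(name, 0.0) for name, w in weights.items())
    return weighted / total_weight


def is_directed_at_device(
    indicators: Dict[str, float],
    weights: Dict[str, float],
    threshold: float = 0.5,  # assumed cut-off, not taken from the patent
) -> bool:
    """Decide whether the speech is directed at the smart speaker device."""
    return attention_score(indicators, weights) >= threshold


def handle_sample(
    sample: AudioSample,
    indicators: Dict[str, float],
    weights: Dict[str, float],
    respond: Callable[[str], str],
    threshold: float = 0.5,
) -> Optional[str]:
    """Generate a response only when the speech is judged to be device-directed.

    `respond` stands in for the cognitive system; returns None otherwise.
    """
    if is_directed_at_device(indicators, weights, threshold):
        return respond(sample.speech_text)
    return None


if __name__ == "__main__":
    weights = {"gaze_on_device": 0.6, "facing_device": 0.4}
    sample = AudioSample("what is the weather today")
    looking = {"gaze_on_device": 1.0, "facing_device": 0.5}
    looking_away = {"gaze_on_device": 0.0, "facing_device": 0.2}
    print(handle_sample(sample, looking, weights, lambda t: f"Answering: {t}"))
    print(handle_sample(sample, looking_away, weights, lambda t: f"Answering: {t}"))
```

In this sketch the same utterance is answered when the speaker appears to be attending to the device and ignored otherwise, which is the behavior the abstract describes. A real system would derive the indicator values from the sensor devices and would likely use a more robust fusion rule than a fixed weighted average.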

Public/Granted literature

US20190139541A1 Sensor Fusion Model to Enhance Machine Conversational Awareness Publication date: 2019-05-09

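IPC classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L15/00	Speech recognition (G10L17/00 takes precedence)
G10L15/22	. Procedures used during a speech recognition process, e.g. man-machine dialogue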