Classifying segments of speech based on acoustic features and context

Invention Grant

US10311863B2 Classifying segments of speech based on acoustic features and context (Active)

Patent Title: Classifying segments of speech based on acoustic features and context
Application No.: US15256378

Application Date: 2016-09-02
Publication No.: US10311863B2

Publication Date: 2019-06-04
Inventor: Jill Fain Lehman, Nikolas Wolfe, Andre Pereira
Applicant: Disney Enterprises, Inc.
Applicant Address: US CA Burbank
Assignee: Disney Enterprises, Inc.
Current Assignee: Disney Enterprises, Inc.
Current Assignee Address: US CA Burbank
Agency: Farjami & Farjami LLP
Main IPC: G10L15/18
IPC: G10L15/18; G10L15/22; G10L15/02; G10L15/08

Abstract:

There is provided a system including a microphone configured to receive an input speech, an analog to digital (A/D) converter configured to convert the input speech to a digital form and generate a digitized speech including a plurality of segments having acoustic features, a memory storing an executable code, and a processor executing the executable code to extract a plurality of acoustic feature vectors from a first segment of the digitized speech, determine, based on the plurality of acoustic feature vectors, a plurality of probability distribution vectors corresponding to the probabilities that the first segment includes each of a first keyword, a second keyword, both the first keyword and the second keyword, a background, and a social speech, and assign a first classification label to the first segment based on an analysis of the plurality of probability distribution vectors of one or more segments preceding the first segment and the probability distribution vectors of the first segment.
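
The final step of the abstract, assigning a label from the probability distributions of the current segment and of the segments preceding it, can be illustrated with a minimal sketch. This is not the patented method: the abstract only says the label is based on "an analysis" of those distributions, so the weighted average used here, the function name classify_with_context, the parameters context_size and current_weight, and the class identifier strings are all hypothetical. The sketch also assumes each segment has already been reduced to a single distribution over the five classes, whereas the abstract describes a plurality of distribution vectors per segment.

```python
from collections import deque
from typing import Deque, Iterable, List, Sequence

# The five classes named in the abstract; the identifier strings are illustrative.
CLASSES = ("keyword_1", "keyword_2", "both_keywords", "background", "social_speech")


def classify_with_context(
    segment_probs: Iterable[Sequence[float]],
    context_size: int = 2,
    current_weight: float = 0.5,
) -> List[str]:
    """Label each segment from its own distribution and those preceding it.

    Assumption: context is combined by a weighted average. The abstract does
    not specify how the distributions are analyzed.
    """
    # Holds the distributions of up to `context_size` preceding segments.
    history: Deque[Sequence[float]] = deque(maxlen=context_size)
    labels: List[str] = []
    for probs in segment_probs:
        if len(probs) != len(CLASSES):
            raise ValueError("expected one probability per class")
        if history:
            # Per-class mean over the preceding segments' distributions.
            context = [sum(col) / len(history) for col in zip(*history)]
            combined = [
                current_weight * p + (1.0 - current_weight) * c
                for p, c in zip(probs, context)
            ]
        else:
            # No preceding segments: use the segment's own distribution.
            combined = list(probs)
        best = max(range(len(CLASSES)), key=combined.__getitem__)
        labels.append(CLASSES[best])
        history.append(probs)
    return labels


if __name__ == "__main__":
    segments = [
        [0.80, 0.05, 0.05, 0.05, 0.05],  # clearly the first keyword
        [0.40, 0.00, 0.00, 0.10, 0.50],  # ambiguous when taken alone
    ]
    print(classify_with_context(segments))
    print(classify_with_context(segments, context_size=0))
```

In this toy input the second segment would be labeled social speech on its own, but the preceding segment pulls it to the first keyword when context is used. Setting context_size to 0 disables the context and classifies each segment independently.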

Published/Granted literature

US20180068656A1 Classifying Segments of Speech Based on Acoustic Features and Context Publication/Grant date: 2018-03-08

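IPC Classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L15/00	Speech recognition (G10L17/00 takes precedence)
G10L15/08	. Speech classification or search
G10L15/18	.. using natural language modelling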