Systems and methods for identifying speech based on cepstral coefficients and support vector machines

Invention Grant

US10403303B1 Systems and methods for identifying speech based on cepstral coefficients and support vector machines 有权

Please log in to see more content

Patent Title: Systems and methods for identifying speech based on cepstral coefficients and support vector machines
Application No.: US15802115

Application Date: 2017-11-02
Publication No.: US10403303B1

Publication Date: 2019-09-03
Inventor: Tom Médioni , Vincent Garcia
Applicant: GoPro, Inc.
Applicant Address: US CA San Mateo
Assignee: GoPro, Inc.
Current Assignee: GoPro, Inc.
Current Assignee Address: US CA San Mateo
Agency: Esplin & Associates, PC
Main IPC: G10L25/24
IPC: G10L25/24 ; G10L25/30 ; G10L25/78 ; G10L25/21 ; G10L15/22 ; G10L25/18

Systems and methods for identifying speech based on cepstral coefficients and support vector machines

Abstract:

Audio content may have a duration. The audio content may be segmented into audio segments. Individual audio segments may correspond to a portion of the duration. Mel frequency spectral power features, Mel frequency cepstral coefficient features, and energy features of the audio segments may be determined. Feature vectors of the audio segments may be determined based on the Mel frequency spectral power features, the Mel frequency cepstral coefficient features, and the energy features. The feature vectors may be processed through a support vector machine. The support vector machine may output predictions on whether the audio segments contain speech. One or more of the audio segments may be identified as containing speech based on filtering the predictions and comparing the filtered predictions to a threshold. Storage of the identification of the one or more of the audio segments as containing speech in one or more storage media may be effectuated.

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L25/00	不限于组G10L 15/00-G10L 21/00的语言或者声音分析技术(当利用语音检测器来感知一些信号特殊特征的基于半导体的静噪放大器，如无信号时的感知入H03G3/34)
G10L25/03	.以提取参数类型为特征的
G10L25/24	..提取参数的倒谱