Systems and methods for identifying speech based on spectral features

Invention Grant

US10546598B2 Systems and methods for identifying speech based on spectral features (In force)

Patent Title: Systems and methods for identifying speech based on spectral features
Application No.: US16542871

Application Date: 2019-08-16
Publication No.: US10546598B2

Publication Date: 2020-01-28
Inventor: Tom Médioni
Applicant: GoPro, Inc.
Applicant Address: US CA San Mateo
Assignee: GoPro, Inc.
Current Assignee: GoPro, Inc.
Current Assignee Address: US CA San Mateo
Agency: Esplin & Associates, PC
Main IPC: G10L15/00
IPC: G10L15/00; G10L25/78; G10L25/21; G10L25/18; G10L15/04; G10L15/22

Abstract:

Audio information defining audio content may be accessed. The audio content may have a duration. The audio content may be segmented into audio segments. Individual audio segments may correspond to a portion of the duration. The audio segments may include a first audio segment corresponding to a first portion of the duration. Energy features, entropy features, frequency features, and/or other features of the audio segments may be determined. Energy features may characterize energy of the audio segments. Entropy features may characterize spectral flatness of the audio segments. Frequency features may characterize highest frequencies of the audio segments. One or more of the audio segments may be identified as containing speech based on the energy features, the entropy features, the frequency features, and/or other information. Storage of the identification of the one or more of the audio segments as containing speech in one or more storage media may be effectuated.
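
The pipeline described in the abstract (segment the audio, compute per-segment energy, spectral flatness, and highest frequency, then flag segments as speech) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the patent: the function names, the 20 ms segment length, the Hann window, the reading of "highest frequency" as the highest bin within 40 dB of the segment's spectral peak, and all threshold values. A naive DFT is used only to keep the sketch dependency-free; an FFT routine would normally be used instead.

```python
import cmath
import math


def segment_audio(samples, sample_rate, segment_seconds=0.02):
    """Split samples into equal, non-overlapping segments (trailing remainder dropped)."""
    seg_len = max(1, int(round(sample_rate * segment_seconds)))
    return [samples[i:i + seg_len]
            for i in range(0, len(samples) - seg_len + 1, seg_len)]


def power_spectrum(segment):
    """Power per frequency bin from a naive DFT of the Hann-windowed segment."""
    n = len(segment)
    windowed = [x * (0.5 - 0.5 * math.cos(2 * math.pi * i / n))
                for i, x in enumerate(segment)]
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                  for i, x in enumerate(windowed))
        # Small offset keeps the logarithm finite for silent segments.
        spectrum.append(abs(acc) ** 2 + 1e-12)
    return spectrum


def segment_features(segment, sample_rate, floor_db=-40.0):
    """Return (energy, spectral flatness, highest frequency in Hz) for one segment."""
    n = len(segment)
    power = power_spectrum(segment)
    # Energy feature: mean squared amplitude.
    energy = sum(x * x for x in segment) / n
    # Entropy feature: spectral flatness = geometric mean / arithmetic mean of power.
    geometric_mean = math.exp(sum(math.log(p) for p in power) / len(power))
    flatness = geometric_mean / (sum(power) / len(power))
    # Frequency feature (assumed definition): highest bin within floor_db of the peak.
    threshold = max(power) * 10 ** (floor_db / 10)
    top_bin = max(k for k, p in enumerate(power) if p >= threshold)
    highest_freq = top_bin * sample_rate / n
    return energy, flatness, highest_freq


def identify_speech(samples, sample_rate,
                    min_energy=1e-4, max_flatness=0.3, max_freq=3400.0):
    """Flag each segment as speech using hypothetical placeholder thresholds."""
    flags = []
    for seg in segment_audio(samples, sample_rate):
        energy, flatness, highest_freq = segment_features(seg, sample_rate)
        flags.append(energy >= min_energy
                     and flatness <= max_flatness
                     and highest_freq <= max_freq)
    return flags
```

Calling `identify_speech(samples, sample_rate)` on a list of float samples returns one boolean per segment, which could then be written to storage alongside the segment's position in the duration. The simple threshold rule is a stand-in; the abstract does not say how the three features are combined.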

Public/Granted literature

US20190371358A1 SYSTEMS AND METHODS FOR IDENTIFYING SPEECH BASED ON SPECTRAL FEATURES Public/Granted day: 2019-12-05

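IPC Classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L15/00	Speech recognition (G10L17/00 takes precedence)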