Invention Grant
- Patent Title: Systems and methods for identifying speech based on spectral features
-
Application No.: US16542871Application Date: 2019-08-16
-
Publication No.: US10546598B2Publication Date: 2020-01-28
- Inventor: Tom Médioni
- Applicant: GoPro, Inc.
- Applicant Address: US CA San Mateo
- Assignee: GoPro, Inc.
- Current Assignee: GoPro, Inc.
- Current Assignee Address: US CA San Mateo
- Agency: Esplin & Associates, PC
- Main IPC: G10L15/00
- IPC: G10L15/00 ; G10L25/78 ; G10L25/21 ; G10L25/18 ; G10L15/04 ; G10L15/22

Abstract:
Audio information defining audio content may be accessed. The audio content may have a duration. The audio content may be segmented into audio segments. Individual audio segments may correspond to a portion of the duration. The audio segments may include a first audio segment corresponding to a first portion of the duration. Energy features, entropy features, frequency features, and/or other features of the audio segments may be determined. Energy features may characterize energy of the audio segments. Entropy features may characterize spectral flatness of the audio segments. Frequency features may characterize highest frequencies of the audio segments. One or more of the audio segments may be identified as containing speech based on the energy features, the entropy features, the frequency features, and/or other information. Storage of the identification of the one or more of the audio segments as containing speech in one or more storage media may be effectuated.
Public/Granted literature
- US20190371358A1 SYSTEMS AND METHODS FOR IDENTIFYING SPEECH BASED ON SPECTRAL FEATURES Public/Granted day:2019-12-05
Information query