Systems and methods for identifying speech based on spectral features
Abstract:
Audio information defining audio content may be accessed. The audio content may have a duration. The audio content may be segmented into audio segments. Individual audio segments may correspond to a portion of the duration. The audio segments may include a first audio segment corresponding to a first portion of the duration. Energy features, entropy features, frequency features, and/or other features of the audio segments may be determined. Energy features may characterize energy of the audio segments. Entropy features may characterize spectral flatness of the audio segments. Frequency features may characterize highest frequencies of the audio segments. One or more of the audio segments may be identified as containing speech based on the energy features, the entropy features, the frequency features, and/or other information. Storage of the identification of the one or more of the audio segments as containing speech in one or more storage media may be effectuated.
Information query
Patent Agency Ranking
0/0