Smart voice enhancement architecture for tempo tracking among music, speech, and noise
Abstract:
Audio data describing an audio signal may be received and used to determine a set of frames of the audio signal. A plurality of note onsets in the set of frames may be identified based on spectral energy of the audio signal in the set of frames. One or more tempos may be computed based on the identified plurality of note onsets. The one or more tempos may be validated based on a tempo validation condition. One or more music states of the audio signal may be determined based on the validated one or more tempos. Audio enhancement of the audio signal may be modified based on the one or more determined states of the audio signal.
Information query
Patent Agency Ranking
0/0