Process and associated system for separating a specified component and an audio background component from an audio mixture signal
Abstract:
Processes are described herein for transforming an audio mixture signal data structure into a specified component data structure and a background component data structure. In the processes described herein, pitch differences between a guide signal and a dialogue component of an audio mixture signal are accounted for by explicit modeling. Processes described herein can involve obtaining an audio guide signal data structure that corresponds to a dubbing of the specified component; determining parametric spectrogram model data structures for spectrograms of the specified component and the background component; estimating parameters of the parametric spectrogram model data structures to produce data structures representing a temporary specified signal and a temporary background signal; and filtering the audio mixture signal data structure using the data structures representing the temporary specified signal and the temporary background signal in order to produce data structures representing a specified audio signal and an audio background signal.
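The final filtering step can be illustrated with a minimal sketch. The abstract does not state which filter is used, so the Wiener-like ratio mask below is an assumption, as are the function name `soft_mask_filter`, its parameters, and the random arrays standing in for the mixture spectrogram and the two temporary power spectrograms.

```python
import numpy as np

def soft_mask_filter(mixture_stft, specified_power, background_power, eps=1e-12):
    """Split a complex mixture spectrogram into two parts with a ratio mask.

    Hypothetical helper: the ratio mask is an assumed filter, not one
    named in the abstract. All arrays share the shape (n_freq, n_frames).
    """
    # Fraction of each time-frequency bin attributed to the specified component.
    mask = specified_power / (specified_power + background_power + eps)
    specified_stft = mask * mixture_stft
    # The remainder goes to the background, so the two parts sum to the mixture.
    background_stft = mixture_stft - specified_stft
    return specified_stft, background_stft

# Toy data standing in for real spectrograms (assumed values, not from the source).
rng = np.random.default_rng(0)
n_freq, n_frames = 5, 4
mixture_stft = (rng.standard_normal((n_freq, n_frames))
                + 1j * rng.standard_normal((n_freq, n_frames)))
specified_power = rng.random((n_freq, n_frames))    # temporary specified signal
background_power = rng.random((n_freq, n_frames))   # temporary background signal

specified_stft, background_stft = soft_mask_filter(
    mixture_stft, specified_power, background_power
)
```

In this sketch the two outputs always sum back to the mixture, and each output bin is never larger in magnitude than the corresponding mixture bin, because the mask lies between 0 and 1.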