Multichannel audio encode and decode using directional metadata
Abstract:
Spatial audio signals are processed to generate a compressed representation of the spatial audio signal. Methods include analyzing the spatial audio signal to determine directions of arrival for one or more audio elements; for at least one frequency subband, determining respective indications of signal power associated with the directions of arrival; generating metadata including direction information that includes indications of the directions of arrival of the audio elements, and energy information that includes respective indications of signal power; generating a channel-based audio signal with a predefined number of channels based on the spatial audio signal; and outputting, as the compressed representation, the channel-based audio signal and the metadata. The compressed representation of a spatial audio signal can be further processed to generate a reconstructed representation of the spatial audio signal.
Public/Granted literature
Information query
Patent Agency Ranking
0/0