Multichannel audio encode and decode using directional metadata

Invention Grant

US12315523B2 Multichannel audio encode and decode using directional metadata (In force)

Patent Title: Multichannel audio encode and decode using directional metadata
Application No.: US18584290

Application Date: 2024-02-22
Publication No.: US12315523B2

Publication Date: 2025-05-27
Inventor: David McGrath
Applicant: Dolby Laboratories Licensing Corporation
Applicant Address: US CA San Francisco
Assignee: Dolby Laboratories Licensing Corporation
Current Assignee: Dolby Laboratories Licensing Corporation
Current Assignee Address: US CA San Francisco
Main IPC: G10L19/008
IPC: G10L19/008 ; G10L19/02

Abstract:

Spatial audio signals are processed to generate a compressed representation of the spatial audio signal. Methods include analyzing the spatial audio signal to determine directions of arrival for one or more audio elements; for at least one frequency subband, determining respective indications of signal power associated with the directions of arrival; generating metadata including direction information that includes indications of the directions of arrival of the audio elements, and energy information that includes respective indications of signal power; generating a channel-based audio signal with a predefined number of channels based on the spatial audio signal; and outputting, as the compressed representation, the channel-based audio signal and the metadata. The compressed representation of a spatial audio signal can be further processed to generate a reconstructed representation of the spatial audio signal.
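
The encode steps listed in the abstract can be illustrated with a minimal sketch. It is not the patented method: the input format (first-order Ambisonics with channel order W, X, Y, Z and unit gains), the intensity-vector direction estimator, the fixed stereo downmix, and the names `encode_frame` and `band_edges` are all assumptions made for illustration.

```python
import numpy as np

def encode_frame(foa, band_edges):
    """Sketch of a per-frame encoder (hypothetical helper, not from the patent).

    foa: array of shape (4, n_samples), assumed channel order W, X, Y, Z.
    band_edges: increasing FFT-bin indices delimiting the frequency subbands.
    """
    spec = np.fft.rfft(foa, axis=1)            # shape (4, n_bins)
    w, xyz = spec[0], spec[1:]
    directions, powers = [], []
    for lo, hi in zip(band_edges[:-1], band_edges[1:]):
        # Assumed estimator: active intensity vector summed over the subband.
        intensity = np.real(np.conj(w[lo:hi]) * xyz[:, lo:hi]).sum(axis=1)
        norm = np.linalg.norm(intensity)
        # Direction information: unit vector toward the dominant source.
        directions.append(intensity / norm if norm > 0 else np.zeros(3))
        # Energy information: omnidirectional (W) power in the subband.
        powers.append(float(np.sum(np.abs(w[lo:hi]) ** 2)))
    # Channel-based signal with a predefined channel count (assumed: 2).
    downmix = 0.5 * np.stack([foa[0] + foa[2], foa[0] - foa[2]])
    return {
        "audio": downmix,                      # shape (2, n_samples)
        "directions": np.array(directions),    # shape (n_bands, 3)
        "powers": np.array(powers),            # shape (n_bands,)
    }

# Usage: a single noise source placed at azimuth 90 degrees (the +Y axis).
rng = np.random.default_rng(0)
s = rng.standard_normal(1024)
az = np.pi / 2
foa = np.stack([s, s * np.cos(az), s * np.sin(az), np.zeros_like(s)])
compressed = encode_frame(foa, [0, 64, 256, 513])
```

In this sketch the returned dictionary plays the role of the compressed representation: the downmix is the channel-based audio signal, and the per-subband directions and powers are the metadata. A decoder would use that metadata to redistribute the downmix back into a spatial format.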

Public/Granted literature

US20240282321A1 MULTICHANNEL AUDIO ENCODE AND DECODE USING DIRECTIONAL METADATA Public/Granted day: 2024-08-22

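IPC Classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L19/00	Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis (in musical instruments G10H)
G10L19/008	. Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint stereo, intensity coding or matrixing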