Abstract:
일반적으로, 다중트랜지션들동안고차앰비소닉계수들을코딩하기위한기법들이설명된다. 프로세서및 프로세서에커플링된메모리를포함하는디바이스가이 기법들을수행하도록구성될수도있다. 프로세서는, 주변 HOA 계수가포어그라운드오디오신호가트랜지션중일때 비트스트림의동일한프레임동안트랜지션중인지여부의멀티-트랜지션표시를획득하도록구성될수도있다. 프로세서는또한, 멀티-트랜지션표시에기초하여대응하는포어그라운드오디오신호의공간적특징을기술하는벡터를획득하도록구성될수도있고, 벡터및 대응하는 HOA 오디오신호양자모두는 HOA 오디오데이터로부터분해된다. 메모리는벡터를저장하도록구성될수도있다.
Abstract:
In general, techniques are described for specifying spherical harmonic coefficients in a bitstream. A device comprising one or more processors may perform the techniques. The processors may be configured to identify, from the bitstream, a plurality of hierarchical elements describing a sound field that are included in the bitstream. The processors may further be configured to parse the bitstream to determine the identified plurality of hierarchical elements.
Abstract:
In general, techniques are described for obtaining decomposed versions of spherical harmonic coefficients. In accordance with these techniques, a device comprising one or more processors may be configured to determine a first non-zero set of coefficients of a vector that represent a distinct component of a sound field, the vector having been decomposed from a plurality of spherical harmonic coefficients that describe the sound field.
Abstract:
En general, se describen técnicas para indicar la reutilización de parámetros de trama para vectores de decodificación. Un dispositivo que comprende un procesador y una memoria puede realizar las técnicas. El procesador puede configurarse para obtener un flujo de bits que comprende un vector representativo de un eje espacial ortogonal en un dominio de armónicos esféricos. El flujo de bits puede comprender además un indicador de si se debe reutilizar, de un cuadro anterior, al menos un elemento de sintaxis indicativo de la información utilizada al comprimir el vector. La memoria puede configurarse para almacenar el flujo de bits. (Traducción automática con Google Translate, sin valor legal)
Abstract:
In general, techniques are described for performing a vector-based synthesis with respect to higher order ambisonic coefficients (or, in other words, spherical harmonic coefficients). A device comprising a processor may be configured to perform the techniques. The processor may perform the vector-based synthesis with respect to spherical harmonic coefficients to generate decomposed representations of the plurality of spherical harmonic coefficients and determine distinct and background directional information from the directional information. The processor may then reduce an order of the directional information associated with the background audio objects to generate transformed background directional information, and apply compensation to increase values of the transformed directional information to preserve an overall energy of the sound field.
Abstract:
In general, techniques are described for specifying audio rendering information (39) in a bitstream (31). A device (36) configured to generate the bitstream (31) may perform various aspects of the techniques. The bitstream generation device (36) may comprise one or more processors configured to specify audio rendering information (39) that includes a signal value identifying an audio renderer (28) used when generating the multi-channel audio content. A device configured to render multi-channel audio content from a bitstream (31) may also perform various aspects of the techniques. The rendering device may comprise one or more processors configured to determine audio rendering information that includes a signal value identifying an audio renderer (34) used when generating the multi-channel audio content, and render a plurality of speaker feeds (35) based on the audio rendering information (39). (Figure 5)
Abstract:
In general, techniques are described for signaling layers for scalable coding of higher order ambisonic audio data. A device comprising a memory and a processor may be configured to perform the techniques. The memory may be configured to store the bitstream. The processor may be configured to obtain, from the bitstream, an indication of a number of layers specified in the bitstream, and obtain the layers of the bitstream based on the indication of the number of layers.
Abstract:
In general, techniques are described for coding an ambient higher order ambisonic coefficient. An audio decoding device comprising a memory and a processor may perform the techniques. The memory may store a first frame of a bitstream and a second frame of the bitstream. The processor may obtain, from the first frame, one or more bits indicative of whether the first frame is an independent frame that includes additional reference information to enable the first frame to be decoded without reference to the second frame. The processor may further obtain, in response to the one or more bits indicating that the first frame is not an independent frame, prediction information for first channel side information data of a transport channel. The prediction information may be used to decode the first channel side information data of the transport channel with reference to second channel side information data of the transport channel.
Abstract:
In general, techniques are described for indicating frame parameter reusability for decoding vectors. A device comprising a processor and a memory may perform the techniques. The processor may be configured to obtain a bitstream (21, 450) comprising a vector representative of an orthogonal spatial axis in a spherical harmonics domain. The bitstream may further comprise an indicator for whether to reuse, from a previous frame, at least one syntax element indicative of information used when compressing the vector. The memory may be configured to store the bitstream.