Abstract:
In general, techniques are described for coding vectors decomposed from higher order ambisonic (HOA) coefficients. A device comprising a processor and a memory may perform the techniques. The processor may be configured to obtain, from a bitstream, data indicative of a plurality of weight values that represent a vector included in a decomposed version of a plurality of HOA coefficients. Each of the weight values may correspond to a respective one of a plurality of weights in a weighted sum of code vectors that represents the vector and that includes a set of code vectors. The processor may further be configured to reconstruct the vector based on the weight values and the code vectors. The memory may be configured to store the reconstructed vector.
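The reconstruction described in this abstract amounts to a weighted sum, v ≈ Σ_j w_j · c_j, over the selected code vectors. The following is a minimal sketch of that arithmetic only. The function name `reconstruct_vector`, the 4-element code vectors, and the weight values are assumptions made for illustration; the abstract does not specify the codebook contents, the vector length, or how the weights are parsed from the bitstream.

```python
def reconstruct_vector(weights, code_vectors):
    """Return the weighted sum of code vectors as a list of floats.

    weights[j] is the decoded weight for code_vectors[j]. Both names are
    hypothetical; they stand in for values a decoder would obtain from the
    bitstream and from a stored codebook.
    """
    if len(weights) != len(code_vectors):
        raise ValueError("need exactly one weight per code vector")
    if not code_vectors:
        raise ValueError("need at least one code vector")
    length = len(code_vectors[0])
    result = [0.0] * length
    for weight, code_vector in zip(weights, code_vectors):
        if len(code_vector) != length:
            raise ValueError("code vectors must share one length")
        # Accumulate this code vector's contribution, scaled by its weight.
        for i, element in enumerate(code_vector):
            result[i] += weight * element
    return result


# Illustrative values only, not taken from any real codebook.
code_vectors = [
    [1.0, 0.0, 0.0, 0.0],
    [0.0, 1.0, 0.0, 0.0],
    [0.5, 0.5, 0.5, 0.5],
]
weights = [2.0, -1.0, 4.0]
reconstructed = reconstruct_vector(weights, code_vectors)
```

With these example inputs the reconstructed vector is `[4.0, 1.0, 2.0, 2.0]`. Only the weights need to be signaled per vector when the encoder and decoder share the same code vectors.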
Abstract:
In general, techniques are described for performing codebook selection when coding vectors decomposed from higher-order ambisonic coefficients. A device comprising a memory and a processor may perform the techniques. The memory may be configured to store a plurality of codebooks to use when performing vector dequantization with respect to a vector quantized spatial component of a soundfield. The vector quantized spatial component may be obtained through application of a decomposition to a plurality of higher order ambisonic coefficients. The processor may be configured to select one of the plurality of codebooks.
Abstract:
In general, techniques are described for compressing decomposed representations of a sound field. A device comprising one or more processors may be configured to perform the techniques. The one or more processors may be configured to obtain a bitstream comprising a compressed version of a spatial component of a sound field, the spatial component generated by performing a vector based synthesis with respect to a plurality of spherical harmonic coefficients.
Abstract:
In general, techniques are described for transitioning an ambient higher order ambisonic coefficient. A device comprising a memory and a processor may be configured to perform the techniques. The processor may obtain, from a frame of a bitstream of encoded audio data, a bit indicative of a reduced vector. The reduced vector may represent, at least in part, a spatial component of a sound field. The processor may also obtain, from the frame, a bit indicative of a transition of an ambient higher-order ambisonic coefficient. The ambient higher-order ambisonic coefficient may represent, at least in part, an ambient component of the sound field. The reduced vector may include a vector element associated with the ambient higher-order ambisonic coefficient in transition. The memory may be configured to store the frame of the bitstream.
Abstract:
Systems and techniques for rendering audio data are generally disclosed. An example device for rendering a higher order ambisonic (HOA) audio signal includes a memory configured to store the HOA audio signal, and one or more processors coupled to the memory. The one or more processors are configured to perform a loudness compensation process as part of generating an effect matrix. The one or more processors are further configured to render the HOA audio signal based on the effect matrix.
Abstract:
In general, techniques are described for transforming spherical harmonic coefficients. A device comprising one or more processors may perform the techniques. The processors may be configured to parse a bitstream to determine transformation information describing how a sound field was transformed to reduce the number of hierarchical elements, out of a plurality of hierarchical elements, that provide information relevant in describing the sound field. The processors may further be configured to, when reproducing the sound field based on those hierarchical elements that provide relevant information, transform the sound field based on the transformation information to reverse the transformation that was performed to reduce the number of hierarchical elements.
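As one way to picture a reversible transformation that reduces the number of relevant coefficients, the sketch below assumes the transformation is a rotation about the vertical axis applied to first-order coefficients ordered (W, Y, Z, X). The abstract does not state that the transformation is a rotation, and the function `rotate_yaw`, the coefficient ordering, and the single-source test signal are all assumptions for illustration. Rotating a horizontal source onto the X axis zeroes its Y coefficient, and applying the opposite rotation at the decoder restores the original coefficients.

```python
import math


def rotate_yaw(coeffs, angle):
    """Rotate first-order coefficients (w, y, z, x) about the vertical axis.

    Hypothetical helper: W and Z are unaffected by a yaw rotation, while
    X and Y rotate like a 2-D vector.
    """
    w, y, z, x = coeffs
    c, s = math.cos(angle), math.sin(angle)
    return (w, s * x + c * y, z, c * x - s * y)


# A single horizontal source at 40 degrees azimuth (illustrative values).
azimuth = math.radians(40.0)
original = (1.0, math.sin(azimuth), 0.0, math.cos(azimuth))

# Encoder side: rotate the source onto the X axis so that Y becomes zero,
# and signal the angle as the transformation information.
transformed = rotate_yaw(original, -azimuth)

# Decoder side: reverse the transformation using the signaled angle.
restored = rotate_yaw(transformed, azimuth)
```

After the encoder-side rotation, the Y coefficient is zero up to rounding, so only W and X carry information for this source. The decoder needs the angle to undo the rotation, which is the role the abstract assigns to the transformation information.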
Abstract:
A device configured to compress higher order ambisonic audio data representative of a sound field, the device comprising: a memory configured to store higher order ambisonic coefficients of the higher order ambisonic audio data; and one or more processors configured to: decompose (1002) the higher order ambisonic coefficients into a predominant sound component and a corresponding spatial component, the corresponding spatial component representative of the directions, shape, and width of the predominant sound component, and defined in a spherical harmonic domain, wherein the spatial component comprises elements; specify (1006), in a bitstream conforming to an intermediate compression format, a subset of the higher order ambisonic coefficients that represent an ambient component of the sound field; and determine that at least one of the elements of the spatial component is redundant with respect to information provided by the subset of the higher order ambisonic coefficients that represent the ambient component of the sound field; characterized in that the one or more processors are configured to specify (1008), in the bitstream, and irrespective of the determination that at least one of the elements of the spatial component is redundant, all of the elements of the spatial component.
Abstract:
In general, techniques are described for coding higher-order ambisonic coefficients during multiple transitions. A device comprising a processor and a memory coupled to the processor may be configured to perform the techniques. The processor may be configured to obtain a multi-transition indication of whether an ambient HOA coefficient is in transition during a same frame of the bitstream as a foreground audio signal is in transition. The processor may also be configured to obtain a vector that describes a spatial characteristic of a corresponding foreground audio signal based on the multi-transition indication, both the vector and the corresponding HOA audio signal decomposed from the HOA audio data. The memory may be configured to store the vector.
Abstract:
Indication of frame parameter reusability for coding vectors. In general, techniques are described for indicating the reusability of frame parameters for decoding vectors. A device comprising a processor and a memory may perform the techniques. The processor may be configured to obtain a bitstream comprising a vector representative of an orthogonal spatial axis in a spherical harmonics domain. The bitstream may further comprise an indicator as to whether to reuse, from a previous frame, at least one syntax element indicative of information used when compressing the vector. The memory may be configured to store the bitstream.
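The reuse indicator can be sketched as a per-frame flag that tells a decoder whether to carry the previous frame's vector coding parameters forward or to read new ones. This is a minimal sketch under stated assumptions: the class `VectorCodingParams`, its fields `quant_mode` and `num_bits`, and the function `params_for_frame` are hypothetical names, not syntax elements defined by the abstract or by any standard.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class VectorCodingParams:
    # Hypothetical stand-ins for the syntax elements that describe how the
    # vector was compressed.
    quant_mode: int
    num_bits: int


def params_for_frame(reuse_flag: bool,
                     explicit: Optional[VectorCodingParams],
                     previous: Optional[VectorCodingParams]) -> VectorCodingParams:
    """Pick the parameters for the current frame based on the reuse flag."""
    if reuse_flag:
        if previous is None:
            raise ValueError("reuse signaled but no previous frame exists")
        return previous
    if explicit is None:
        raise ValueError("no reuse signaled but no parameters were sent")
    return explicit


# Illustrative sequence of frames: (reuse_flag, explicitly signaled params).
frames = [
    (False, VectorCodingParams(quant_mode=1, num_bits=8)),
    (True, None),
    (False, VectorCodingParams(quant_mode=2, num_bits=6)),
    (True, None),
]

decoded = []
previous = None
for reuse_flag, explicit in frames:
    previous = params_for_frame(reuse_flag, explicit, previous)
    decoded.append(previous)
```

In this example the second and fourth frames carry only the flag, yet decode with the same parameters as the frames before them, which is where the bit savings would come from.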