Abstract:
In a method of perceptual transform coding of audio signals in a telecommunication system, performing the steps of determining transform coefficients representative of a time to frequency transformation of a time segmented input audio signal; determining a spectrum of perceptual sub-bands for said input audio signal based on said determined transform coefficients; determining masking thresholds for each said sub-band based on said determined spectrum; computing scale factors for each said sub-band based on said determined masking thresholds, and finally adapting said computed scale factors for each said sub-band to prevent energy loss for perceptually relevant sub-bands.
Abstract:
The invention involves collective management of video (30) and audio (40) content in a content provider (100). The video (30) and audio (40) content is available in multiple potential versions (32, 34, 36; 42, 44, 46), e.g. in the form of scalable media (36, 46) or media (32, 34; 42, 44) pre-encoded to fixed bandwidth levels. The video (30) and audio (40) data is associated with bandwidth share information (62, 64, 66) enabling estimation of a respective apportionment of a total available bandwidth to the video (30) and audio (40) content. The provider (100) uses this share information (62, 64, 66) and information of the total assignable bandwidth level for providing a respective video version (32, 34, 36) and audio version (42, 44, 46) from the multiple potential versions (32, 34, 36, 42, 44, 46). This allows for increased user-quality when rendering the video (30) and audio (40) data as optimal video (32, 34, 36) and audio (42, 44, 46) version can be dynamically provided during the media session.
Abstract:
A network processing node (e.g., MGW, MRFP) and method are described herein that can: (1) receive packets on a first heterogeneous link (e.g., wireless link); (2) manipulate the received packets based on known characteristics about a second heterogeneous link (e.g., "Internet" link); and (3) send the manipulated packets on the second heterogeneous link (e.g., "Internet" link). For example, the network processing node can manipulate the received packets by adding redundancy, removing redundancy, frame aggregating (re-packetizing), recovering lost packets and/or re-transmitting packets.
Abstract:
The invention provides a highly efficient technique for encoding a multi-channel audio signal. The invention relies on the basic principle of encoding a first signal representation of one or more of the multiple channels in a first encoder (130) and encoding a second signal representation of one or more of the multiple channels in a second, multi-stage, encoder (140). This procedure is significantly enhanced by providing a controller (150) for adaptively allocating a number of encoding bits among the different encoding stages of the second, multi-stage, encoder (140) in dependence on multi-channel audio signal characteristics.
Abstract:
The invention relates to the technical field of audio encoding and/or decoding technologies, and thus concerns an overall encoding procedure and associated decoding procedure. The encoding procedure involves at least two signal encoding processes (S1-S3) operating on signal representations of a set of audio input channels, as well as residual encoding (S7-S8). It also involves a dedicated process (S4-S6) to estimate and encode energies of the audio input channels. Each encoding process is associated with a corresponding decoding process. In the overall decoding procedure the decoded signals from each encoding process are preferably combined such that the output channels are close to the input channels in terms of energy and/or quality. Normally, the combination step also adapts to the possible loss of one or more signal representation in part or in whole, such that the energy and quality is optimized with the signals at hand in the decoder. In this way, the overall quality of the output channels is improved.
Abstract:
The signal processing is based on the c oncept of using a time-domain aliased (12, TDA) frame as a basis for time segmen tation (14) and spectral analysis (16), performing segmentation in time based on the time-domain aliased frame and performing spectral analysis based on the resulting time segments. The time resolution of the overall ?segmented? time-to-frequenc y transform can thus be changed by simply adapting the time segmentation to ob tain a suitable number of time segments based on which spectral analysis is applied. The overall set of spectral coefficients, obtained for all the segments, provides a selectable time-frequency tiling of the original signal frame.
Abstract:
A vector quantizer includes a lattice quantizer (10) approximating a vector x by a lattice vector belonging to a lattice Λ 0 . A lattice vector decomposer (14) connected to the lattice quantizer successively decomposes the lattice vector into a sequence of quotient vectors y, and a sequence of remainder vectors r i on successive lattices Λ I-1 by lattice division with a corresponding predetermined sequence of integers p i ≥ 2 , where i = l...k and k is a positive integer representing the number of elements in each sequence.
Abstract translation:矢量量化器包括通过属于格子φ0的晶格矢量近似矢量x的晶格量化器(10)。 连接到晶格量化器的晶格矢量分解器(14)将晶格矢量依次分解成一个商矢量y的序列,并且在连续晶格上的剩余矢量序列ΠI-1 通过与相应的预定的整数序列p i i进行点划分
Abstract:
The invention provides an efficient technique for encoding a multi-channel audio signal. The invention relies on the principle of encoding (Sl) a signal representation of one or more of the multiple channels in a first encoding process, and encoding another signal representation of one or more channels in a second, filter-based encoding process. A basic idea according to the invention is to select (S2), for the second encoding process, a combination of i) frame division configuration of an overall encoding frame into a set of sub-frames, and ii) filter length for each sub-frame, according to a predetermined criterion. The second signal representation is then encoded (S3) in each sub-frame of the overall encoding frame according to the selected combination. The possibility to select frame division configuration and at the same time adjust the filter length for each sub-frame provides added degrees of freedom, and generally results in improved performance.
Abstract:
The invention provides an efficient technique for encoding a multi-channel audio signal. The invention relies on the principle of encoding (Sl) a signal representation of one or more of the multiple channels in a first encoding process, and encoding another signal representation of one or more channels in a second, filter-based encoding process. A basic idea according to the invention is to select (S2), for the second encoding process, a combination of i) frame division configuration of an overall encoding frame into a set of sub-frames, and ii) filter length for each sub-frame, according to a predetermined criterion. The second signal representation is then encoded (S3) in each sub-frame of the overall encoding frame according to the selected combination. The possibility to select frame division configuration and at the same time adjust the filter length for each sub-frame provides added degrees of freedom, and generally results in improved performance.
Abstract:
An overall encoding procedure and associated decoding procedure are presented. The encoding procedure involves at least two signal encoding processes (Sl, S4) operating on signal representations of a set of audio input channels. Local synthesis (S2) is used in connection with a first encoding process to generate a locally decoded signal, including a representation of the encoding error of the first encoding process. This locally decoded signal is applied as input (S3) to a second encoding process. The overall encoding procedure generates at least two residual encoding error signals (S5) from at least one of said encoding processes, including at least said second encoding process. The residual error signals are then subjected to compound residual encoding (S6) in a further encoding process, preferably based on correlation between the residual error signals.