Abstract:
The invention utilizes low complexity estimates of complex functions to perform combinatorial coding of signal vectors. The invention disregards the accuracy of such functions as long as certain sufficient properties are maintained. The invention in turn may reduce computational complexity of certain coding and decoding operations by two orders of magnitude or more for a given signal vector input.
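
As an illustration of what combinatorial coding of a signal vector can involve, the sketch below maps the positions of k pulses to a single index with exact binomial coefficients. It assumes that the "complex functions" are binomial coefficients, which the abstract does not state; the names `encode_positions` and `decode_positions` are hypothetical. A low complexity estimate, as described above, would stand in for `math.comb`, provided it keeps whatever properties decoding relies on.

```python
from math import comb  # exact binomial coefficient (Python 3.8+)

def encode_positions(positions):
    """Map distinct pulse positions p_1 < ... < p_k to the index sum C(p_i, i)."""
    return sum(comb(p, i) for i, p in enumerate(sorted(positions), start=1))

def decode_positions(index, k):
    """Invert encode_positions by greedy search, largest position first."""
    positions = []
    for i in range(k, 0, -1):
        p = i - 1  # smallest possible value for the i-th smallest position
        while comb(p + 1, i) <= index:
            p += 1
        index -= comb(p, i)
        positions.append(p)
    return sorted(positions)

code = encode_positions([1, 4, 6])
recovered = decode_positions(code, 3)
```

The exact version needs large-integer binomials for long vectors, which is the cost such estimates are meant to reduce.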
Abstract:
To address the need for choosing values of the harmonic noise weighting (HNW) coefficient (ε_p) so that the amount of harmonic noise weighting can be optimized, a method and apparatus for performing harmonic noise weighting in digital speech coders are provided herein. During operation, received speech is analyzed (503) to determine a pitch period. HNW coefficients are then chosen (505) based on the pitch period, and a perceptual noise weighting filter (C(z)) is determined (507) based on the HNW coefficients (ε_p).
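
The three steps (pitch analysis, coefficient choice, filter determination) can be sketched as follows. Everything specific here is an assumption: the abstract gives neither the mapping from pitch period to ε_p nor the form of C(z), so the linear taper in `choose_hnw_coefficient` and the single-tap filter C(z) = 1 - ε_p z^(-P) are placeholders, and all names and default values are hypothetical.

```python
def choose_hnw_coefficient(pitch_period, eps_max=0.4, p_low=40, p_high=120):
    """Placeholder mapping from pitch period (in samples) to eps_p.

    Assumed shape only: eps_max up to p_low, tapering linearly to 0 at p_high.
    """
    if pitch_period <= p_low:
        return eps_max
    if pitch_period >= p_high:
        return 0.0
    return eps_max * (p_high - pitch_period) / (p_high - p_low)

def harmonic_weighting(x, pitch_period, eps):
    """Apply the assumed single-tap filter C(z) = 1 - eps * z^(-P) to x."""
    out = []
    for n in range(len(x)):
        delayed = x[n - pitch_period] if n >= pitch_period else 0.0
        out.append(x[n] - eps * delayed)
    return out

eps = choose_hnw_coefficient(80)
impulse_response = harmonic_weighting([1.0, 0.0, 0.0, 0.0, 0.0], 3, 0.5)
```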
Abstract:
In a speech encoder/decoder a pitch delay contour endpoint modifier is employed to shift the endpoints of a pitch delay interpolation curve up or down. Particularly, the endpoints of the pitch delay interpolation curve are shifted based on a variation and/or a standard deviation in pitch delay.
Abstract:
A speech coder that performs analysis-by-synthesis coding of a signal determines gain parameters for each of multiple constituent components of a synthetic excitation signal. The speech coder generates a target vector based on an input signal. The speech coder further generates multiple constituent components associated with the synthetic excitation signal, wherein one constituent component is based on a shifted version of another constituent component. The speech coder further evaluates an error criterion based on the target vector and the multiple constituent components to determine a gain associated with each constituent component.
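
A minimal sketch of joint gain determination for two components, one a shifted copy of the other, is given below. It assumes a squared-error criterion solved through the 2x2 normal equations and omits the weighted synthesis filtering a real coder would apply; the abstract names neither choice, and `shift`, `dot` and `joint_gains` are hypothetical names.

```python
def shift(vec, lag):
    """Delay vec by lag samples, zero-filling the start and keeping the length."""
    return [0.0] * lag + vec[: len(vec) - lag]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def joint_gains(target, c1, c2):
    """Return (g1, g2) minimising ||target - g1*c1 - g2*c2||^2."""
    r11, r12, r22 = dot(c1, c1), dot(c1, c2), dot(c2, c2)
    b1, b2 = dot(target, c1), dot(target, c2)
    det = r11 * r22 - r12 * r12
    if abs(det) < 1e-12:
        raise ValueError("components are (nearly) linearly dependent")
    g1 = (b1 * r22 - b2 * r12) / det
    g2 = (b2 * r11 - b1 * r12) / det
    return g1, g2

c1 = [1.0, 0.0, -1.0, 0.5, 0.0, 0.0]
c2 = shift(c1, 2)  # second component is a shifted version of the first
target = [0.8 * a + 0.3 * b for a, b in zip(c1, c2)]
g1, g2 = joint_gains(target, c1, c2)
```

Because the target here is built from the two components, the solver recovers the gains used to build it; with real speech the result is only the best fit under the assumed criterion.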
Abstract:
A method and apparatus for improving listener differentiation of talkers during a conference call are provided herein. Particularly, during a teleconference, a node (101) will extend the bandwidth of received signals (e.g., speech). Each caller within the conference call will then have their voice projected by the node (101) to a particular spot in three-dimensional space.
Abstract:
A method (Fig. 9) and apparatus (500, 600) for prediction in a speech-coding system extends a first-order long-term predictor (LTP) filter, using a sub-sample resolution delay, to a multi-tap LTP filter (504, 604). From another perspective, a conventional integer-sample resolution multi-tap LTP filter is extended to use sub-sample resolution delay. Such a multi-tap LTP filter offers a number of advantages over the prior art. Particularly, defining the lag with sub-sample resolution makes it possible to explicitly model delay values that have a fractional component, within the limits of resolution of the over-sampling factor used by the interpolation filter. The coefficients (βi) of the multi-tap LTP filter are thus largely freed from modeling the effect of delays that have a fractional component. Consequently, their main function is to maximize the prediction gain of the LTP filter by modeling the degree of periodicity that is present and by imposing spectral shaping.
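
The idea of combining several taps with a fractional lag can be sketched as below. This is an illustration under stated assumptions, not the patented filter: linear interpolation stands in for the over-sampled interpolation filter, the taps are assumed to be centered on the lag, and `sample_at` and `ltp_predict` are hypothetical names.

```python
import math

def sample_at(signal, pos):
    """Linearly interpolate signal at fractional index pos; zero outside range."""
    k = math.floor(pos)
    frac = pos - k

    def get(i):
        return signal[i] if 0 <= i < len(signal) else 0.0

    return (1.0 - frac) * get(k) + frac * get(k + 1)

def ltp_predict(past, n, lag, betas):
    """Predict sample n as sum_i beta_i * past(n - lag - i), i = -K..K.

    lag may be fractional; len(betas) is assumed odd (2K + 1 taps).
    """
    half = len(betas) // 2
    return sum(
        b * sample_at(past, n - lag - (i - half))
        for i, b in enumerate(betas)
    )

ramp = [float(i) for i in range(20)]
single_tap = ltp_predict(ramp, 10, 2.5, [0.0, 1.0, 0.0])
three_tap = ltp_predict(ramp, 10, 2.5, [0.25, 0.5, 0.25])
```

With the fractional part of the delay carried by `lag`, the `betas` only need to set how strongly and with what spectral shape the past signal is reused, which mirrors the division of labor described above.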
Abstract:
In a speech encoder/decoder (200/300) a pitch delay contour endpoint modifier (208) is employed to shift the endpoints of a pitch delay interpolation curve up or down. Particularly, the endpoints of the pitch delay interpolation curve are shifted based on a variation and/or a standard deviation in pitch delay.
Abstract:
A speech coder (300) that performs analysis-by-synthesis coding of a signal determines gain parameters for each of multiple constituent components of a synthetic excitation signal (ex(n)). The speech coder generates a target vector (p(n)) based on an input signal (s(n)). The speech coder further generates multiple constituent components associated with the synthetic excitation signal, wherein one constituent component is based on a shifted version of another constituent component. The speech coder further evaluates an error criterion based on the target vector and the multiple constituent components to determine a gain associated with each constituent component.