Abstract:
Perceptually relevant non-speech information can be preserved during encoding of an audio signal by determining whether the audio signal includes such information (122, 124, 125). If so, a speech/noise classification of the audio signal is overriden (43) to prevent misclassification of the audio signal as noise.
Abstract:
The quality of comfort noise generated by a speech decoder (93) during non-speech periods is improved by modifying (30, 75) comfort noise parameter values (33) normally used to generate the comfort noise. The comfort noise parameter values are modified in response to variability information (43) associated with a background noise parameter. The modified comfort noise parameter values (35) are then used to generate the comfort noise.
Abstract:
Cross talk terms may be removed from speech signals in a multi-party conference system having a plurality of users. In a preferred embodiment of the present invention, an estimator (210, 310) is used to first estimate the respective cross talk terms for each of the said plurality of users. These terms are then multiplied with a filter (230, 330) based on the estimated cross talk terms. Source speech signals are thereby recovered from the plurality of multi-party conference users without the cross talk.
Abstract:
A method and apparatus for partial redundancy encoding of a speech data packet is disclosed. The bits in the speech data packet (10) are sorted in a predetermined error sensitivity characteristic, order, level or degree of importance. Only those bits in the packet (10) which are considered to be most error sensitive are protected by redundant transmission. A partial set of redundant bits of the previously transmitted packets (10) are included with the data bit for current packet (10). The redundant bits are used at the receiver side to reconstruct damaged packets. By using only the most sensitive bits for redundancy, the additional required bandwith may be limited.
Abstract:
A linear predictive analysis-by-synthesis encoder includes a search algorithm block (50) and a vector quantizer (58) for vector quantizing optimal gains from a plurality of subframes in a frame. The internal encoder states are updated using (50, 52, 54, 56) the vector quantized gains.
Abstract:
A decoder improves delayed packet concealment in a packet network by using two decoder sections. A first decoder section (30) bases its decoding during the concealment phase on erroneous filter states and a set of speech parameters, whereas a second decoder section bases its decoding on saved (36) and updated filter states and the same speech parameters. The outputs of the two decoder sections are thereafter combined (34) to form the final speech signal. This decoding strategy produces a speech signal with smooth transitions from delayed to non-delayed packets and uses information from the most recent packets for speech generation.
Abstract:
The quality of comfort noise generated by a speech decoder (93) during non- speech periods is improved by modifying (30,75) comfort noise parameter values (33) normally used to generate the comfort noise. The comfort noise parameter values are modified in response to variability information (43) associated with a background noise parameter. The modified comfort noise parameter values (35) are then used to generate t he comfort noise.
Abstract:
The present invention advantageously provides a manner by which to further suppress noise superimposed upon an information signal without increasing distortion to the signal, e.g., speech. By distributing the noise suppression, the quality of the information signal provided to a listener is improved. In one embodiment, a first noise suppressor is employed at the transmitter to suppress noise superimposed upon an information signal prior to its transmission by the transmitter, and a second noise suppressor is employed at the receiver to suppress the noise component of a communication signal received at the receiver.
Abstract:
A decoder improves delayed packet concealment in a packet network by using two decoder sections. A first decoder section bases its decoding during the concealment phase on erroneous filter states and a set of speech parameters, whereas a second decoder section bases its decoding on saved and updated filter states and the same speech parameters. The outputs of the two decoder sections are thereafter combined to form the final speech signal. This decoding strategy produces a speech signal with smooth transitions from delayed to non-delayed packets and uses information from the most recent packets for speech generation.
Abstract:
A linear predictive analysis-by-synthesis encoder includes a search algorithm block (50) and a vector quantizer (58) for vector quantizing optimal gains from a plurality of subframes in a frame. The internal encoder states are updated (50, 52, 54, 56) using the vector quantized gains.