Abstract:
A method and apparatus are provided for determining uncertainty in noise reduction based on a parametric model of speech distortion. The method is first used to reduce noise in a noisy signal. In particular, noise is reduced from a representation of a portion of a noisy signal to produce a representation of a cleaned signal by utilizing an acoustic environment model. The uncertainty associated with the noise reduction process is then computed. In one embodiment, the uncertainty of the noise reduction process is used, in conjunction with the noise-reduced signal, to decode a pattern state.
Abstract:
In a method for tracking pitch in a speech signal (200), first and second window vectors, xt, S¿t-p¿, are created from samples (414, 416, 418, 408, 410, 412) taken across first and second windows (402, 400) of the speech signal. The first window (402) is separated from the second window (400) by a test pitch period (406). The energy of the speech signal in the first window is combined with the correlation between the first window vector and the second window vector to produce a predictable energy factor. The predictable energy factor is then used to determine a pitch score for the test pitch period. Based in part on the pitch score, a portion of the pitch track is identified.
Abstract:
A warped spectral estimate of an original audio signal can be used to encode a representation of a fine estimate of the original signal. The representation of the warped spectral estimate and the representation of the fine estimate can be sent to a speech recognition system. The representation of the warped spectral estimate can be passed to a speech recognition engine, where it may be used for speech recognition. The representation of the warped spectral estimate can also be used along with the representation of the fine estimate to reconstruct a representation of the original audio signal.
Abstract:
A speech segment is indexed by identifying at least two alternative word sequences for the speech segment. For each word in the alternative sequences, information is placed in an entry for the word in the index. Speech units are eliminated from entries in the index based on a comparison of a probability that the word appears in the speech segment and a threshold value.
Abstract:
A method and apparatus to determine a channel response for an alternative sensor using an alternative sensor signal and an air conduction microphone signal (500). The channel response and a prior probabillity distuibution for clean speech valuse then used to estimate a clean speech value (502, 504, 506 and 508).