Abstract:
A vocoder device and corresponding method characterizes and reconstructs speech excitation. An excitation analysis portion preforms a cyclic excitation transformation (52) process on a target excitation segment by rotating a peak amplitude to a beginning buffer location. The excitation phase representation is dealiased (54) using multiple dealiasing passes based on the phase slope variance. Both primary and secondary excitation components are characterized (56), where the secondary excitation is characterized based on a computation of the error (178) between the characterized primary excitation and the original excitation. Alternatively, an excitation pulse compression filter (50) is applied to the target, resulting in a symmetric target. The symmetric target is characterized (58) by normalizing half the symmetric target. The synthesis portion performs reconstruction (216, 218) and synthesis (220) of the characterized excitation based on the characterization method employed by the analysis portion.
Abstract:
A speech vocoder device and corresponding method parameterizes speech excitation waveforms. An analysis portion performs an excitation pulse compression process (46) which filters (64, 66, 68) an excitation template to produce compressed excitation from which excitation parameters are estimated (74). An optimal excitation target is selected using a closed-loop process (48) that selects the target based on a minimum error (158) between the original waveform and waveforms created by interpolating (156) between candidate targets. An adaptive excitation weighting function is created (178) based on the excitation target's features and a preselected characterization methodology, and the function is applied (180) to the excitation target. The excitation is characterized (52) and encoded (54) for digital transmission.
Abstract:
A speech vocoder device and corresponding method synthesizes speech excitation waveforms. The method entails reconstructing (216) an excitation target from decoded speech data, creating (220) aligned excitation segments by normalizing (296), correlating (298), and aligning (300) a source segment and a target segment, reconstructing normalized intervening segments by ensemble interpolating (318) between the source segment and the target segment, denormalizing (320) the normalized intervening segments, and reconstructing (322) an excitation waveform from the denormalized intervening segments, the source segment, and the target segment.