Abstract:
A method for animating an image is useful for animating avatars using real-time speech data. According to one aspect, the method includes identifying an upper facial part and a lower facial part of the image (step 705); animating the lower facial part based on speech data that are classified according to a reduced vowel set (step 710); tilting both the upper facial part and the lower facial part using a coordinate transformation model (step 715); and rotating both the upper facial part and the lower facial part using an image warping model (step 720).
Abstract:
A method and system for compressing handwritten character templates. The system includes a codebook generator module (105) for generating a codebook (125). The codebook (125) includes vectors defining the centers of clusters (115) of uncompressed model character feature vectors (110) provided from model character templates. A template compression module (120) is connected to the codebook generator module (105) for comparing the uncompressed model character feature vectors (110) with the codebook (125) to provide compressed templates of model characters (135). Optionally, a template matching module (140) is connected to the template compression module (120) for providing candidate characters (150) by comparing the distances between uncompressed input character feature vectors (130) and the model character templates.
Abstract:
A method and apparatus is provided for a low bit rate speech transmission. Speech spectral parameter vectors are generated from a voice message and stored in a sequence of speech spectral parameter vectors within a speech spectral parameter matrix. A first index identifying a first speech parameter template corresponding to a first speech spectral parameter vector of the sequence of speech spectral parameter vectors is transmitted. A subsequent speech spectral parameter vector of the sequence is selected and a subsequent speech parameter template is determined having a subsequent index. One or more intervening interpolated speech parameter templates are interpolated between the first speech parameter template and the subsequent speech parameter template. The one or more intervening speech spectral parameter vectors are compared to the corresponding one or more intervening interpolated speech parameter templates to derive a distance. The subsequent index is transmitted when the distance derived is less than or equal to a predetermined distance.
Abstract:
An apparatus and method for processing a voice message to provide low bit rate speech transmission processes the voice message to generate speech parameters which are arranged into a two dimensional parameter matrix (502) including a sequence of parameter frames. The two dimensional parameter matrix (502) is transformed using a predetermined two dimensional matrix transformation function (414) to obtain a two dimensional transform matrix (506). Distance values representing distances between templates of a set of predetermined templates and the two dimensional transform matrix (506) are then derived. The distance values derived are identified by indexes identifying the templates of the set of predetermined templates. The distance values derived are compared, and an index corresponding to a template of the set of predetermined templates having a shortest distance is selected and then transmitted.